Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.sigilix.ai/llms.txt

Use this file to discover all available pages before exploring further.

Four parallel specialists produce a lot of overlap. Without a synthesizer, that overlap shows up as 40 redundant comments on the PR. Core is the synthesizer that turns four streams into one signal.

The four stages

1. Collect

Specialists submit findings in a structured shape:
{
  "specialist": "warden",
  "path": "src/api/checkout.ts",
  "line": 142,
  "category": "security",
  "severity": "critical",
  "headline": "missing CSRF verification on POST /checkout",
  "body": "...",
  "confidence": 4
}
Each finding is scored 1-5 by the specialist for its own confidence. Findings missing required fields are dropped at this stage.

2. Cross-reference

Core performs structural-provenance checks against the source code to suppress hallucinations:
  • Line-validity check. Does the finding’s path:line actually exist in the diff? Hallucinated line numbers are dropped.
  • Symbol-resolution check. Does the function/variable referenced in the finding actually exist in the file? Hallucinated identifiers are dropped.
  • Pattern-match check. For security findings, does the claimed unsafe pattern (e.g., “passes user input to SQL template”) actually match the code? Pattern-mismatches are dropped or down-graded.
Findings that pass all three checks proceed. Findings that fail are silently suppressed — they never reach the PR.

3. Calibrate

Core then performs deduplication and severity calibration: Deduplication. Overlapping findings (same path:line from multiple specialists) are merged into one. The merged finding’s body draws from each specialist’s contribution; the severity is the maximum of the inputs. Severity calibration. Each finding’s severity is recalculated:
InputsFinal severity
1 specialist, low confidenceInfo
1 specialist, high confidenceWarning
2+ specialists, agreementWarning or Critical
Critical-tagged + structural check passedCritical
Critical-tagged + structural check skepticalWarning (down-graded)
Score-1 advisory cap. If more than 5 score-1 (Info) findings exist, Core picks the top 5 and aggregates the rest into a single “advisory nits” line in the summary. This prevents review-comment dilution.

4. Render

Core writes the final comment. The shape:
  1. Synthesizer summary — what was reviewed, how many findings survived, what verdict
  2. Inline findings — anchored to specific path:line, tagged by specialist + severity
  3. Suggested patches — included where Core’s structural check confirms a clean fix is in scope
Core never posts a finding inline — only specialists do. Core’s voice is the summary at the top.

Failure modes

Specialist 503s

If one specialist’s model returns 503 (overloaded), Sigilix’s cross-provider fallback kicks in: a different model on a different provider attempts the same prompt. If that fails too, the specialist’s findings are skipped — but Core still synthesizes from the remaining specialists. The verdict is still posted, marked with a footnote: _3 of 4 specialists succeeded._

Stale-head guards

If the user pushes a new commit while Sigilix is mid-review, the old review would be stale. Sigilix has two stale-head guards:
  • Before fan-out. If the PR’s head SHA changed since the webhook fired, abort.
  • Before posting. If the head SHA changed during specialist execution, abort and let the new webhook fire its own review.
Both guards prevent stale reviews from racing to post.

Submit failures

If GitHub rejects the inline-anchor positions in the review payload (typical 422 for a bad line number), Sigilix falls back to an anchorless review with all findings rolled into the body. The user sees one coherent review, just without inline anchors. This recovers verdicts that would otherwise be silently lost.

Why this beats single-agent review

A single-agent reviewer has no synthesis stage. It produces raw output and posts it. There’s no deduplication, no cross-reference, no calibration. Every false positive ships. Every redundant comment ships. Every hallucinated line number ships. Core is the difference. The four-stage pipeline is what makes Sigilix’s reviews readable.

Confidence Scoring

The numeric details behind ranking and suppression.

Review Lifecycle

Trigger conditions, pipeline stages, what happens on each push.