Implements the MVP cutline from the planning artifact: - Phase A: skeleton + CLI dispatch + provider interface + stub model doctor - Phase B: scanner + git probe + 12 static analyzers + reporters + pipeline - Phase B fixtures: clean-repo, insecure-repo, degraded-repo 12 static analyzers per PROMPT.md "Suggested Static Checks For MVP": hardcoded_paths, shell_execution, raw_sql_interpolation, broad_cors, secret_patterns, large_files, todo_comments, missing_tests, env_file_committed, unsafe_file_io, exposed_mutation_endpoint, hardcoded_local_ip. Acceptance gates passing: - B1 (intake produces accurate counts) ✓ - B2 (insecure fixture fires ≥8 distinct check_ids — actually 11/12) ✓ - B3 (clean fixture produces 0 confirmed findings — no false positives) ✓ - B4 (scrum mode produces all 6 required markdown + JSON reports) ✓ - B5 (receipts.json marks degraded phases honestly) ✓ - F (self-review on this repo runs without crashing) ✓ — exit 66 (degraded because Phase C LLM review is hardcoded skipped) Phases C (LLM review), D (validation cross-check), E (memory + diff + rules subcommands) deferred per the cutline. The MVP delivers the evidence-first path; LLM is purely additive. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
3.3 KiB
Executable File
3.3 KiB
Executable File
Review Pipeline Specification
Purpose
This document defines the local review harness pipeline.
The pipeline exists to inspect a repository, collect evidence, identify risks, validate model claims, and generate operational reports without relying on cloud services.
Pipeline Overview
Repo Intake
-> Static Scan
-> Optional LLM Review
-> Validation
-> Report Generation
-> Memory Update
Phase 0: Repo Intake
Goal
Build a factual profile of the target repository.
Inputs
- repository path
- git metadata
- filesystem metadata
- dependency manifests
- build files
- test files
Required Output
reports/latest/repo-intake.json
Required Fields
{
"repo_path": "",
"current_branch": "",
"latest_commit": "",
"git_status": "",
"file_count": 0,
"language_breakdown": {},
"largest_files": [],
"dependency_manifests": [],
"test_manifests": [],
"generated_at": ""
}
Phase 1: Static Scan
Goal
Find evidence-backed problems without using an LLM.
Detection Targets
- hardcoded absolute paths
- unsafe shell execution
- raw SQL interpolation
- exposed mutation endpoints
- broad CORS
- unchecked file reads and writes
- suspicious secret patterns
- large files
- TODO, FIXME, HACK comments
- missing tests near critical modules
Required Output
reports/latest/static-findings.json
Phase 2: LLM Review
Goal
Use a local model to perform higher-level reasoning over bounded evidence chunks.
Rules
- Do not send the entire repository blindly.
- Chunk inputs by file, function, or diff boundary.
- Require strict JSON output.
- Retry invalid JSON once.
- Save degraded output if parsing fails.
- Never trust model claims without validation.
Required Output
reports/latest/llm-findings.raw.json
reports/latest/llm-findings.normalized.json
Phase 3: Validation
Goal
Validate every LLM-generated finding against real repository evidence.
Reject A Finding If
- the file does not exist
- the cited evidence does not exist
- the line hint is impossible
- the claim is unsupported
- the suggested fix targets unrelated code
- the model invents tests, commands, or files
Required Output
reports/latest/validated-findings.json
Phase 4: Report Generation
Goal
Produce human-readable and machine-readable reports.
Required Markdown Reports
reports/latest/scrum-test.md
reports/latest/risk-register.md
reports/latest/claim-coverage-table.md
reports/latest/sprint-backlog.md
reports/latest/acceptance-gates.md
Required JSON Receipt
reports/latest/receipts.json
Phase 5: Memory
Goal
Persist durable review knowledge for future runs.
Required Memory Files
.memory/review-rules.md
.memory/known-risks.json
.memory/fixed-patterns.json
.memory/project-profile.json
Memory Rules
- append-only by default
- version every update
- never silently overwrite
- record source run ID
- record evidence file
- record confidence level
Degraded Mode
A phase is degraded when it cannot fully run but the pipeline can continue.
Examples:
- Ollama unavailable
- model returns invalid JSON
- repository has no git metadata
- dependency manager unavailable
- large dataset missing
Degraded mode must be explicit in reports.
No silent success.