scrum: filter table-header words from bug_fingerprint extraction
Iter 11 surfaced "DeadCode:Flag" in the matrix: a noisy pattern_key where "Flag" is the table column HEADER kimi produces for structured review output, not an actual Rust identifier.

Kimi's standard format on recent iters:

    | # | Change                    | Flag       | Confidence |
    | 1 | Wire AgentIdentity into.. | Boundary.. | 92%        |

The extractor's KEYWORDS set already filtered Rust grammar words (self, mut, async, etc.) and the FLAG_VARIANTS themselves. Adding markdown-layout words (Flag, Change, Confidence, PRD, Plan) closes the last common noise class.

One-line addition, empirically validated against the iter 11 vectord trace that produced DeadCode:Flag. Future iters won't reproduce that specific noise.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
parent 08cc960115
commit ed85620558
@@ -842,6 +842,11 @@ Respond with markdown. Be specific, not generic. Cite file-region + PRD-chunk-of
     "const", "static", "impl", "trait", "struct", "enum", "use",
     "mod", "crate", "super", "match", "return", "Some", "None",
     "Ok", "Err", "true", "false",
+    // Markdown table column headers kimi outputs for structured
+    // reviews — "Flag" / "Change" / "Confidence" are layout words,
+    // not identifiers. Seen as noise in iter 11 vectord extraction
+    // ("DeadCode:Flag" pattern_key).
+    "Flag", "Change", "Confidence", "PRD", "Plan",
   ]);
   const filtered = codeTokens.filter(t => !FLAG_SET.has(t) && !KEYWORDS.has(t));
   if (filtered.length === 0) continue;
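
For reviewers who want to try the filter outside the script, here is a minimal sketch of how the hunk above behaves on a kimi-style table. Only the `filter` predicate and the visible `KEYWORDS` entries come from the diff. The `tokenize` helper, the contents of `FLAG_SET`, and the sample data row are assumptions made for illustration; the real extractor may tokenize differently and holds more keywords than this hunk shows.

```typescript
// Hypothetical stand-in: the real flag list (FLAG_VARIANTS) is not shown in this diff.
const FLAG_SET = new Set<string>(["DeadCode"]);

// Only the entries visible in the hunk; the real set also holds earlier entries
// (the commit message mentions self, mut, async, etc.).
const KEYWORDS = new Set<string>([
  "const", "static", "impl", "trait", "struct", "enum", "use",
  "mod", "crate", "super", "match", "return", "Some", "None",
  "Ok", "Err", "true", "false",
  // Markdown table column headers: layout words, not identifiers.
  "Flag", "Change", "Confidence", "PRD", "Plan",
]);

// Assumed tokenizer: collect identifier-like words from one line of review output.
function tokenize(line: string): string[] {
  return line.match(/[A-Za-z_][A-Za-z0-9_]*/g) ?? [];
}

// Same predicate as the diff: drop flag names and keyword/layout words.
function filterTokens(codeTokens: string[]): string[] {
  return codeTokens.filter(t => !FLAG_SET.has(t) && !KEYWORDS.has(t));
}

// Header row: every token is a layout word, so nothing survives.
const header = filterTokens(tokenize("| # | Change | Flag | Confidence |"));

// Hypothetical data row: the flag name is dropped, the other words remain.
const row = filterTokens(tokenize("| 1 | Drop unused parse_header | DeadCode | 90% |"));

console.log(header); // []
console.log(row);    // ["Drop", "unused", "parse_header"]
```

With the five header words in `KEYWORDS`, the header row filters down to an empty list, which is the case the `filtered.length === 0` guard skips, so no `DeadCode:Flag` style key can be built from it.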