You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
docs: nightly research report 2026-03-19 (report #14)
New findings from fresh code audit pass targeting previously unreviewed scripts:
- Hardcoded-path epidemic expanded: fix_memory_mb.py:8, extract_build_diary.py:121,
plot_build_diary_supplementary.py:121+ all use /home/stephanie_jarmak/CodeScaleBench
- Deprecated claude-opus-4-5-20251101 in shell scripts (rerun_fixed_tasks.sh:34,
rerun_zero_mcp_tasks.sh:29) — missed when Python/skill files were updated
- configs/run_selected_tasks.sh:648,699,711: mktemp+mv race condition (mv failure
swallowed by subshell, subsequent cp targets missing directory)
- csb_metrics/extractors.py:669: FD leak via pathlib.Path.open() form (missed by
SIM115 grep sweep that only caught json.load(open(...)) pattern)
Recommended next feature: scripts/fix_common_patterns.py — automated mass-fix for
all known safe mechanical bugs (~60% backlog reduction in one pass).
Updates ROOT_AGENT_GUIDE.md with new learnings; condensed Validation/Scoring,
Scripts/Code Quality, and Skills/Automation sections to stay under 12,288-byte limit.
-`verification_modes`/`use_case_category` missing from all 274 tasks; `--use-case-category` silently returns 0.
105
105
106
106
### Validation / Scoring
107
-
-`validators.py` duplicated across`ccb_build` tasks. Changes must hit **all copies** (`sha256sum`).
108
-
- Agent <2s = never ran. `no_changes_guard`: write `reward.txt` in Python, not bash. `timeout 600` on runners; `--forceExit` for Jest; Jest+TS: `memory_mb = 8192`.
-**Falsy bugs**: `max_score=0` as false; `None` MCP metrics misclassified. `promote_run.py` crashes on non-dict env. `generate_eval_report.py:147,1005``mcp_mode or config_name` falls through on empty string (both sites).
115
-
-`models.py``from_dict()` mutates caller's dict via `.pop()`.
107
+
-`validators.py` duplicated in`ccb_build`; update all copies (`sha256sum`).
108
+
- Agent <2s = never ran. `reward.txt` in Python. `timeout 600`; Jest `--forceExit`; `memory_mb=8192`.
-`verification_modes`/`use_case_category` missing from all 274 tasks; `--use-case-category` silently returns 0.
105
105
106
106
### Validation / Scoring
107
-
-`validators.py` duplicated across`ccb_build` tasks. Changes must hit **all copies** (`sha256sum`).
108
-
- Agent <2s = never ran. `no_changes_guard`: write `reward.txt` in Python, not bash. `timeout 600` on runners; `--forceExit` for Jest; Jest+TS: `memory_mb = 8192`.
-**Falsy bugs**: `max_score=0` as false; `None` MCP metrics misclassified. `promote_run.py` crashes on non-dict env. `generate_eval_report.py:147,1005``mcp_mode or config_name` falls through on empty string (both sites).
115
-
-`models.py``from_dict()` mutates caller's dict via `.pop()`.
107
+
-`validators.py` duplicated in`ccb_build`; update all copies (`sha256sum`).
108
+
- Agent <2s = never ran. `reward.txt` in Python. `timeout 600`; Jest `--forceExit`; `memory_mb=8192`.
0 commit comments