Skip to content

Commit 82429d8

Browse files
sjarmakclaude
andcommitted
fix: audit sweep — memory_mb, timeout guards, non-root user across 275 tasks
- Add explicit memory_mb to 219 task.toml files (4096 default, 8192 for TS/JS) - Add timeout 600 guards to 16 test.sh verifiers that lacked them - Add non-root user creation (adduser claude) to 42 baseline Dockerfiles for OpenHands and other non-root harness compatibility - Regenerate script registry and index All 275 canonical tasks now pass structural, MCP config, resource allocation, and harness compatibility checks. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent 0b08c9f commit 82429d8

File tree

280 files changed

+672
-10
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

280 files changed

+672
-10
lines changed

benchmarks/csb_org_compliance/ccx-compliance-052/task.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ category = "compliance-audit"
1212
language = "cpp"
1313
difficulty = "hard"
1414
time_limit_sec = 900
15+
memory_mb = 4096
1516
mcp_suite = "csb_org_compliance"
1617
use_case_id = 52
1718
repo_set_id = "envoy-service-mesh"

benchmarks/csb_org_compliance/ccx-compliance-053/task.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ category = "compliance-audit"
1212
language = "java"
1313
difficulty = "hard"
1414
time_limit_sec = 900
15+
memory_mb = 4096
1516
mcp_suite = "csb_org_compliance"
1617
use_case_id = 53
1718
repo_set_id = "apache-kafka-ecosystem"

benchmarks/csb_org_compliance/ccx-compliance-124/task.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ category = "compliance-audit"
1212
language = "cpp"
1313
difficulty = "hard"
1414
time_limit_sec = 900
15+
memory_mb = 4096
1516
mcp_suite = "csb_org_compliance"
1617
use_case_id = 124
1718
repo_set_id = "mozilla-firefox"

benchmarks/csb_org_compliance/ccx-compliance-182/task.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ category = "compliance-audit"
1212
language = "go"
1313
difficulty = "hard"
1414
time_limit_sec = 900
15+
memory_mb = 4096
1516
mcp_suite = "csb_org_compliance"
1617
use_case_id = 182
1718
repo_set_id = "kubernetes-ecosystem"

benchmarks/csb_org_compliance/ccx-compliance-183/task.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ category = "compliance-audit"
1212
language = "go"
1313
difficulty = "hard"
1414
time_limit_sec = 900
15+
memory_mb = 4096
1516
mcp_suite = "csb_org_compliance"
1617
use_case_id = 183
1718
repo_set_id = "kubernetes-ecosystem"

benchmarks/csb_org_compliance/ccx-compliance-184/task.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ category = "compliance-audit"
1212
language = "go"
1313
difficulty = "hard"
1414
time_limit_sec = 900
15+
memory_mb = 4096
1516
mcp_suite = "csb_org_compliance"
1617
use_case_id = 184
1718
repo_set_id = "kubernetes-ecosystem"

benchmarks/csb_org_compliance/ccx-compliance-187/task.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ category = "compliance-audit"
1212
language = "cpp"
1313
difficulty = "hard"
1414
time_limit_sec = 900
15+
memory_mb = 4096
1516
mcp_suite = "csb_org_compliance"
1617
use_case_id = 187
1718
repo_set_id = "envoy-service-mesh"

benchmarks/csb_org_compliance/ccx-compliance-189/task.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ category = "compliance-audit"
1212
language = "go"
1313
difficulty = "hard"
1414
time_limit_sec = 900
15+
memory_mb = 4096
1516
mcp_suite = "csb_org_compliance"
1617
use_case_id = 189
1718
repo_set_id = "grafana-observability"

benchmarks/csb_org_compliance/ccx-compliance-190/task.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ category = "compliance-audit"
1212
language = "go"
1313
difficulty = "hard"
1414
time_limit_sec = 900
15+
memory_mb = 4096
1516
mcp_suite = "csb_org_compliance"
1617
use_case_id = 190
1718
repo_set_id = "grafana-observability"

benchmarks/csb_org_compliance/ccx-compliance-191/task.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ category = "compliance-audit"
1212
language = "java"
1313
difficulty = "hard"
1414
time_limit_sec = 900
15+
memory_mb = 4096
1516
mcp_suite = "csb_org_compliance"
1617
use_case_id = 191
1718
repo_set_id = "apache-kafka-ecosystem"

0 commit comments

Comments
 (0)