Skip to content

Latest commit

 

History

History
37 lines (26 loc) · 1.79 KB

File metadata and controls

37 lines (26 loc) · 1.79 KB

Security Notes

This is a public GitHub repository. The implementation is intentionally conservative.

Repo Hygiene

  • No secrets, API keys, tokens, or private endpoints are included.
  • .env, virtual environments, caches, editor junk, large result blobs, and local logs are ignored.
  • Output directories are explicit and the CLI refuses to overwrite a populated run directory.
  • The default backend assumption is local Ollama at http://localhost:11434/api with no authentication.
  • The only built-in tool is a deterministic local calculator; it is offline, explicit, and budgeted.

Runtime Safety

  • The harness does not auto-push, auto-sync, or call remote services.
  • Configs are loaded from plain YAML with explicit schema expectations.
  • The repo does not use unsafe eval.
  • The repo does not use pickle loading from untrusted files.
  • The Ollama adapter only targets a local base URL configured by the user; there is no cloud fallback.

Benchmark Asset Hygiene

  • Official GSM8K, MBPP, and HumanEval assets are not redistributed here.
  • The benchmark-style samples under data/samples/ are hand-authored miniature fixtures for plumbing tests only.
  • Real benchmark runs should document dataset provenance and licensing externally.
  • The current live run artifacts under results/runs/20260310_170449_* only contain hand-authored synthetic prompts, machine outputs, and structured logs.
  • Run artifacts are machine-readable and local-only: runs.jsonl, events.jsonl, summary.json, task_diffs.jsonl, task_diffs.csv, and summary.png.

Dependency Posture

The dependency set is intentionally small:

  • PyYAML for configs
  • matplotlib for simple figures
  • pytest for development tests

No external orchestration service, vector database, or model provider SDK is required for the included pilots.