Add commit hook perf test with control baseline and scaling analysis by evisdren · Pull Request #549 · entireio/cli

evisdren · 2026-02-27T20:10:25Z

Summary

Rewrites commit_hook_perf_test.go to compare control commits (no Entire) against commits with hooks active across 100/200/500 sessions
Seeds 75% of ENDED sessions with shadow branch refs (no LastCheckpointID) to match production behavior, where most sessions have unconsumed checkpoint data
Uses full-history clone with 200 seeded branches, packed refs, and unique base commits per session for realistic ref scanning and object resolution overhead
Adds docs/architecture/commit-hook-perf-analysis.md documenting findings

Key findings

PostCommit condensation is the dominant cost, not ref scanning:

Scenario	Sessions	Control	PrepareCommitMsg	PostCommit	Total Overhead
100	100	29ms	815ms	6.5s	7.3s
200	200	20ms	1.7s	14.6s	16.3s
500	500	29ms	4.4s	46.9s	51.3s

The 200-session result (16.3s) matches the real-world user report of ~16s for ~95 sessions, confirming the test methodology faithfully reproduces production overhead.

Cost breakdown per ENDED session (with shadow branch)

Condensation: ~30-50ms — tree building + commit on entire/checkpoints/v1 (dominant)
Ref lookups: ~2-4ms — 2-3 repo.Reference() calls across both hooks (packed-refs linear scan, no caching)
Content detection: ~2-5ms — transcript/overlap check
State I/O: ~0.5-1ms — JSON parse per session file

Highest-ROI optimizations

Batch condensation — condense all sessions in one commit instead of N commits
Session pruning — skip stale ENDED sessions during PostCommit
Batch ref resolution — load all refs into a map for O(1) lookups
Lazy condensation — defer to background process instead of blocking the commit

Test methodology evolution

Version	100 sess	Per-session	Issue
Shallow + shared base	1.74s	~18ms	Packfile too small, repeated ref scan
Full history + shared base	2.00s	~21ms	Same ref scanned N times
Full history + unique bases (cheap ENDED)	337ms	~3ms	ENDED had LastCheckpointID → no-ops
Full history + realistic ENDED (current)	7.3s	~73ms	Matches production

The critical fix was seeding 75% of ENDED sessions with shadow branch refs but no LastCheckpointID, forcing the full expensive path: ref lookup → commit/tree resolution → content detection → PostCommit condensation.

Test plan

go build -tags hookperf ./cmd/entire/cli/strategy/ compiles
go vet -tags hookperf ./cmd/entire/cli/strategy/ passes
go test -v -run TestCommitHookPerformance -tags hookperf -timeout 15m ./cmd/entire/cli/strategy/ passes with results matching real-world reports

🤖 Generated with Claude Code

cursor · 2026-02-27T20:10:30Z

PR Summary

Low Risk
Adds a test-only (build-tagged) performance benchmark plus documentation; no production logic changes, with the main risk being flaky/manual execution due to external git/GitHub and local .git/entire-sessions requirements.

Overview
Introduces a new build-tagged Go perf test (commit_hook_perf_test.go, //go:build hookperf) that benchmarks Entire’s prepare-commit-msg and post-commit hook overhead by comparing a baseline git commit (no Entire settings/hooks) vs. a commit with ManualCommitStrategy.PrepareCommitMsg + PostCommit across 100/200/500 seeded sessions in a locally-cloned repo.

Adds docs/architecture/commit-hook-perf-analysis.md summarizing measured results and attributing the linear per-session cost primarily to repeated go-git ref lookups (e.g., repo.Reference() in session listing/content checks), with a short list of suggested optimization directions.

^{Written by Cursor Bugbot for commit dfdf52a. Configure here.}

Copilot

Pull request overview

Adds a reproducible (tagged) performance test and an accompanying analysis document to quantify and explain the overhead of Entire’s commit hooks as session count scales.

Changes:

Add hookperf-tagged Go test that measures control commits vs PrepareCommitMsg + PostCommit across multiple session counts, using seeded branches/packed refs and real session templates.
Add architecture documentation summarizing results and attributing dominant costs (notably repeated repo.Reference() calls), plus optimization opportunities.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 6 comments.

File	Description
docs/architecture/commit-hook-perf-analysis.md	Documents measured hook overhead, scaling behavior, and suspected hotspots/optimizations.
cmd/entire/cli/strategy/commit_hook_perf_test.go	Implements the `hookperf` performance test harness (repo cloning, branch seeding, session seeding, timing).

docs/architecture/commit-hook-perf-analysis.md

cmd/entire/cli/strategy/commit_hook_perf_test.go

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

Comment @cursor review or bugbot run to trigger another review on this PR

cmd/entire/cli/strategy/commit_hook_perf_test.go

Rewrites commit_hook_perf_test.go to compare control commits (no Entire) against commits with hooks active across 100/200/500 sessions. Uses real session templates from .git/entire-sessions/, seeds 200 branches with packed refs for realistic ref scanning. Documents findings: ~18ms/session linear scaling dominated by repo.Reference() calls in listAllSessionStates and filterSessionsWithNewContent. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Entire-Checkpoint: fd2fcba3de23

Shallow clone (--depth 1) produces a ~900KB packfile vs ~50-100MB for a real repo, understating go-git object resolution costs by ~15%. Switch to --single-branch (full history, one branch) to get a realistic packfile while keeping clone time reasonable (~5s vs timeout on full clone). Updated analysis doc with new numbers: ~21ms/session (was ~18ms). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Entire-Checkpoint: 1c1c8fb25717

… test Previous test used 12 templates with shared BaseCommit (HEAD), causing listAllSessionStates to scan packed-refs for the same nonexistent shadow branch ref hundreds of times — inflating per-session cost from ~3ms to ~21ms. Now each session gets a unique base commit from real repo history (via git log walk), varied FilesTouched, diverse agent types, and unique prompts. Drops template dependency entirely. Results: ~3ms/session (was ~21ms), 500 sessions adds ~1.5s overhead. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Entire-Checkpoint: de85e10839ec

The perf test was 50x too low because all ENDED sessions had LastCheckpointID set (trivial no-ops). In production, ~75% of ENDED sessions have shadow branches with data but NO LastCheckpointID, exercising the full expensive path: ref lookup → commit/tree resolution → transcript/overlap check → PostCommit condensation. Changes: - Create alias shadow branch refs for 75% of ENDED sessions - Add perfLargeFileSets (30-80 files) matching production FilesTouched sizes - Include "perf_control.txt" in FilesTouched for staged-file overlap detection - Update analysis doc with corrected numbers and condensation insights Results now match real-world user report (~16s for ~95 sessions): 100 sessions: 7.3s (was 337ms) 200 sessions: 16.3s (was 617ms) 500 sessions: 51.4s (was 1.5s) PostCommit condensation is the dominant cost (~50-80ms/session). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Entire-Checkpoint: da2c31e68843

Copilot AI review requested due to automatic review settings February 27, 2026 20:10

evisdren requested a review from a team as a code owner February 27, 2026 20:10

Copilot started reviewing on behalf of evisdren February 27, 2026 20:10 View session

Copilot AI reviewed Feb 27, 2026

View reviewed changes

cursor bot reviewed Feb 27, 2026

View reviewed changes

cmd/entire/cli/strategy/commit_hook_perf_test.go Show resolved Hide resolved

evisdren and others added 4 commits March 4, 2026 10:59

gtrrz-victor force-pushed the sessionPruning branch from c55fbb8 to 80e956d Compare March 3, 2026 23:59

gtrrz-victor approved these changes Mar 4, 2026

View reviewed changes

gtrrz-victor merged commit 54182af into main Mar 4, 2026
3 checks passed

gtrrz-victor deleted the sessionPruning branch March 4, 2026 00:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add commit hook perf test with control baseline and scaling analysis#549

Add commit hook perf test with control baseline and scaling analysis#549
gtrrz-victor merged 4 commits intomainfrom
sessionPruning

evisdren commented Feb 27, 2026 •

edited

Loading

Uh oh!

cursor bot commented Feb 27, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

3 participants

Conversation

evisdren commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Key findings

Cost breakdown per ENDED session (with shadow branch)

Highest-ROI optimizations

Test methodology evolution

Test plan

Uh oh!

cursor bot commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Summary

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

3 participants

evisdren commented Feb 27, 2026 •

edited

Loading

cursor bot commented Feb 27, 2026 •

edited

Loading