[3604] Surface mutating tool evidence status in the interactive TUI by njfio · Pull Request #3606 · njfio/Tau

njfio · 2026-03-20T03:25:17Z

Closes #3604

Spec: specs/3604/spec.md

What/why:

surface mutating tool evidence in the interactive TUI during active build/create turns
reset evidence per turn so prior successful writes do not leak into new turns
keep non-build and idle turns quiet
clear impacted package quality-gate blockers required for merge

Test evidence:

cargo test -p tau-tui 3604 -- --nocapture
cargo test -p tau-tui
cargo clippy -p tau-tui --all-targets --all-features -- -D warnings
cargo clippy -p tau-tools --all-targets --all-features -- -D warnings
cargo test -p tau-coding-agent regression_spec_3555_c01_run_local_runtime_uses_cli_request_timeout_for_agent -- --nocapture
./scripts/dev/fast-validate.sh --base origin/master

# Conflicts: # crates/tau-tui/src/interactive/mod.rs

greptile-apps

njfio has reached the 50-review limit for trial accounts. To continue receiving code reviews, upgrade your plan.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 9a11f5fbe7

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-20T03:28:28Z

crates/tau-tui/src/interactive/ui_chat_tool_lines.rs

    let Some(entry) = app.tools.latest_entry() else {
-        return Vec::new();
+        return lines;
    };

-    terminal_summary_lines(entry)
+    lines.extend(terminal_summary_lines(entry));


Limit last-tool summary to current turn in build mode

This still appends terminal_summary_lines from app.tools.latest_entry() (global history), so a new build/create turn with no current-tool activity can show both Build status: no mutating evidence yet and Last tool: write from a prior turn. In the context of this change (resetting evidence per turn), that leaks prior mutating evidence back into the active-turn surface and gives contradictory operator guidance. Use current-turn entries for the terminal summary (or suppress it until the current turn has tool events) when build status is shown.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-20T03:28:28Z

crates/tau-tui/src/interactive/app.rs

+    pub fn current_turn_tools(&self) -> &[ToolEntry] {
+        let start = self.current_turn_tool_start.min(self.tools.entries().len());
+        &self.tools.entries()[start..]


Keep current-turn tool window aligned after truncation

current_turn_tools() slices by a stored vector index, but ToolPanel evicts from the front once history exceeds 200 entries. During long turns, that start index is not adjusted as evictions happen, so early tool events from the same turn are dropped from the slice; a turn with an early successful write/edit can later be misclassified as read-only or missing. The turn boundary needs to be tracked in a truncation-safe way (e.g., monotonic IDs or eviction-aware offset updates).

Useful? React with 👍 / 👎.

Copilot

Pull request overview

Adds a “Build status” banner to the interactive TUI chat surface during active build/create turns, indicating whether the current turn has no successful tool evidence yet, is still read-only, or has confirmed mutating evidence—while resetting evidence per user turn.

Changes:

Introduce build_status classification and render it in the chat summary strip (only for non-idle build/create turns).
Track “current turn” tool entries in App so prior-turn successful mutations don’t affect the next turn’s status.
Add integration-style ratatui render-path tests covering all banner states and the per-turn reset behavior.

Reviewed changes

Copilot reviewed 14 out of 14 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
tasks/todo.md	Roadmap status date bump.
tasks/tau-vs-ironclaw-gap-list.md	Status snapshot date bump.
specs/3604/spec.md	New spec capturing ACs, failure modes, and test evidence.
crates/tau-tui/src/interactive/build_status.rs	New build/create prompt + tool-evidence classifier and unit tests.
crates/tau-tui/src/interactive/ui_chat_tool_lines.rs	Prepends build-status banner lines to the chat summary tool lines.
crates/tau-tui/src/interactive/app.rs	Tracks per-turn tool slice start; exposes `current_turn_tools()` and `latest_user_prompt()`.
crates/tau-tui/src/interactive/ui_build_status_tests.rs	New render-path tests validating banner states and turn reset.
crates/tau-tui/src/interactive/app_commands.rs	Routes user submissions through `App::push_message()` to trigger per-turn reset.
crates/tau-tui/src/interactive/{chat,input,tools}.rs	Adds `Default` impls for panel structs.
crates/tau-tui/src/interactive/mod.rs	Wires new module + tests.
crates/tau-tools/src/tools.rs	Small refactors using `is_some_and` and simplified recursion guard.
crates/tau-coding-agent/src/tests/auth_provider/runtime_and_startup.rs	Adds `restore_current_dir` helper to make tests more robust.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-20T03:30:34Z

crates/tau-tui/src/interactive/app.rs

+        let start = self.current_turn_tool_start.min(self.tools.entries().len());
+        &self.tools.entries()[start..]


current_turn_tools() slices tools.entries() using current_turn_tool_start captured from tools.total_count(). Since ToolPanel::add_entry() can evict the oldest entries when max_entries is exceeded, indices can shift during a turn, causing current_turn_tool_start to point too far into the truncated buffer and incorrectly drop early tool entries from the current turn (breaking per-turn evidence classification after long sessions). Consider tracking a monotonic tool sequence/turn id (e.g., store an ever-increasing counter on each ToolEntry or in ToolPanel and filter by seq >= turn_start_seq), or adjust the start marker when eviction occurs so it remains aligned with the in-memory entries buffer.

Suggested change

let start = self.current_turn_tool_start.min(self.tools.entries().len());

&self.tools.entries()[start..]

// Interpret `current_turn_tool_start` as a global sequence number from

// `ToolPanel::total_count()`, not as a direct index into the current

// in-memory entries buffer. Since `ToolPanel` may evict old entries,

// we translate that sequence number into an index relative to the

// current sliding window.

let entries = self.tools.entries();

if entries.is_empty() {

return entries;

}

let total = self.tools.total_count();

// Global sequence number of the first entry currently in memory.

let first_seq = total.saturating_sub(entries.len());

let start_seq = self.current_turn_tool_start;

let start_index = if start_seq <= first_seq {

// Either all in-memory entries are from the current turn, or

// earlier entries from this turn have been evicted. In both

// cases, we start from the beginning of the buffer.

0

} else {

start_seq - first_seq

};

&entries[start_index..]

Copilot · 2026-03-20T03:30:35Z

crates/tau-tui/src/interactive/build_status.rs

+    matches!(
+        word.to_ascii_lowercase().as_str(),
+        "build" | "create" | "implement" | "make"
+    )


is_build_verb() calls to_ascii_lowercase() for every token, allocating a new String each time current_build_status() runs. Since this is invoked during rendering (potentially every tick), this can create avoidable per-frame allocations for long prompts. Consider using eq_ignore_ascii_case() (e.g., word.eq_ignore_ascii_case("build")) or pre-normalizing without allocation to keep render-path overhead predictable.

Suggested change

matches!(

word.to_ascii_lowercase().as_str(),

"build" | "create" | "implement" | "make"

)

word.eq_ignore_ascii_case("build")

|| word.eq_ignore_ascii_case("create")

|| word.eq_ignore_ascii_case("implement")

|| word.eq_ignore_ascii_case("make")

njfio added 8 commits March 19, 2026 23:04

docs(3604): add clean master port spec for mutating evidence status

2796507

test(3604): red tests for mutating evidence status on master

3641fcf

# Conflicts: # crates/tau-tui/src/interactive/mod.rs

feat(3604): surface mutating evidence status in interactive tui

0a53ba3

refactor(3604): tighten turn tracking and build prompt classification

af8437e

integrate(3604): verify build evidence resets across turns

ed064e4

docs(3604): finalize spec and satisfy tau-tui quality gates

9a3dfa3

refactor(3604): clear impacted tau-tools clippy blockers

442b362

refactor(3604): harden runtime startup cwd restoration test

9a11f5f

Copilot AI review requested due to automatic review settings March 20, 2026 03:25

greptile-apps bot reviewed Mar 20, 2026

View reviewed changes

Copilot started reviewing on behalf of njfio March 20, 2026 03:26 View session

chatgpt-codex-connector bot reviewed Mar 20, 2026

View reviewed changes

njfio merged commit f9cdd98 into master Mar 20, 2026
7 checks passed

Copilot AI reviewed Mar 20, 2026

View reviewed changes

njfio deleted the 3604-tui-mutating-evidence-status-pr branch March 20, 2026 11:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[3604] Surface mutating tool evidence status in the interactive TUI#3606

[3604] Surface mutating tool evidence status in the interactive TUI#3606
njfio merged 8 commits intomasterfrom
3604-tui-mutating-evidence-status-pr

njfio commented Mar 20, 2026

Uh oh!

greptile-apps bot left a comment

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Mar 20, 2026

Uh oh!

chatgpt-codex-connector bot Mar 20, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 20, 2026

Uh oh!

Copilot AI Mar 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		let start = self.current_turn_tool_start.min(self.tools.entries().len());
		&self.tools.entries()[start..]

-        let start = self.current_turn_tool_start.min(self.tools.entries().len());
-        &self.tools.entries()[start..]
+        // Interpret `current_turn_tool_start` as a global sequence number from
+        // `ToolPanel::total_count()`, not as a direct index into the current
+        // in-memory entries buffer. Since `ToolPanel` may evict old entries,
+        // we translate that sequence number into an index relative to the
+        // current sliding window.
+        let entries = self.tools.entries();
+        if entries.is_empty() {
+            return entries;
+        }
+        let total = self.tools.total_count();
+        // Global sequence number of the first entry currently in memory.
+        let first_seq = total.saturating_sub(entries.len());
+        let start_seq = self.current_turn_tool_start;
+        let start_index = if start_seq <= first_seq {
+            // Either all in-memory entries are from the current turn, or
+            // earlier entries from this turn have been evicted. In both
+            // cases, we start from the beginning of the buffer.
+        } else {
+            start_seq - first_seq
+        };
+        &entries[start_index..]

-    matches!(
-        word.to_ascii_lowercase().as_str(),
-        "build" | "create" | "implement" | "make"
-    )
+    word.eq_ignore_ascii_case("build")
+        || word.eq_ignore_ascii_case("create")
+        || word.eq_ignore_ascii_case("implement")
+        || word.eq_ignore_ascii_case("make")

Conversation

njfio commented Mar 20, 2026

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants