
Conversation


@trial2onyx trial2onyx commented Oct 24, 2025

Description

Includes tenant_id with Braintrust traces, which allows for per-tenant usage tracking.

I'm not entirely sure why the custom processor doesn't capture the @traced decorator. As far as I can tell it should -- it appears to use the same span.log() machinery -- but I couldn't get those call sites to register with the custom processor at all. Defining the metadata alongside the decorator is non-ideal but also not too burdensome. There is also a non-zero chance this is exposing a bug or quirk in the braintrust SDK.

This also upgrades the braintrust SDK to the latest version, as that appeared to fix a bug relevant to the TenantContextTracingProcessor.

How Has This Been Tested?

Tested on existing invoke llm, stream llm, and fast_chat_turn traces and confirmed their metadata appears in Braintrust.

[Screenshot 2025-10-24 at 3:01:33 PM]

Additional Options

  • Override Linear Check

Summary by cubic

Add a custom Braintrust tracing processor that injects tenant_id into trace metadata and updates LLM trace decorators to include tenant_id, enabling per-tenant usage tracking. Also upgrade Braintrust SDK to v0.3.5 to resolve a processor issue.

  • New Features

    • TenantContextTracingProcessor injects tenant_id on trace start.
    • Added tenant_id from CURRENT_TENANT_ID_CONTEXTVAR to @traced for invoke llm, stream llm, and the clarifier.
    • Switched set_trace_processors to use the new processor.
  • Dependencies

    • braintrust[openai-agents]: 0.2.6 → 0.3.5
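For illustration, a minimal sketch of what a processor like this could look like. The class body, the trace's metadata attribute, and the stub trace below are assumptions for demonstration, not the PR's actual code (the real processor would subclass the Braintrust/openai-agents tracing processor and be registered via set_trace_processors):

```python
from contextvars import ContextVar
from typing import Any, Optional

# Assumed name, mirroring the PR; the real contextvar lives elsewhere in Onyx.
CURRENT_TENANT_ID_CONTEXTVAR: ContextVar[Optional[str]] = ContextVar(
    "current_tenant_id", default=None
)


class TenantContextTracingProcessor:
    """Copies the current tenant id into a trace's metadata on trace start.

    Illustrative only: the real implementation forwards to the parent
    processor after injecting the metadata.
    """

    def on_trace_start(self, trace: Any) -> None:
        tenant_id = CURRENT_TENANT_ID_CONTEXTVAR.get()
        if tenant_id is not None:  # handle the unset case explicitly
            if getattr(trace, "metadata", None) is None:
                trace.metadata = {}
            trace.metadata["tenant_id"] = tenant_id


# Tiny stand-in for a trace object, just to exercise the processor.
class _StubTrace:
    metadata: Optional[dict] = None


CURRENT_TENANT_ID_CONTEXTVAR.set("tenant-abc")
trace = _StubTrace()
TenantContextTracingProcessor().on_trace_start(trace)
print(trace.metadata)  # {'tenant_id': 'tenant-abc'}
```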

@trial2onyx trial2onyx requested a review from a team as a code owner October 24, 2025 22:05

vercel bot commented Oct 24, 2025

The latest updates on your projects.

Project | Status | Preview | Comments | Updated (UTC)
internal-search | Ready | Preview | Comment | Oct 25, 2025 1:52am


@trial2onyx trial2onyx force-pushed the jamison/braintrust-processor branch from 88ed676 to 383f7e2 on October 24, 2025 22:11
@trial2onyx trial2onyx (Collaborator, Author) commented:

@greptileai


@greptile-apps greptile-apps bot left a comment


Greptile Overview

Greptile Summary

This PR introduces per-tenant usage tracking for Braintrust traces by injecting tenant_id into trace metadata. The implementation uses a custom TenantContextTracingProcessor that adds tenant context when traces start, and updates @traced decorators across LLM invocation points to include tenant_id metadata.

Key changes:

  • Created TenantContextTracingProcessor that injects tenant_id from context variables into trace metadata
  • Fixed decorator metadata to use lambda functions, ensuring tenant_id is captured at runtime rather than at module import time (this addresses the issue from the parent commit)
  • Upgraded braintrust SDK from 0.2.6 to 0.3.5 to resolve processor-related bugs
  • Applied tenant_id tracking to invoke llm, stream llm, and clarifier stream and process traces

The PR correctly addresses the timing issue mentioned by the author - using metadata=lambda: defers the contextvar evaluation until function execution, which is the proper approach for capturing dynamic runtime context.
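That deferred-evaluation behavior is plain Python and can be sketched without Braintrust at all (the variable names here are illustrative):

```python
from contextvars import ContextVar
from typing import Optional

TENANT: ContextVar[Optional[str]] = ContextVar("tenant", default=None)

# Evaluated once at import time: captures the ContextVar's default,
# not the tenant of any later request.
eager_metadata = {"tenant_id": TENANT.get()}

# Evaluated per call: reads whatever the ContextVar holds right now.
lazy_metadata = lambda: {"tenant_id": TENANT.get()}

TENANT.set("tenant-123")
print(eager_metadata)   # {'tenant_id': None}
print(lazy_metadata())  # {'tenant_id': 'tenant-123'}
```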

Confidence Score: 5/5

  • Safe to merge - properly implements tenant tracking with correct timing semantics
  • The PR correctly fixes the timing issue from the parent commit by using lambda functions to defer contextvar evaluation. The custom processor implementation is straightforward and follows expected patterns. The braintrust SDK upgrade addresses a known bug. All changes are isolated to tracing infrastructure with no breaking changes to business logic.
  • No files require special attention

Important Files Changed

Filename | Score | Overview
--- | --- | ---
backend/onyx/tracing/braintrust_tracing.py | 4/5 | Adds TenantContextTracingProcessor to inject tenant_id into trace metadata; implementation is sound but could handle the None case more explicitly
backend/onyx/llm/interfaces.py | 5/5 | Updated @traced decorator with lambda metadata to correctly capture tenant_id at runtime; fixes timing issue from parent commit
backend/onyx/agents/agent_search/shared_graph_utils/llm.py | 5/5 | Updated @traced decorator with lambda metadata for runtime tenant_id capture; properly defers evaluation
backend/onyx/agents/agent_search/dr/nodes/dr_a0_clarification.py | 5/5 | Fixed @traced decorator to use lambda for tenant_id metadata; correctly defers contextvar evaluation until function execution
backend/requirements/default.txt | 5/5 | Upgrades braintrust SDK from 0.2.6 to 0.3.5 to fix a processor-related bug

Sequence Diagram

sequenceDiagram
    participant App as Application Code
    participant Dec as @traced Decorator
    participant Proc as TenantContextTracingProcessor
    participant Ctx as CURRENT_TENANT_ID_CONTEXTVAR
    participant BT as Braintrust SDK
    
    Note over App,BT: Setup Phase (module import)
    App->>Dec: Define @traced(metadata=lambda: {...})
    Note over Dec: Lambda NOT evaluated yet
    
    Note over App,BT: Runtime Phase (function call)
    App->>Dec: Call decorated function
    Dec->>Dec: Evaluate metadata lambda
    Dec->>Ctx: get() tenant_id
    Ctx-->>Dec: Return current tenant_id
    Dec->>Proc: on_trace_start(trace)
    Proc->>Ctx: get() tenant_id
    Ctx-->>Proc: Return current tenant_id
    Proc->>Proc: Set trace.metadata["tenant_id"]
    Proc->>BT: super().on_trace_start(trace)
    BT-->>Proc: Trace started
    Proc-->>Dec: Continue
    Dec->>App: Execute function
    App-->>Dec: Return result
    Dec->>BT: Log trace with metadata
    BT-->>Dec: Trace logged
    Dec-->>App: Return result

5 files reviewed, no comments



@cubic-dev-ai cubic-dev-ai bot left a comment


2 issues found across 5 files

Prompt for AI agents (all 2 issues)

Understand the root cause of the following 2 issues and fix them.


<file name="backend/onyx/llm/interfaces.py">

<violation number="1" location="backend/onyx/llm/interfaces.py:94">
Evaluating CURRENT_TENANT_ID_CONTEXTVAR.get() in the decorator runs at import time, so each span ends up with tenant_id stuck at the ContextVar’s default (often None) instead of the active request’s tenant. Move the lookup to runtime so it reads the ContextVar per invocation.</violation>
</file>

<file name="backend/onyx/agents/agent_search/dr/nodes/dr_a0_clarification.py">

<violation number="1" location="backend/onyx/agents/agent_search/dr/nodes/dr_a0_clarification.py:692">
`traced.metadata` expects a mapping (e.g., with a `tenant_id` key). Passing the raw context-var string loses the key/value structure, so the trace won’t include the tenant metadata. Wrap the value in a dict.</violation>
</file>
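The second violation is about shape rather than timing. A small sketch of the distinction (names illustrative, no Braintrust required):

```python
from contextvars import ContextVar
from typing import Optional

TENANT: ContextVar[Optional[str]] = ContextVar("tenant", default=None)
TENANT.set("acme")

# Problematic shape: a bare string, so downstream code looking for a
# "tenant_id" key has no mapping to read from.
bad_metadata = TENANT.get()

# Fix per the review: wrap the value in a dict so the metadata keeps
# its key/value structure.
good_metadata = {"tenant_id": TENANT.get()}

print(bad_metadata)   # acme
print(good_metadata)  # {'tenant_id': 'acme'}
```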


