Open
Labels
- complexity:small (1-4 hours estimated)
- phase:agent (Agent backend: Python/ADK/LangGraph)
- priority:medium (Nice to have)
Description
Overview
Set up Gradient Evaluations with test cases and enable Tracing for pipeline debugging.
Acceptance Criteria
- Evaluation dataset (CSV) with 6+ test cases:
  - Simple CRUD app → expected GO
  - Stock trading AI → expected NO-GO
  - Portfolio website → expected GO
  - Video conferencing → expected CONDITIONAL
  - Blockchain voting → expected NO-GO
  - Recipe community → expected GO
- Evaluations configured in Gradient console
- Tracing enabled (automatic via LangGraph nodes)
- Token usage tracked per run
- Cost tracking per run
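
As a starting point for the dataset criterion, the six test cases listed above could be serialized to CSV along these lines. This is a minimal sketch, not the project's actual script: the column names `query` and `expected_response` are assumptions about what the Gradient console accepts and should be checked against `docs/reference/05-digitalocean-gradient-ai.md`, and the helper name `build_dataset_csv` is hypothetical.

```python
import csv
import io

# Verdicts that appear in the acceptance criteria.
VALID_VERDICTS = {"GO", "NO-GO", "CONDITIONAL"}

# (idea, expected verdict) pairs copied from the acceptance criteria.
TEST_CASES = [
    ("Simple CRUD app", "GO"),
    ("Stock trading AI", "NO-GO"),
    ("Portfolio website", "GO"),
    ("Video conferencing", "CONDITIONAL"),
    ("Blockchain voting", "NO-GO"),
    ("Recipe community", "GO"),
]


def build_dataset_csv(cases=TEST_CASES) -> str:
    """Return the evaluation dataset as CSV text (hypothetical helper)."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    # Assumed column names; adjust to whatever the evaluation tool expects.
    writer.writerow(["query", "expected_response"])
    for idea, verdict in cases:
        if verdict not in VALID_VERDICTS:
            raise ValueError(f"unknown verdict {verdict!r} for {idea!r}")
        writer.writerow([idea, verdict])
    return buf.getvalue()
```

Writing the returned string to a file (for example `evaluation_dataset.csv`, a placeholder name) gives something that can be uploaded by hand. In practice the `query` column would probably hold the full idea description sent to the pipeline rather than the short labels used here.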
Dependencies
- [M2] LangGraph StateGraph: wire all nodes + SSE streaming entrypoint #15 (complete pipeline to evaluate)
Reference
- `docs/reference/10-technical-plan.md` — Agent Evaluation Dataset
- `docs/reference/05-digitalocean-gradient-ai.md` — Evaluations, Tracing