Skip to content

[M3] Agent Evaluation Dataset + Tracing setup #22

@ComBba

Description

@ComBba

Overview

Set up Gradient Evaluations with test cases and enable Tracing for pipeline debugging.

Acceptance Criteria

  • Evaluation dataset (CSV) with 6+ test cases:
    • Simple CRUD app → expected GO
    • Stock trading AI → expected NO-GO
    • Portfolio website → expected GO
    • Video conferencing → expected CONDITIONAL
    • Blockchain voting → expected NO-GO
    • Recipe community → expected GO
  • Evaluations configured in Gradient console
  • Tracing enabled (automatic via LangGraph nodes)
  • Token usage tracked per run
  • Cost tracking per run

Dependencies

Reference

  • `docs/reference/10-technical-plan.md` — Agent Evaluation Dataset
  • `docs/reference/05-digitalocean-gradient-ai.md` — Evaluations, Tracing

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions