|
| 1 | +--- |
| 2 | +title: "Skills" |
| 3 | +description: "Set up ZeroEval from inside Cursor, Claude Code, Codex, and other coding agents" |
| 4 | +--- |
| 5 | + |
| 6 | +Skills let your coding agent do the ZeroEval setup work for you. Instead of flipping between docs and your editor, you tell your agent "install zeroeval" or "create a judge" and the skill handles the rest in-context. The source lives at [zeroeval/zeroeval-skills](https://github.com/zeroeval/zeroeval-skills) on GitHub. |
| 7 | + |
| 8 | +They work with Cursor, Claude Code, Codex, and 30+ other agents that support the skills format. |
| 9 | + |
| 10 | +## Available skills |
| 11 | + |
| 12 | +<CardGroup cols={2}> |
| 13 | + <Card title="zeroeval-install" icon="download"> |
| 14 | + Installs the SDK (Python or TypeScript), verifies your first trace, migrates prompts to `ze.prompt`, and recommends starter judges. |
| 15 | + </Card> |
| 16 | + <Card title="create-judge" icon="gavel"> |
| 17 | + Helps you pick an evaluation type (binary or scored), write a judge template, define criteria, and create the judge via dashboard or API. |
| 18 | + </Card> |
| 19 | +</CardGroup> |
| 20 | + |
| 21 | +## Install |
| 22 | + |
| 23 | +<CodeGroup> |
| 24 | + |
| 25 | +```bash CLI (recommended) |
| 26 | +# Install all skills |
| 27 | +npx skills add zeroeval/zeroeval-skills |
| 28 | + |
| 29 | +# Install a specific skill |
| 30 | +npx skills add zeroeval/zeroeval-skills --skill zeroeval-install |
| 31 | + |
| 32 | +# List available skills |
| 33 | +npx skills add zeroeval/zeroeval-skills --list |
| 34 | +``` |
| 35 | + |
| 36 | +```bash Claude Code plugin |
| 37 | +# Add the marketplace |
| 38 | +/plugin marketplace add zeroeval/zeroeval-skills |
| 39 | + |
| 40 | +# Install a specific plugin |
| 41 | +/plugin install zeroeval-install@zeroeval-skills |
| 42 | +/plugin install create-judge@zeroeval-skills |
| 43 | + |
| 44 | +# Reload plugins if the new commands do not appear immediately |
| 45 | +/reload-plugins |
| 46 | +``` |
| 47 | + |
| 48 | +```bash Manual copy (Cursor / Claude Code) |
| 49 | +git clone https://github.com/zeroeval/zeroeval-skills.git |
| 50 | + |
| 51 | +# Cursor |
| 52 | +mkdir -p .cursor/skills |
| 53 | +cp -r zeroeval-skills/skills/zeroeval-install .cursor/skills/zeroeval-install |
| 54 | +cp -r zeroeval-skills/skills/create-judge .cursor/skills/create-judge |
| 55 | + |
| 56 | +# Claude Code |
| 57 | +mkdir -p .claude/skills |
| 58 | +cp -r zeroeval-skills/skills/zeroeval-install .claude/skills/zeroeval-install |
| 59 | +cp -r zeroeval-skills/skills/create-judge .claude/skills/create-judge |
| 60 | +``` |
| 61 | + |
| 62 | +</CodeGroup> |
| 63 | + |
| 64 | +<Note> |
| 65 | +On Windows without symlink support, use `npx skills` or the manual copy method. The `plugins/` directory in the repo contains symlinks that may not resolve on Windows. |
| 66 | +</Note> |
| 67 | + |
| 68 | +## Requirements |
| 69 | + |
| 70 | +- A ZeroEval account and API key — [zeroeval.com](https://zeroeval.com) |
| 71 | +- Python 3.8+ or Node 18+ |
| 72 | +- An LLM provider SDK (OpenAI, Vercel AI, LangChain, etc.) |
| 73 | + |
| 74 | +<Tip> |
| 75 | +The [zeroeval-skills GitHub repo](https://github.com/zeroeval/zeroeval-skills) has the latest skill content and deeper reference playbooks. This page covers discovery and install only. |
| 76 | +</Tip> |
| 77 | + |
| 78 | +## After installation |
| 79 | + |
| 80 | +Once installed, your coding agent picks up the skills automatically. Ask it to "set up zeroeval" or "create a judge" and it will use them. |
| 81 | + |
| 82 | +If you want to read the product docs directly: |
| 83 | + |
| 84 | +<CardGroup cols={2}> |
| 85 | + <Card title="Tracing quickstart" icon="rocket" href="/tracing/quickstart"> |
| 86 | + Get your first trace in under 5 minutes |
| 87 | + </Card> |
| 88 | + <Card title="Prompt setup" icon="wrench" href="/autotune/setup"> |
| 89 | + Add ze.prompt() to your Python or TypeScript codebase |
| 90 | + </Card> |
| 91 | + <Card title="Judges" icon="gavel" href="/judges/introduction"> |
| 92 | + How calibrated judges evaluate your production traffic |
| 93 | + </Card> |
| 94 | + <Card title="Judge setup" icon="sliders" href="/judges/setup"> |
| 95 | + Create and calibrate a judge |
| 96 | + </Card> |
| 97 | +</CardGroup> |
0 commit comments