Skip to content

Agent Skill

Install the vitest-evals agent skill when you want Claude Code, Cursor, Codex, or another coding agent to author, review, or debug harness-backed eval suites in your project.

The skill gives the agent focused context about suite authoring, harness selection, judges, tool replay, and verification — so it writes idiomatic vitest-evals code without guessing at the API.

Terminal window
npx @sentry/dotagents add getsentry/vitest-evals

dotagents is the recommended installer. If your environment already uses Vercel Skills, use the Vercel Skills tab instead.

  • Choose the right harness for the app runtime.
  • Use describeEval(...) with one harness per suite.
  • Call run(...) explicitly inside each eval.
  • Assert on result.output and use judges for scoring.
  • Configure tool replay without hiding live model behavior.
  • Run focused verification commands after edits.

The skill installs as a single directory with:

  • SKILL.md — activation rules, runtime defaults, and a reference router that points the agent at the right guide for the current task.
  • Focused references — suite authoring, AI SDK harness, Pi harness, custom harnesses, judges and assertions, tool replay, and troubleshooting.
  • SPEC.md and SOURCES.md — maintenance notes and source provenance for future skill updates.