DescribeEvalOptions
Suite-level configuration for a harness-backed eval block.
Example
Section titled “Example”const options: DescribeEvalOptions< string, { status: "approved" | "denied" }, { expected: { status: "approved" | "denied" } }> = { harness: refundHarness, judges: [ToolCallJudge(), StructuredOutputJudge()], judgeThreshold: 1,};Type Parameters
Section titled “Type Parameters”TInput
Section titled “TInput”TInput = unknown
TOutput
Section titled “TOutput”TOutput extends JsonValue | undefined = JsonValue | undefined
TMetadata
Section titled “TMetadata”TMetadata extends HarnessMetadata = HarnessMetadata
THarness
Section titled “THarness”THarness extends Harness<TInput, TOutput, TMetadata> = Harness<TInput, TOutput, TMetadata>
Properties
Section titled “Properties”harness
Section titled “harness”harness:
THarness
Harness used for every explicit run(...) call in the suite.
judgeHarness?
Section titled “judgeHarness?”
optionaljudgeHarness?:JudgeHarness
Optional judge-side harness used only by judges that call ctx.runJudge(...).
judges?
Section titled “judges?”
optionaljudges?:Judge<JudgeContext<TInput,TOutput,TMetadata,THarness>>[]
Automatic judges applied after each successful run(...).
judgeThreshold?
Section titled “judgeThreshold?”
optionaljudgeThreshold?:number|null
Passing threshold for automatic suite-level judges. null disables fail-on-score.
skipIf?
Section titled “skipIf?”
optionalskipIf?: () =>boolean
Skips the entire eval suite when the predicate returns true.
Returns
Section titled “Returns”boolean