Judges¶
Evaluate attack success.
Judge¶
Bases: ABC
Abstract base class for response judges.
Source code in src/rotalabs_redqueen/llm/judge.py
judge(stimulus: Stimulus, transcript: Transcript) -> JudgeResult
abstractmethod
async
¶
JudgeResult¶
Result of judging a response (redqueen-spec types.md §JudgeResult).
Source code in src/rotalabs_redqueen/llm/judge.py
HeuristicJudge¶
Bases: Judge
Fast, deterministic heuristic judge using keyword matching.
Source code in src/rotalabs_redqueen/llm/judge.py
LLMJudge¶
Bases: Judge
LLM-based judge: uses a separate model to score the response.