Detectors
Detection modules for identifying metacognitive behaviors in AI systems.
Available Detectors
| Detector | Description |
|---|---|
| SandbaggingDetector | Detect context-dependent underperformance |
| SituationalAwarenessDetector | Probe for evaluation awareness |
| ObserverEffectMonitor | Monitor behavioral changes when observed |
Base Classes
Base detector class for metacognition pattern detection.
BaseDetector
Bases: ABC
Abstract base class for all detectors.
All detector implementations should inherit from this class and implement the detect method.
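As a rough illustration of this inheritance contract, the sketch below uses a stand-in `BaseDetector` that mirrors only the documented signature; the real class in `src/rotalabs_probe/detectors/base.py` may differ. `IncompleteDetector`, `EchoDetector`, and `can_instantiate` are hypothetical names introduced for the example.

```python
from abc import ABC, abstractmethod
from typing import Any, Dict


class BaseDetector(ABC):
    """Stand-in mirroring the documented interface; not the library's source."""

    @abstractmethod
    def detect(self, text: str) -> Dict[str, Any]:
        ...


class IncompleteDetector(BaseDetector):
    """Hypothetical subclass that forgets to implement detect()."""


class EchoDetector(BaseDetector):
    """Hypothetical subclass that implements detect()."""

    def detect(self, text: str) -> Dict[str, Any]:
        # Always reports "not detected"; only the text length is recorded.
        return {
            "detected": False,
            "confidence": 0.0,
            "details": {"length": len(text)},
        }


def can_instantiate(cls: type) -> bool:
    """Return True if cls() succeeds, False if Python rejects it as abstract."""
    try:
        cls()
    except TypeError:
        return False
    return True
```

With a base class built on `ABC` and `abstractmethod`, Python refuses to instantiate any subclass that leaves `detect` unimplemented, so the omission surfaces at construction time rather than at the first call.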
Source code in src/rotalabs_probe/detectors/base.py
__init__() -> None
detect(text: str) -> Dict[str, Any] (abstract method)
Detect metacognitive patterns in the given text.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
| text | str | The input text to analyze | required |
Returns:
| Type | Description |
|---|---|
| Dict[str, Any] | A dictionary of detection results with the keys: detected (bool indicating whether the pattern was found), confidence (float between 0 and 1), and details (additional information about the detection) |
Raises:
| Type | Description |
|---|---|
| NotImplementedError | If the method is not implemented |
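The return contract described above can be sketched with a small concrete detector. This is only an illustration under stated assumptions: `BaseDetector` here is a stand-in that mirrors the documented signature, and `PhraseDetector`, its `phrases` argument, and the confidence formula (fraction of phrases matched) are hypothetical and not part of the library.

```python
from abc import ABC, abstractmethod
from typing import Any, Dict, Sequence


class BaseDetector(ABC):
    """Stand-in mirroring the documented interface; not the library's source."""

    @abstractmethod
    def detect(self, text: str) -> Dict[str, Any]:
        raise NotImplementedError


class PhraseDetector(BaseDetector):
    """Hypothetical detector that flags text containing any configured phrase."""

    def __init__(self, phrases: Sequence[str]) -> None:
        super().__init__()
        if not phrases:
            raise ValueError("phrases must not be empty")
        self.phrases = [p.lower() for p in phrases]

    def detect(self, text: str) -> Dict[str, Any]:
        lowered = text.lower()
        matched = [p for p in self.phrases if p in lowered]
        return {
            # True if at least one phrase occurs in the text.
            "detected": bool(matched),
            # Assumed scoring: share of configured phrases found, in [0, 1].
            "confidence": len(matched) / len(self.phrases),
            # Free-form extra information about the detection.
            "details": {"matched_phrases": matched},
        }


detector = PhraseDetector(["being tested", "this is an evaluation"])
result = detector.detect("I suspect I am being tested right now.")
```

Here `result` holds the three documented keys, so a caller can handle any detector's output uniformly without knowing which subclass produced it.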
__repr__() -> str
Return string representation of the detector.
Returns:
| Type | Description |
|---|---|
| str | String representation |