Isomorphic Perturbation Testing (IPT)
Diagnose whether a model output is a genuine rule or a reward shortcut. A shortcut passes the standard verifier (extensional) but fails when object constants are renamed (isomorphic) — exposing that it memorised training instances rather than learning a generalizable rule.
Result
This interface evaluates one hypothesis at a time. Use the Python API for batch processing.