faithcheck
Python
★ 1
updated 1mo ago
Evaluation harness for causal chain-of-thought step faithfulness: quantifies whether reasoning steps actually drive model outputs or are merely decorative