elk
Python
★ 220
updated 5d ago
Keeping language models honest by directly eliciting knowledge encoded in their activations.
No plain-English explanation yet — one is being written right now. Check back in a minute.