gitmyhub

elk

Python ★ 220 updated 5d ago

Keeping language models honest by directly eliciting knowledge encoded in their activations.

No plain-English explanation yet — one is being written right now. Check back in a minute.