AEnt
Python
★ 2
updated 8mo ago
An implementation of the regularization method "AEnt", introduced in the paper "On Entropy Control in LLM-RL Algorithms".