gitmyhub

AEnt

Python ★ 2 updated 8mo ago

An implementation of the regularization method "AEnt" introduced in "on entropy control in LLM-RL algorithms".

No plain-English explanation yet — one is being written right now. Check back in a minute.