outer-value-function-meta-rl

Jupyter Notebook ★ 13 updated 2mo ago

Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function

No plain-English explanation yet — one is being written right now. Check back in a minute.