outer-value-function-meta-rl
Jupyter Notebook
★ 13
updated 2mo ago
Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
No plain-English explanation yet — one is being written right now. Check back in a minute.