gitmyhub

trpo

Jupyter Notebook ★ 0 updated 6y ago ⑂ fork

Trust Region Policy Optimization with TensorFlow and OpenAI Gym

No plain-English explanation yet — one is being written right now. Check back in a minute.