reinforcement-learning

Python ★ 285 updated 6y ago

Reinforcement learning baseline agent trained with the Asynchronous Advantage Actor-Critic (A3C) algorithm.
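As a rough illustration of the "advantage" part of A3C, the sketch below computes n-step discounted returns and the advantage estimates the actor is trained on. It is not taken from this repository: the function names, the `gamma` default, and the list-based inputs are assumptions chosen for illustration only.

```python
# Illustrative sketch only; names and defaults are hypothetical,
# not this repository's API.

def discounted_returns(rewards, bootstrap_value, gamma=0.99):
    """Return n-step discounted returns for one rollout.

    `bootstrap_value` is the critic's value estimate for the state after
    the last reward (use 0.0 if the episode terminated).
    """
    returns = []
    running = bootstrap_value
    # Walk the rollout backwards: G_t = r_t + gamma * G_{t+1}
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def advantages(rewards, values, bootstrap_value, gamma=0.99):
    """Advantage estimate A_t = G_t - V(s_t) for each step of the rollout."""
    returns = discounted_returns(rewards, bootstrap_value, gamma)
    return [g - v for g, v in zip(returns, values)]
```

In an actor-critic setup, the actor's policy-gradient term is typically weighted by these advantages, while the critic is regressed toward the returns.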
