reinforcement-learning
Python
★ 285
updated 6y ago
Reinforcement learning baseline agent trained with the Actor-critic (A3C) algorithm.
No plain-English explanation yet — one is being written right now. Check back in a minute.