pg_pong

Python ★ 0 updated 8y ago

Train ATARI pong agent by stochastic policy gradient method from raw playing images.

No plain-English explanation yet — one is being written right now. Check back in a minute.