REINFORCE-Algorithm-CartPole
Jupyter Notebook
★ 0
updated 1y ago
This project implements the REINFORCE algorithm to solve the CartPole-v1 environment. The algorithm uses policy gradients to optimize the agent’s performance by reinforcing actions that lead to higher returns.
No plain-English explanation yet — one is being written right now. Check back in a minute.