My Reinforcement Learning Experience

CartPole

My solution to the CartPole-v1 environment. I used a DDQN agent with uniform replay sampling.

Cartpole-v1 is considered solved when the agent scores at least 475 points on average over 100 consecutive episodes.
The maximum score possible in a single run is 500 points.