Policy Gradient algorithm based on Monte Carlo exploration.
You can use the arrow keys to control the car by yourself.
Click on "Load trained agent" to load the model. Then click on "Play" to see the result.
The left window gives you an overview of what the autonomous vehicle (in red) sees.
The algorithm is implemented here using the metacar environment.