the quick convergence proof for the CartPole-v0 #9

zhouwenchi · 2024-04-07T03:11:43Z

Hello, thank you for sharing. Your work has been very helpful to me!
I ran into an issue while training in the CartPole environment. Although training is faster, the reward keeps decreasing in the later stages of training, as shown in the figure. My hyperparameters are the same as in your example.
Can you tell me where the quick convergence proof is in the code? Thank you!
