You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
No. I guess you doubt the training has failed because of standardization by zfilter, right? I checked but it was not the key to solving problem in my implementation.
I trained your ppo first.
python examples/ppo_gym.py --env-name Ant-v2 --save-model-interval 100
After 500 episodes, I made trajectories.
python gail/save_expert_traj.py --model-path assets/learned_models/Ant-v2_ppo.p
Last, I ran gail.
python gail/gail_gym.py --env-name Ant-v2 --expert-traj-path assets/expert_traj/Ant-v2_expert_traj.p
I implemented Gail and Vail, but I failed to train it too.(but hopper worked well)
Any Ideas?
The text was updated successfully, but these errors were encountered: