Reinforcement_Learning_PPO

Comparison of state-of-the-art reinforcement learning algorithms: DQN, DDPG, TRPO, ACER and PPO.

Other references

Experiments

Tests were run on three different environments: OpenAI Gym (MountainCarContinuous), MuJoCo (Reacher) and Atari (Breakout).

OpenAI Gym (MountainCarContinuous)

  • DQN (src/mountaincar-continuous/dqn and results/gym-mountaincarcontinuous/dqn)
  • DDPG (src/mountaincar-continuous/ddpg and results/gym-mountaincarcontinuous/ddpg)
  • PPO (src/mountaincar-continuous/ppo and results/gym-mountaincarcontinuous/ppo)

MuJoCo (Reacher)

  • DDPG (src/baselines/baselines/ddpg and results/mujoco-reacher/ddpg)
  • TRPO (src/baselines/baselines/trpo_mpi and results/mujoco-reacher/trpo)
  • PPO (src/baselines/baselines/ppo2 and results/mujoco-reacher/ppo)

Atari (Breakout)

  • ACER (src/baselines/baselines/acer and results/atari-breakout/acer)
  • TRPO (src/baselines/baselines/trpo_mpi and results/atari-breakout/trpo)
  • PPO (src/baselines/baselines/ppo2 and results/atari-breakout/ppo)
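The layout listed above can be summarised as a small lookup table, which may help when scripting over the experiments. This is only a sketch: the names `EXPERIMENTS` and `missing_paths` are illustrative and not part of the repository, and the paths are copied verbatim from the lists above.

```python
from pathlib import Path

# (environment, algorithm) -> (source directory, results directory).
# Paths are copied from the README lists; EXPERIMENTS is a hypothetical name.
EXPERIMENTS = {
    ("mountaincar-continuous", "dqn"): (
        "src/mountaincar-continuous/dqn", "results/gym-mountaincarcontinuous/dqn"),
    ("mountaincar-continuous", "ddpg"): (
        "src/mountaincar-continuous/ddpg", "results/gym-mountaincarcontinuous/ddpg"),
    ("mountaincar-continuous", "ppo"): (
        "src/mountaincar-continuous/ppo", "results/gym-mountaincarcontinuous/ppo"),
    ("reacher", "ddpg"): (
        "src/baselines/baselines/ddpg", "results/mujoco-reacher/ddpg"),
    ("reacher", "trpo"): (
        "src/baselines/baselines/trpo_mpi", "results/mujoco-reacher/trpo"),
    ("reacher", "ppo"): (
        "src/baselines/baselines/ppo2", "results/mujoco-reacher/ppo"),
    ("breakout", "acer"): (
        "src/baselines/baselines/acer", "results/atari-breakout/acer"),
    ("breakout", "trpo"): (
        "src/baselines/baselines/trpo_mpi", "results/atari-breakout/trpo"),
    ("breakout", "ppo"): (
        "src/baselines/baselines/ppo2", "results/atari-breakout/ppo"),
}


def missing_paths(repo_root="."):
    """Return the sorted, de-duplicated listed directories absent under repo_root."""
    root = Path(repo_root)
    missing = set()
    for src_dir, results_dir in EXPERIMENTS.values():
        for rel in (src_dir, results_dir):
            if not (root / rel).is_dir():
                missing.add(rel)
    return sorted(missing)
```

Calling `missing_paths()` from the repository root should return an empty list once the submodules have been fetched, assuming the directory names above match the checkout.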

Source code

To get the source code, execute the following commands:

git clone https://github.com/lajoiepy/Reinforcement_Learning_PPO.git
cd Reinforcement_Learning_PPO
git submodule init
git submodule update

Requirements

  • PyTorch for src/mountaincar-continuous/dqn and src/mountaincar-continuous/ppo.
  • TensorFlow for src/mountaincar-continuous/ddpg and src/baselines.
  • Gym for src/mountaincar-continuous.
  • MuJoCo and Atari for src/baselines.
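Before running anything, it can be useful to check which of these libraries are importable. The following is a minimal sketch under stated assumptions: the import names (`torch`, `tensorflow`, `gym`, `mujoco_py`, `atari_py`) are guesses at the Python module names behind the libraries listed above and may differ for your installed versions, and `REQUIREMENTS` and `missing_modules` are illustrative names, not part of the repository.

```python
import importlib.util

# Source directory -> Python modules it is assumed to need.
# The module names are assumptions; the README only names the libraries.
REQUIREMENTS = {
    "src/mountaincar-continuous/dqn": ["torch", "gym"],
    "src/mountaincar-continuous/ppo": ["torch", "gym"],
    "src/mountaincar-continuous/ddpg": ["tensorflow", "gym"],
    "src/baselines": ["tensorflow", "mujoco_py", "atari_py"],
}


def missing_modules(names):
    """Return the top-level module names that cannot be found on this interpreter."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def report():
    """Print, per source directory, which assumed modules are missing."""
    for directory, modules in REQUIREMENTS.items():
        missing = missing_modules(modules)
        status = "ok" if not missing else "missing " + ", ".join(missing)
        print(f"{directory}: {status}")
```

Running `report()` only checks that the modules can be located; it does not verify versions or that MuJoCo itself is licensed and installed.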

Execute tests

  • For code in src/baselines, follow the README files in src/baselines.
  • For PPO on Gym environments, run python3 src/mountaincar-continuous/pytorch-ppo/main.py --env-name MountainCarContinuous-v0.
  • For the DQN implementation run python3 mountaincar_dqn.py.
  • For the DDPG implementation run python3 mountaincar_ddpg.py.

About

Reinforcement learning with Proximal Policy Optimization (https://arxiv.org/abs/1707.06347)
