Skip to content

Files

Latest commit

385b6e5 · Apr 27, 2019

History

History
 
 

hw2

CS294-112 HW 2: Policy Gradient

Dependencies:

  • Python 3.5
  • Numpy version 1.14.5
  • TensorFlow version 1.10.5
  • MuJoCo version 1.50 and mujoco-py 1.50.1.56
  • OpenAI Gym version 0.10.5
  • seaborn
  • Box2D==2.3.2

Before doing anything, first replace gym/envs/box2d/lunar_lander.py with the provided lunar_lander.py file.

The only file that you need to look at is train_pg_f18.py, which you will implement.

See the HW2 PDF for further instructions.