Skip to content

Latest commit

 

History

History

hw4_RL

HW4 assignment for the UROB course

Introduction to deep reinforcement learning using policy gradient methods.

The file evaluation.ipynb walks through implementation steps of the individual functions that are located in solution.py. The file walker_training.ipynb then serves as a template for the walker policy implementation, though it is nearly empty. You are allowed and expected to use any code from evaluation.ipynb to train the WalkerPolicy.

Only the files solution.py and WalkerPolicy.py should be uploaded into the evaluation system in a zip file. You can however also include the learned weights of your WalkerPolicy implementation and load them at initialization.

This homework has been developed using Python 3.10. You are advised to use it as well to avoid complications. Required libraries are listed in requirements.txt .