Deep PILCO PyTorch Implementation

A reimplmentation of "Improving PILCO with Bayesian Neural Network Dynamics Models" by Yarin Gal et al. in PyTorch.

Average cost per iter. for the original MC Dropout and an Ensemble variant:

The Deep Ensembles variant's hyperparameters have not been optimised, hence the comparatively poor performce.

Even after an extensive hyperparameter search of the parameters not mentioned in the paper, the results obtained do not appear to quite match those obtained by original authors neither in [1] or [2].

Run

Install dependencies

pip install requirements.txt

Install this repository in development mode

From the root of this repository (.../deep-pilco-torch):

pip install -e .

Run training

python torchpilco/run/train_deep_pilco.py

Make rewards plot

python run_plot_rewards.py --log_dirs {runs/deep_pilco_XX runs/deep_pilco_XX2} --labels {label-for-logdir-1 label-for-logdir-2} --save_path {where to save}

Make trajectory plots

python run_plot_trajectories.py --log_dir {runs/deep_pilco_XX} --iter {chosen iteration} --save_path {where to save}

Plots of sample trajectories from the dynamics model

Sample trajectories using the trained policy at iteration 5: At iteration 40:

[1] Improving PILCO with Bayesian Neural Network Dynamics Models, Yarin Gal and Rowan Thomas McAllister and Carl Edward Rasmussen

[2] Uncertainty in Deep Learning

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
figures		figures
torchpilco		torchpilco
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep PILCO PyTorch Implementation

Average cost per iter. for the original MC Dropout and an Ensemble variant:

Run

Install dependencies

Install this repository in development mode

Run training

Make rewards plot

Make trajectory plots

Plots of sample trajectories from the dynamics model

About

Releases

Packages

Contributors 2

Languages

BrunoKM/deep-pilco-torch

Folders and files

Latest commit

History

Repository files navigation

Deep PILCO PyTorch Implementation

Average cost per iter. for the original MC Dropout and an Ensemble variant:

Run

Install dependencies

Install this repository in development mode

Run training

Make rewards plot

Make trajectory plots

Plots of sample trajectories from the dynamics model

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages