Skip to content
/ dapo Public

Source code for the paper "Divergence-Augmented Policy Optimization"

Notifications You must be signed in to change notification settings

lns/dapo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Distributed Accelerated Reinforcement Learning

This is an implementation of distributed reinforcement learning, used in several published work including Divergence-Augmented Policy Optimization and Exponentially Weighted Imitation Learning for Batched Historical Data

The project depends on a custom distributed replay memory called memoire. We remove the commit logs to protect sensitive IP and password information.

Examples for how to use this project for (distributed) reinforcement learning can be found in example.

For replicating the results of our paper, please refer to the scripts in tools. The main entry point is tools/gen_atari_env.py which can generate the shell script for running experiments in parallel, and plotting results with R.

About

Source code for the paper "Divergence-Augmented Policy Optimization"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published