DDQN RL Doom Agent

A Double DQN (DDQN) reinforcement learning agent that learns to play a Doom level. The goal is to kill as many enemies as possible in an episode.
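For readers unfamiliar with Double DQN, the core idea is that the online network selects the next action while a separate target network evaluates it. The following is a minimal pure-Python sketch of that target computation, not the code used in this repository; the function name `ddqn_targets`, its arguments, and the `gamma` value are illustrative assumptions.

```python
def ddqn_targets(rewards, dones, q_online_next, q_target_next, gamma=0.99):
    """Compute Double DQN training targets for a batch of transitions.

    All names here are hypothetical. q_online_next[i] and q_target_next[i]
    are lists of Q-values (one per action) for the next state of transition i.
    """
    targets = []
    for r, done, q_on, q_tg in zip(rewards, dones, q_online_next, q_target_next):
        if done:
            # Terminal transition: there is no future value to bootstrap from.
            targets.append(r)
            continue
        # The online network chooses the action...
        best_action = max(range(len(q_on)), key=q_on.__getitem__)
        # ...and the target network supplies the value of that action.
        targets.append(r + gamma * q_tg[best_action])
    return targets
```

Separating action selection from action evaluation is what distinguishes this from plain DQN, where the target network would do both.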

Demo video

Click to see video

Requirements

  • conda
  • A GPU (optional) can reduce training time. Using it requires the nvidia kernel module to be loaded.
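One way to check the GPU requirement on Linux is to look for the module in `/proc/modules`. This is a small sketch under that assumption; the helper name `nvidia_module_loaded` is hypothetical and is not part of this project.

```python
from pathlib import Path


def nvidia_module_loaded(modules_text):
    """Return True if a kernel module named exactly 'nvidia' is listed.

    modules_text is the content of /proc/modules, where the first
    whitespace-separated field of each line is the module name.
    """
    for line in modules_text.splitlines():
        fields = line.split()
        if fields and fields[0] == "nvidia":
            return True
    return False


if __name__ == "__main__":
    proc_modules = Path("/proc/modules")
    if proc_modules.exists():
        print("nvidia loaded:", nvidia_module_loaded(proc_modules.read_text()))
    else:
        print("/proc/modules not found (probably not a Linux system)")
```

The match is on the exact name, so companion modules such as `nvidia_uvm` alone do not count as loaded.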

Getting started

Step 1: Create project environment.

conda env create --file environment.yml

Step 2: Activate the environment. You need to do this every time you use the project:

source activate doom-agent

Alternatively, avoid activating it manually by defining shortcuts (aliases) as follows:

  • If you are a bash shell user:

    ./setup-shortcuts bash; source ~/.bashrc
  • If you are a zsh shell user:

    ./setup-shortcuts zsh; source ~/.zshrc

You can also use the GPU via optiprime by adding the gpu parameter, like this:

./setup-shortcuts bash gpu; source ~/.bashrc

Step 3: Run the agent demo in the basic scenario.

agent-demo-basic

Step 4: Run the agent demo in the defend the center scenario.

agent-demo-defend-the-center

Usage

Train

agent-train [--weights weights_file_path]

To follow the progress of the training process, run TensorBoard:

agent-metrics

Then open the dashboard at http://localhost:6006

Note: When the training process ends, you will find the best weights file under the reports path.
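If several weights files accumulate, the one with the lowest loss can be picked by parsing the file names. The sketch below assumes the files follow the `best_weights-loss_<value>.h5` pattern seen in the Play section; the helper `best_weights` is hypothetical and is not provided by this project.

```python
import re

# Assumed naming pattern, e.g. "best_weights-loss_0.0208.h5".
LOSS_PATTERN = re.compile(r"loss_(\d+(?:\.\d+)?)\.h5$")


def best_weights(filenames):
    """Return the file name with the lowest parsed loss, or None if none match."""
    scored = []
    for name in filenames:
        match = LOSS_PATTERN.search(name)
        if match:
            scored.append((float(match.group(1)), name))
    # Tuples compare by loss first, so min() yields the lowest-loss file.
    return min(scored)[1] if scored else None
```

The result can then be passed to `agent-play --weights` or `agent-train --weights`. Files that do not carry a loss in their name are ignored.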

Play

Play the agent in the defend the center scenario:

agent-play --config scenarios/defend_the_center/agent.yml \
           --weights scenarios/defend_the_center/weights/best_weights-loss_0.0208.h5

Play the agent in the basic scenario:

agent-play --config scenarios/basic/agent.yml \
           --weights scenarios/basic/weights.h5

Help

To see all available arguments of either command:

agent-train --help
agent-play --help

Scenarios

Basic

The purpose of this scenario is just to check whether using this framework to train an AI in a 3D environment is feasible.

The map is a rectangle with gray walls, ceiling and floor. The player is spawned along the longer wall, in the center. A red, circular monster is spawned randomly somewhere along the opposite wall. The player can only (by config) go left/right and shoot. One hit is enough to kill the monster. The episode finishes when the monster is killed or on timeout.
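The episode rules above can be illustrated with a toy one-dimensional model. This is only a sketch for intuition and does not use the Doom engine; the grid positions, the `run_episode` function, and the action constants are all assumptions made for the example.

```python
# Hypothetical action codes for the three allowed moves.
LEFT, RIGHT, SHOOT = 0, 1, 2


def run_episode(actions, player_x, monster_x, width, timeout):
    """Replay a fixed action sequence and return (killed, steps_used).

    The player moves along a line of `width` cells. A shot hits only when
    the player stands directly opposite the monster.
    """
    for step, action in enumerate(actions[:timeout], start=1):
        if action == LEFT:
            player_x = max(0, player_x - 1)
        elif action == RIGHT:
            player_x = min(width - 1, player_x + 1)
        elif action == SHOOT and player_x == monster_x:
            # One hit kills the monster and ends the episode.
            return True, step
    # No kill: the episode ends when actions run out or on timeout.
    return False, min(len(actions), timeout)
```

For instance, `run_episode([RIGHT, RIGHT, SHOOT], 2, 4, 10, 5)` moves the player to the monster's column and shoots on the third step.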

Defend the center

The purpose of this scenario is to teach the agent that killing monsters is GOOD and being killed by monsters is BAD. In addition, wasting ammunition is not very good either. The agent is rewarded only for killing monsters, so it has to figure out the rest for itself.

The map is a large circle. The player is spawned in the exact center. 5 melee-only monsters are spawned along the wall. Monsters are killed after a single shot. After dying, each monster is respawned after some time. The episode ends when the player dies (this is inevitable because of limited ammo).
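Since only kills are rewarded and each monster dies after a single shot, a missed shot carries no explicit penalty but uses up ammunition that could have earned a reward. The toy sketch below makes that accounting concrete; the function `episode_kills` and the ammo counts used with it are assumptions for illustration, not values taken from the scenario.

```python
def episode_kills(shots, ammo):
    """Count kills for a sequence of shots given limited ammunition.

    shots is a list of booleans (True = the shot hit a monster).
    Only the first `ammo` shots can actually be fired.
    """
    fired = shots[:ammo]
    return sum(1 for hit in fired if hit)
```

With 3 rounds, the sequence hit, miss, hit yields 2 kills and leaves nothing for a fourth shot, while three straight hits yield 3. The number of kills, and therefore the episode reward, can never exceed the available ammunition.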