Python code for Sutton & Barto's book *Reinforcement Learning: An Introduction*
- Tic-Tac-Toe
- Figure 2.2: Average performance of epsilon-greedy action-value methods on the 10-armed testbed (a minimal sketch follows this list)
- Figure 2.3: Optimistic initial action-value estimates
- Figure 2.4: Average performance of UCB action selection on the 10-armed testbed (see the UCB sketch below)
- Figure 2.5: Average performance of the gradient bandit algorithm
- Figure 3.5: Gridworld example with a random policy
- Figure 3.8: Optimal solutions to the gridworld example
- Figure 4.1: Convergence of iterative policy evaluation on a small gridworld (sketched after this list)
- Figure 4.2: Jack’s car rental problem
- Figure 4.3: The solution to the gambler’s problem
- Figure 5.1: Approximate state-value functions for the blackjack policy
- Figure 5.4: Weighted importance sampling
- Figure 5.5: Ordinary importance sampling with surprisingly unstable estimates
- Figure 6.2: Random walk (see the TD(0) sketch below)
- Figure 6.3: Batch updating
- Figure 6.4: Sarsa applied to the windy gridworld
- Figure 6.5: The cliff-walking task (see the Q-learning sketch below)
- Figure 6.7: Interim and asymptotic performance of TD control methods
- Figure 6.8: Comparison of Q-learning and Double Q-learning
- Figure 7.2: Performance of n-step TD methods on the 19-state random walk
- Figure 8.3: Average learning curves for Dyna-Q agents varying in their number of planning steps (see the Dyna-Q sketch below)
- Figure 8.5: Average performance of Dyna agents on a blocking task
- Figure 8.6: Average performance of Dyna agents on a shortcut task
- Figure 8.7: Prioritized sweeping significantly shortens learning time on the Dyna maze task
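
The sketches below are minimal illustrations of the core update rules behind some of these figures, not the repository's actual code; seeds, array sizes, and hyperparameters (`epsilon`, `steps`, etc.) are assumptions chosen for brevity. First, epsilon-greedy action selection with incremental sample-average estimates on a stationary 10-armed testbed (Figure 2.2):

```python
import numpy as np

rng = np.random.default_rng(0)

def run_bandit(k=10, steps=1000, epsilon=0.1):
    """One run of sample-average epsilon-greedy on a stationary k-armed testbed."""
    q_true = rng.normal(0.0, 1.0, k)      # true action values, q*(a) ~ N(0, 1)
    q_est = np.zeros(k)                   # incremental sample-average estimates
    counts = np.zeros(k)
    rewards = np.zeros(steps)
    for t in range(steps):
        if rng.random() < epsilon:        # explore uniformly with prob. epsilon
            a = int(rng.integers(k))
        else:                             # otherwise exploit the current estimates
            a = int(np.argmax(q_est))
        r = rng.normal(q_true[a], 1.0)    # reward ~ N(q*(a), 1)
        counts[a] += 1
        q_est[a] += (r - q_est[a]) / counts[a]   # incremental mean update
        rewards[t] = r
    return rewards

print(run_bandit().mean())                # average reward over one run
```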
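
For Figure 2.4, the only change is how the action is chosen: UCB picks the action maximizing the value estimate plus an exploration bonus. A sketch of just the selection rule, assuming the same `q_est`/`counts` arrays as above and an exploration constant `c` that is a free parameter:

```python
import numpy as np

def ucb_action(q_est, counts, t, c=2.0):
    """UCB selection: argmax_a [ Q(a) + c * sqrt(ln t / N(a)) ]."""
    untried = np.flatnonzero(counts == 0)
    if untried.size:                      # an untried action counts as maximizing
        return int(untried[0])
    return int(np.argmax(q_est + c * np.sqrt(np.log(t) / counts)))
```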
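
Figure 4.1's iterative policy evaluation can be sketched directly from the 4x4 gridworld of Example 4.1: two terminal corner states, reward -1 per step, equiprobable random policy, gamma = 1. The in-place sweep and the stopping threshold `theta` are assumptions:

```python
import numpy as np

N = 4
TERMINALS = {(0, 0), (N - 1, N - 1)}
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]   # up, down, left, right

def step(state, action):
    """Deterministic move; actions that would leave the grid do nothing."""
    i, j = state
    ni, nj = i + action[0], j + action[1]
    return (ni, nj) if 0 <= ni < N and 0 <= nj < N else (i, j)

def policy_evaluation(theta=1e-4):
    V = np.zeros((N, N))
    while True:
        delta = 0.0
        for i in range(N):
            for j in range(N):
                if (i, j) in TERMINALS:
                    continue
                # Expected update under the equiprobable policy (gamma = 1)
                v = sum(0.25 * (-1.0 + V[step((i, j), a)]) for a in ACTIONS)
                delta = max(delta, abs(v - V[i, j]))
                V[i, j] = v                # in-place (Gauss-Seidel style) sweep
        if delta < theta:
            return V
```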
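
The random walk of Figure 6.2 reduces to a few lines of TD(0) prediction: five nonterminal states, episodes start in the center, and only the right terminal pays +1. The step size and episode count here are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)

def td0_random_walk(episodes=100, alpha=0.1):
    """TD(0) value prediction on the 5-state random walk (true values 1/6..5/6)."""
    V = np.full(7, 0.5)                 # states 0 and 6 are terminal
    V[0] = V[6] = 0.0                   # terminal values are 0 by convention
    for _ in range(episodes):
        s = 3                           # every episode starts in the center state
        while s not in (0, 6):
            s2 = s + (1 if rng.random() < 0.5 else -1)
            r = 1.0 if s2 == 6 else 0.0
            V[s] += alpha * (r + V[s2] - V[s])   # TD(0) update, gamma = 1
            s = s2
    return V[1:6]

print(td0_random_walk())
```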
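
Figures 6.5 and 6.8 both rest on the tabular Q-learning update; the cliff-walking environment makes it concrete. The grid layout follows Example 6.6, while `alpha`, `epsilon`, and the episode count are assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

ROWS, COLS = 4, 12
START, GOAL = (3, 0), (3, 11)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]   # up, down, left, right

def step(s, a):
    i = min(max(s[0] + ACTIONS[a][0], 0), ROWS - 1)
    j = min(max(s[1] + ACTIONS[a][1], 0), COLS - 1)
    if i == 3 and 0 < j < COLS - 1:     # stepped into the cliff
        return START, -100.0
    return (i, j), -1.0

def q_learning(episodes=500, alpha=0.5, epsilon=0.1):
    Q = np.zeros((ROWS, COLS, len(ACTIONS)))
    for _ in range(episodes):
        s = START
        while s != GOAL:
            a = int(rng.integers(4)) if rng.random() < epsilon else int(np.argmax(Q[s]))
            s2, r = step(s, a)
            # Off-policy TD target: bootstrap from the greedy value of s2
            Q[s][a] += alpha * (r + Q[s2].max() - Q[s][a])
            s = s2
    return Q
```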
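
Finally, the Dyna figures (8.3, 8.5, 8.6) differ from plain Q-learning only in the planning loop. Below is a sketch of a single Dyna-Q step under the usual deterministic-model assumption; `alpha`, `gamma`, and `planning_steps` are placeholders:

```python
import random
from collections import defaultdict

def dyna_q_update(Q, model, s, a, r, s2, alpha=0.1, gamma=0.95, planning_steps=5):
    """One Dyna-Q step: direct RL, model learning, then n planning backups."""
    # (1) Direct RL: Q-learning update from the real transition.
    Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
    # (2) Model learning: remember the last observed outcome of (s, a).
    model[(s, a)] = (r, s2)
    # (3) Planning: replay simulated transitions drawn from the model.
    for _ in range(planning_steps):
        (ps, pa), (pr, ps2) = random.choice(list(model.items()))
        Q[ps][pa] += alpha * (pr + gamma * max(Q[ps2]) - Q[ps][pa])

# Usage with a tabular Q as a dict of action-value lists:
Q = defaultdict(lambda: [0.0] * 4)
model = {}
dyna_q_update(Q, model, s=(0, 0), a=1, r=-1.0, s2=(1, 0))
```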