
tyang404/pub

 
 

pub

Miscellaneous public code.

qlearn.py

Simple Q-learning for discrete, deterministic environments.

The script can be run from the command line or in a notebook (e.g. Colab).
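
For reference, this is the standard training rule for deterministic Q-learning as presented in Mitchell's textbook. Here $\hat{Q}$ is the learner's current estimate, $r$ is the immediate reward, $s'$ is the state reached by taking action $a$ in state $s$, and $\gamma$ is the discount factor. That qlearn.py uses exactly this form is an assumption, not something stated in this README.

$$\hat{Q}(s,a) \leftarrow r + \gamma \max_{a'} \hat{Q}(s',a')$$

Because transitions and rewards are deterministic, no learning rate is needed: each update simply overwrites the old estimate.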

The example environment is taken from Tom Mitchell's textbook Machine Learning (1997). See http://www.cs.cmu.edu/~tom/mlbook.html.

  • Environment: All actions in the goal state G loop back, with reward 0.
  • Optimal V(s):
  • Optimal Q(s,a):

Note: The V(s) and Q(s,a) values assume a discount factor of 0.9.
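
The following is a minimal sketch of deterministic Q-learning on a small grid world, meant only to illustrate the setup described above. It is not the code in qlearn.py. The 2x3 layout, the goal in the top-right cell, the reward of 100 for entering G, the `stay` action name, and all function names are assumptions made for illustration. Only the zero-reward self-loop in G and the discount factor of 0.9 come from this README.

```python
import random

GAMMA = 0.9  # discount factor from the note above

# Assumed layout (2 rows x 3 columns), numbered row by row; G is cell 2.
#   0 1 2(G)
#   3 4 5
N_ROWS, N_COLS, GOAL = 2, 3, 2
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def legal_actions(state):
    """Return the actions available in `state`."""
    if state == GOAL:
        return ["stay"]  # hypothetical name for the self-loop in G
    row, col = divmod(state, N_COLS)
    return [a for a, (dr, dc) in MOVES.items()
            if 0 <= row + dr < N_ROWS and 0 <= col + dc < N_COLS]


def step(state, action):
    """Deterministic transition: return (next_state, reward)."""
    if state == GOAL:
        return GOAL, 0  # actions in G loop back with reward 0
    row, col = divmod(state, N_COLS)
    dr, dc = MOVES[action]
    nxt = (row + dr) * N_COLS + (col + dc)
    return nxt, (100 if nxt == GOAL else 0)  # assumed reward for entering G


def q_learn(episodes=500, max_steps=20, seed=0):
    """Learn Q(s, a) by random exploration with the deterministic update."""
    rng = random.Random(seed)
    q = {(s, a): 0.0
         for s in range(N_ROWS * N_COLS) for a in legal_actions(s)}
    for _ in range(episodes):
        state = rng.randrange(N_ROWS * N_COLS)
        for _ in range(max_steps):
            action = rng.choice(legal_actions(state))
            nxt, reward = step(state, action)
            # Overwrite the estimate: Q(s,a) <- r + gamma * max_a' Q(s',a')
            best_next = max(q[(nxt, a)] for a in legal_actions(nxt))
            q[(state, action)] = reward + GAMMA * best_next
            state = nxt
            if state == GOAL:
                break
    return q


def greedy_values(q):
    """Derive V(s) = max_a Q(s, a) from a learned Q table."""
    return {s: max(q[(s, a)] for a in legal_actions(s))
            for s in range(N_ROWS * N_COLS)}


if __name__ == "__main__":
    values = greedy_values(q_learn())
    for r in range(N_ROWS):
        print([round(values[r * N_COLS + c], 1) for c in range(N_COLS)])
```

Under these assumptions the learned values should approach 90, 100, 0 in the top row and 81, 90, 100 in the bottom row. The repository's actual environment may use a different layout or reward scheme.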
