Exercise 07

This exercise investigates planning, i.e. learning from past experience in addition to direct interaction with the environment. The inverted pendulum is revisited for this task.

Tasks:

  1. Q-learning with integrated planning from experience: Dyna-Q
  2. Dyna-Q with integrated planning from a simulation model
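
To make the idea behind the tasks concrete, here is a minimal tabular Dyna-Q sketch. It is not the exercise solution: the chain environment `chain_step`, the function name `dyna_q`, and all hyperparameter values are assumptions chosen for illustration, and the inverted pendulum would additionally need a discretized state and action space. Each real step performs one direct Q-learning update, stores the transition in a learned model, and then replays `n_planning` remembered transitions.

```python
import random
from collections import defaultdict


def chain_step(state, action, n_states=6):
    """Hypothetical toy environment: action 0 moves left, action 1 moves right.

    Reaching the right-most state ends the episode with reward 1.
    """
    if action == 0:
        next_state = max(0, state - 1)
    else:
        next_state = min(n_states - 1, state + 1)
    done = next_state == n_states - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def dyna_q(n_episodes=50, n_planning=10, alpha=0.5, gamma=0.95,
           epsilon=0.1, seed=0):
    rng = random.Random(seed)
    actions = (0, 1)
    q = defaultdict(float)  # action values, keyed by (state, action)
    model = {}              # learned model: (state, action) -> (reward, next_state, done)

    def greedy(state):
        # Greedy action with random tie-breaking.
        best = max(q[(state, a)] for a in actions)
        return rng.choice([a for a in actions if q[(state, a)] == best])

    def q_update(s, a, r, s2, done):
        # One-step Q-learning update; no bootstrapping from terminal states.
        target = r if done else r + gamma * max(q[(s2, b)] for b in actions)
        q[(s, a)] += alpha * (target - q[(s, a)])

    for _ in range(n_episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                action = rng.choice(actions)
            else:
                action = greedy(state)
            next_state, reward, done = chain_step(state, action)

            q_update(state, action, reward, next_state, done)  # direct RL
            model[(state, action)] = (reward, next_state, done)  # model learning

            # Planning: replay transitions sampled from the learned model.
            for _ in range(n_planning):
                ps, pa = rng.choice(list(model))
                pr, ps2, pdone = model[(ps, pa)]
                q_update(ps, pa, pr, ps2, pdone)

            state = next_state
    return q


if __name__ == "__main__":
    q_values = dyna_q()
    for s in range(5):
        print(s, "right" if q_values[(s, 1)] > q_values[(s, 0)] else "left")
```

For the second task, one plausible variant replaces the lookup in the `model` dictionary during planning with a call to a simulation model of the system, so that planning updates can also cover state-action pairs the agent has not visited yet.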