Reinforcement Learning
Reinforcement Learning
Reinforcement Learning
---------------------------------------------------------------------
Low Level Higher Level
Modern view:
GOAL STATE
R3
S
R2
R1
Three elements of associationist theory:
1) stimulus: a problem solving situation
2) response: a particular problem solving behavior
3) associations: strength between stimulus and response
Thorndike’s work on cats in a puzzle box
R3
S
R2
R1
• What about response chains?
• E.g.:
start
goal
start
goal
Demo’s
Reinforcement learning in mazes:
http://www.ise.pw.edu.pl/~cichosz/rl-java/
gorwn S R1 grown
R2 wrong
R3 wrgno
R4 …
The Set-Up For this puzzle you need two people, some rope and some
empty space to do the puzzle in. Each person will need a piece of rope
with a loop tied in both ends, so it can be worn as handcuffs. The rope
should be reasonably long, so that the person wearing it can easily step
over it if they want.
Each person puts on a complete set of handcuffs. Before putting them
on, they loop their handcuffs around each other so they are tied
together. Each person should wear a complete set of handcuffs. They
then have to get themselves apart while following these rules: