Reinforcement Learning

An Anology Between Animals And Computers

Reinforcement learning(RL) is a kind of deep learning that has been growing in the past few years. When I read about the RL concepts I thought it was very similar to how animals learn.

Mouse Maze Experiment:

Below is a very famous video by Matthias Wandel. He made different types of mazes and observed the mice while they were exploring the mazes at different difficulties.

I want to make an analogy between this experiment and RL concepts.

Reward Maximization:

Exploration:

Exploitation:

Exploration-Exploitation Dilemma

Markov Decision Process:

1. estimating the action values called the sample-average method.

We will use this method to compute the value

of each treatment in our medical trial example.

2. we will describe greedy action selection.