An Analogy Between Animals and Computers
Reinforcement learning (RL) is a branch of machine learning that has grown rapidly in the past few years. When I read about RL concepts, I noticed how similar they are to the way animals learn.
Mouse Maze Experiment:
Below is a very famous video by Matthias Wandel. He built several types of mazes and observed how mice explored them at different levels of difficulty.
I want to make an analogy between this experiment and RL concepts.
Reward Maximization:
Exploration:
Exploitation:
Exploration-Exploitation Dilemma:
Markov Decision Process:
1. Estimating the action values with what is called the sample-average method. We will use this method to compute the value of each treatment in our medical trial example.
2. We will describe greedy action selection.
3.
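To make the sample-average method and greedy action selection a bit more concrete, here is a minimal Python sketch. It is only an illustration under stated assumptions: the function names, the three treatments, and their success probabilities are all made up for this example and do not come from any real trial. The sample-average estimate of an action is simply the mean of the rewards observed for it so far, and greedy selection picks the action with the highest current estimate.

```python
import random


def sample_average_update(q, n, action, reward):
    """Update the running mean reward of `action` in place.

    q[action] is the current estimate, n[action] the number of times
    the action has been tried. The incremental form below is equal to
    averaging all rewards seen so far for that action.
    """
    n[action] += 1
    q[action] += (reward - q[action]) / n[action]


def greedy_action(q):
    """Return the index of the highest estimate (first index on ties)."""
    return max(range(len(q)), key=lambda a: q[a])


# Hypothetical success probabilities for three treatments (assumed values).
true_success = [0.25, 0.75, 0.5]

rng = random.Random(0)          # fixed seed so the run is repeatable
q = [0.0] * len(true_success)   # estimated value of each treatment
n = [0] * len(true_success)     # how often each treatment was tried

for _ in range(300):
    action = rng.randrange(len(true_success))  # pure exploration: pick at random
    reward = 1.0 if rng.random() < true_success[action] else 0.0
    sample_average_update(q, n, action, reward)

print("estimates:", [round(v, 2) for v in q])
print("greedy choice:", greedy_action(q))
```

In this sketch the treatments are tried uniformly at random, so every estimate gets data. An agent that always acted greedily from the start could lock onto an early lucky treatment, which is exactly the exploration-exploitation dilemma.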