Skip to content

Commit

Permalink
Merge pull request #454 from lutzvdb/patch-1
Browse files Browse the repository at this point in the history
Added clarification to the meaning of the rows of the Q-table
  • Loading branch information
simoninithomas authored Jan 15, 2024
2 parents 6dc5937 + 162110a commit 32d5564
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion units/en/unit2/q-learning.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -27,7 +27,8 @@ Let's go through an example of a maze.

<img src="https://huggingface.co/datasets/huggingface-deep-rl-course/course-images/resolve/main/en/unit3/Maze-1.jpg" alt="Maze example"/>

The Q-table is initialized. That's why all values are = 0. This table **contains, for each state and action, the corresponding state-action values.**
The Q-table is initialized. That's why all values are = 0. This table **contains, for each state and action, the corresponding state-action values.**
For this simple example, the state is only defined by the position of the mouse. Therefore, we have 2*3 rows in our Q-table, one row for each possible position of the mouse. In more complex scenarios, the state could contain more information than the position of the actor.

<img src="https://huggingface.co/datasets/huggingface-deep-rl-course/course-images/resolve/main/en/unit3/Maze-2.jpg" alt="Maze example"/>

Expand Down

0 comments on commit 32d5564

Please sign in to comment.