-
Notifications
You must be signed in to change notification settings - Fork 605
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[UPDATE] Misleading characterization of state in the Q-Table #452
Comments
Hello, as I am relatively new to RL terminology, my comment below is partially guessing. I think in this case, the state is only the current position of the agent. In the hands-on, this environment is used: https://www.gymlibrary.dev/environments/toy_text/frozen_lake/ If we take a look at the observations:
So for an environment such as this, we can calculate the best action for the agent to take based on its position in the grid. While the full state of the environment itself may include various additional information (position of everything on the grid, image data for rendering graphical elements, additional internal game state), the agent receives only the observations that are necessary for learning, and in a simple environment, that can be only the current position of the agent. However, the inputs to the Q function are called states (the function gives you the
There is a note here that clarifies the terminology used:
Of course, for a more complex environment, we may have to provide the agent with more information about the current state of the environment. |
Thank you for weighing in. I think you're right in that for these simple environments, the position alone is considered sufficient for describing the state of the environment. A clarification in the text explaining this would however be very helpful in understanding. I'll create a PR adding a clarification statement! |
See PR 454. Closing this issue! |
What do you want to improve?
In the images depicting the Q-table in Unit 2, I am assuming that the first column is supposed to represent the state. In this courses glossary, the state is defined as the "Complete description of the state of the world.", hence I'd expect the state to be something like a screenshot of a given game situation. However, the column items depict the single six tiles of the board.
It is not really clear how this relates to the state of the game, which - if I understand it correctly - should be a state of all six tiles of the board at the same time. Two examples:
I hope I'm not misreading the concepts here, but the way I see it the table is not exactly aiding the understanding of what a "state" is. Especially, in my perception, the Q-table has a lot more than six rows since there are a lot more than six possible states.
The text was updated successfully, but these errors were encountered: