Comments (3)
Thank you for weighing in. I think you're right in that for these simple environments, the position alone is considered sufficient for describing the state of the environment. A clarification in the text explaining this would however be very helpful in understanding. I'll create a PR adding a clarification statement!
from deep-rl-class.
Hello, as I am relatively new to RL terminology, my comment below is partially guessing.
I think in this case, the state is only the current position of the agent. In the hands-on, this environment is used: https://www.gymlibrary.dev/environments/toy_text/frozen_lake/
If we take a look at the observations:
Observation Space
The observation is a value representing the agent’s current position as current_row * nrows + current_col (where both the row and col start at 0). For example, the goal position in the 4x4 map can be calculated as follows: 3 * 4 + 3 = 15. The number of possible observations is dependent on the size of the map. For example, the 4x4 map has 16 possible observations.
So for an environment such as this, we can calculate the best action for the agent to take based on its position in the grid.
While the full state of the environment itself may include various additional information (position of everything on the grid, image data for rendering graphical elements, additional internal game state), the agent receives only the observations that are necessary for learning, and in a simple environment, that can be only the current position of the agent.
However, the inputs to the Q function are called states (the function gives you the value
for any state
and action
that you input).
From: https://huggingface.co/learn/deep-rl-course/unit2/q-learning
Given a state and action, our Q-function will search its Q-table for the corresponding value.
There is a note here that clarifies the terminology used:
https://huggingface.co/learn/deep-rl-course/unit1/rl-framework
In this course, we use the term "state" to denote both state and observation, but we will make the distinction in implementations.
Of course, for a more complex environment, we may have to provide the agent with more information about the current state of the environment.
from deep-rl-class.
See PR 454. Closing this issue!
from deep-rl-class.
Related Issues (20)
- [UPDATE] An error in quiz 2 HOT 2
- [QUESTION] How P(τ;θ) disappeared while estimating the gradients using trajectory samples? HOT 2
- MARCH 2024 Big Update
- [UPDATE] An unproper description in the Prerequisites in unit1.ipynb HOT 1
- [QUESTION]Q-learning with Monte Carlo updating HOT 1
- [QUESTION] Why is Unit 7 missing HOT 1
- [HANDS-ON BUG] Bonus-Unit 1: Error while uploading the model
- [HANDS-ON BUG] Unit8 - part 1
- Unable to Publish to repo [HANDS-ON BUG] HOT 2
- [HANDS-ON BUG] HOT 3
- [HANDS-ON BUG] Unit 3 error while running "virtual_display = Display(visible=0, size=(1400, 900))" HOT 2
- [UPDATE] Some texts are too close to math signs
- [UPDATE] : Unit 3 DQN HOT 3
- BIG UPDATE April
- [HANDS-ON BUG] Unit 7 on Apple Silicon HOT 2
- [QUESTION] Missing Σ (sigma) symbol in Unit 4 "Gradient Ascent and the Policy-gradient Theorem"
- [HANDS-ON BUG] Deprecated Python 2 dependeny on the course can no longer be installed HOT 1
- FileNotFoundError: [Errno 2] No such file or directory: 'results/Huggy/configuration.yaml' HOT 1
- [HANDS-ON BUG]
- [HANDS-ON BUG] boolean is a digital number other than 1 or 0
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from deep-rl-class.