You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In paper Section 4.3.2, it says it uses positional encoding in RND estimation. However, I noticed that for every environment, there is a wrapper that directly embeds the env's observations. Does it mean that the modified observation is used for all training procedures, not only the RND part? Please point me out if I misunderstand something :)
Additionally, the observation is actually modified a lot depending on the environment. I am wondering how you define the observation space for each environment?
Thanks!
The text was updated successfully, but these errors were encountered:
Hi @Stevenhunter167,
In paper Section 4.3.2, it says it uses positional encoding in RND estimation. However, I noticed that for every environment, there is a wrapper that directly embeds the env's observations. Does it mean that the modified observation is used for all training procedures, not only the RND part? Please point me out if I misunderstand something :)
Additionally, the observation is actually modified a lot depending on the environment. I am wondering how you define the observation space for each environment?
Thanks!
The text was updated successfully, but these errors were encountered: