An alternative to this is for the base class to always return a reward of zero and to implement a wrapper for each game that sets the reward. That seems a bit nicer to me in terms of layering.
Same goes for the observation spaces. The base class here could just return an image and everything else in the info dict, and the wrapper could parse things from info into the obs.
WDYT?
https://gymnasium.farama.org/tutorials/gymnasium_basics/implementing_custom_wrappers/#inheriting-from-gymnasium-wrapper
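A rough sketch of the layering proposed above, to make the idea concrete. It is dependency-free so it runs as-is; in the real code these classes would presumably subclass `gymnasium.Env` and `gymnasium.Wrapper` as in the linked tutorial. All names here (`BaseGameEnv`, `ScoreDeltaReward`, `ScoreInObservation`, the `"score"` info key) and the toy dynamics are made up for illustration and are not taken from the repo.

```python
from typing import Any


class BaseGameEnv:
    """Hypothetical base env: image observation, zero reward, raw state in info."""

    def __init__(self) -> None:
        self._score = 0

    def reset(self) -> tuple[list[list[int]], dict[str, Any]]:
        self._score = 0
        return self._render(), {"score": self._score}

    def step(self, action: int):
        # Made-up dynamics: the action value is added to the game score.
        self._score += action
        info = {"score": self._score}
        # Reward is always zero here; wrappers decide what counts as reward.
        return self._render(), 0.0, False, False, info

    def _render(self) -> list[list[int]]:
        # Stand-in for a real frame: a 2x2 "image" of zeros.
        return [[0, 0], [0, 0]]


class ScoreDeltaReward:
    """Hypothetical per-game wrapper: reward is the change in info["score"]."""

    def __init__(self, env) -> None:
        self.env = env
        self._last_score = 0

    def reset(self):
        obs, info = self.env.reset()
        self._last_score = info["score"]
        return obs, info

    def step(self, action: int):
        # Discard the base env's zero reward and compute one from info.
        obs, _, terminated, truncated, info = self.env.step(action)
        reward = float(info["score"] - self._last_score)
        self._last_score = info["score"]
        return obs, reward, terminated, truncated, info


class ScoreInObservation:
    """Hypothetical wrapper: copies selected info fields into the observation."""

    def __init__(self, env) -> None:
        self.env = env

    def _augment(self, obs, info):
        return {"image": obs, "score": info["score"]}

    def reset(self):
        obs, info = self.env.reset()
        return self._augment(obs, info), info

    def step(self, action: int):
        obs, reward, terminated, truncated, info = self.env.step(action)
        return self._augment(obs, info), reward, terminated, truncated, info


# Wrappers stack: the base env stays game-agnostic, each layer adds one concern.
env = ScoreInObservation(ScoreDeltaReward(BaseGameEnv()))
obs, info = env.reset()
obs, reward, terminated, truncated, info = env.step(3)
```

With this split, supporting a new game would mostly mean writing a new reward wrapper and, if needed, a new observation wrapper, without touching the base class.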
Originally posted by @garymm in #86 (comment)