diff --git a/main/.buildinfo b/main/.buildinfo index 1ab133e1b..a2227afe7 100644 --- a/main/.buildinfo +++ b/main/.buildinfo @@ -1,4 +1,4 @@ # Sphinx build info version 1 # This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done. -config: d0994afbf4829f55c72679d8625ef803 +config: 52f055dfc6fd0bcfbece04e77429c793 tags: d77d1c0d9ca2f4c8421862c7c5a0d620 diff --git a/main/_downloads/315c4c52fb68082a731b192d944e2ede/tutorials_python.zip b/main/_downloads/315c4c52fb68082a731b192d944e2ede/tutorials_python.zip index dab85dd05..ddd2d58d8 100644 Binary files a/main/_downloads/315c4c52fb68082a731b192d944e2ede/tutorials_python.zip and b/main/_downloads/315c4c52fb68082a731b192d944e2ede/tutorials_python.zip differ diff --git a/main/_downloads/a5659940aa3f8f568547d47752a43172/tutorials_jupyter.zip b/main/_downloads/a5659940aa3f8f568547d47752a43172/tutorials_jupyter.zip index 21fc61925..d9e0317f0 100644 Binary files a/main/_downloads/a5659940aa3f8f568547d47752a43172/tutorials_jupyter.zip and b/main/_downloads/a5659940aa3f8f568547d47752a43172/tutorials_jupyter.zip differ diff --git a/main/gymnasium_release_notes/index.html b/main/gymnasium_release_notes/index.html index d3e28fc18..bc2ddac7e 100644 --- a/main/gymnasium_release_notes/index.html +++ b/main/gymnasium_release_notes/index.html @@ -375,6 +375,302 @@

Gymnasium Release Notes

+
+

v1.0.0

+

Released on 2024-10-08 - GitHub - PyPI

+

v1.0.0 release notes

+

Over the last few years, the volunteer team behind Gym and Gymnasium has worked to fix bugs, improve the documentation, add new features, and change the API where appropriate so that the benefits outweigh the costs. This is the complete release of v1.0.0, which will be the end of this road to change the project's central API (Env, Space, VectorEnv). In addition, the release has included over 200 PRs since 0.29.1, with many bug fixes, new features, and improved documentation. So, thank you to all the volunteers for their hard work that has made this possible. For the rest of these release notes, we include sections of core API changes, ending with the additional new features, bug fixes, deprecation and documentation changes included.

+

Finally, we have published a paper on Gymnasium, discussing its overall design decisions and more at https://arxiv.org/abs/2407.17032, which can be cited using the following:

+
@misc{towers2024gymnasium,
+      title={Gymnasium: A Standard Interface for Reinforcement Learning Environments}, 
+      author={Mark Towers and Ariel Kwiatkowski and Jordan Terry and John U. Balis and Gianluca De Cola and Tristan Deleu and Manuel Goulão and Andreas Kallinteris and Markus Krimmel and Arjun KG and Rodrigo Perez-Vicente and Andrea Pierré and Sander Schulhoff and Jun Jet Tai and Hannah Tan and Omar G. Younis},
+      year={2024},
+      eprint={2407.17032},
+      archivePrefix={arXiv},
+      primaryClass={cs.LG},
+      url={https://arxiv.org/abs/2407.17032}, 
+}
+
+

Removing The Plugin System

+

Within Gym v0.23+ and Gymnasium v0.26 to v0.29, an undocumented feature for registering external environments behind the scenes has been removed. For users of Atari (ALE), Minigrid or HighwayEnv, then users could previously use the following code:

+
import gymnasium as gym
+
+env = gym.make("ALE/Pong-v5")
+

Despite Atari never being imported (i.e., import ale_py), users can still create an Atari environment. This feature has been removed in v1.0.0, which will require users to update to

+
import gymnasium as gym
+import ale_py
+
+gym.register_envs(ale_py)  # optional, helpful for IDEs or pre-commit
+
+env = gym.make("ALE/Pong-v5")
+

Alternatively, users can use the following structure, module_name:env_id, ' so that the module is imported first before the environment is created. e.g., ale_py:ALE/Pong-v5`.

+
import gymnasium as gym
+
+env = gym.make("ale_py:ALE/Pong-v5")
+

To help users with IDEs (e.g., VSCode, PyCharm), when importing modules to register environments (e.g., import ale_py) this can cause the IDE (and pre-commit isort / black / flake8) to believe that the import is pointless and should be removed. Therefore, we have introduced gymnasium.register_envs as a no-op function (the function literally does nothing) to make the IDE believe that something is happening and the import statement is required.

+

Vector Environments

+

To increase the sample speed of an environment, vectorizing is one of the easiest ways to sample multiple instances of the same environment simultaneously. Gym and Gymnasium provide the VectorEnv as a base class for this, but one of its issues has been that it inherited Env. This can cause particular issues with type checking (the return type of step is different for Env and VectorEnv), testing the environment type (isinstance(env, Env) can be true for vector environments despite the two acting differently) and finally wrappers (some Gym and Gymnasium wrappers supported Vector environments, but there are no clear or consistent API for determining which do or don't). Therefore, we have separated out Env and VectorEnv to not inherit from each other.

+

In implementing the new separate VectorEnv class, we have tried to minimize the difference between code using Env and VectorEnv along with making it more generic in places. The class contains the same attributes and methods as Env in addition to the attributes num_envs: int, single_action_space: gymnasium.Space and single_observation_space: gymnasium.Space. Further, we have removed several functions from VectorEnv that are not needed for all vector implementations: step_async, step_wait, reset_async, reset_wait, call_async and call_wait. This change now allows users to write their own custom vector environments, v1.0.0 includes an example vector cartpole environment that runs thousands of times faster written solely with NumPy than using Gymnasium's Sync vector environment.

+

To allow users to create vectorized environments easily, we provide gymnasium.make_vec as a vectorized equivalent of gymnasium.make. As there are multiple different vectorization options ("sync", "async", and a custom class referred to as "vector_entry_point"), the argument vectorization_mode selects how the environment is vectorized. This defaults to None such that if the environment has a vector entry point for a custom vector environment implementation, this will be utilized first (currently, Cartpole is the only environment with a vector entry point built into Gymnasium). Otherwise, the synchronous vectorizer is used (previously, the Gym and Gymnasium vector.make used asynchronous vectorizer as default). For more information, see the function docstring. We are excited to see other projects utilize this option to make creating their environments easier.

+
env = gym.make("CartPole-v1")
+env = gym.wrappers.ClipReward(env, min_reward=-1, max_reward=3)
+
+envs = gym.make_vec("CartPole-v1", num_envs=3)
+envs = gym.wrappers.vector.ClipReward(envs, min_reward=-1, max_reward=3)
+

Due to this split of Env and VectorEnv, there are now Env only wrappers and VectorEnv only wrappers in gymnasium.wrappers and gymnasium.wrappers.vector respectively. Furthermore, we updated the names of the base vector wrappers from VectorEnvWrapper to VectorWrapper and added VectorObservationWrapper, VectorRewardWrapper and VectorActionWrapper classes. See the vector wrapper page for new information.

+

To increase the efficiency of vector environments, autoreset is a common feature that allows sub-environments to reset without requiring all sub-environments to finish before resetting them all. Previously in Gym and Gymnasium, auto-resetting was done on the same step as the environment episode ends, such that the final observation and info would be stored in the step's info, i.e., info["final_observation"] and info[“final_info”] and standard obs and info containing the sub-environment's reset observation and info. Thus, accurately sampling observations from a vector environment required the following code (note the need to extract the infos["next_obs"][j] if the sub-environment was terminated or truncated). Additionally, for on-policy algorithms that use rollout would require an additional forward pass to compute the correct next observation (this is often not done as an optimization assuming that environments only terminate, not truncate).

+
replay_buffer = []
+obs, _ = envs.reset()
+for _ in range(total_timesteps):
+    next_obs, rewards, terminations, truncations, infos = envs.step(envs.action_space.sample())
+
+    for j in range(envs.num_envs):
+        if not (terminations[j] or truncations[j]):
+            replay_buffer.append((
+                obs[j], rewards[j], terminations[j], truncations[j], next_obs[j]
+            ))
+        else:
+            replay_buffer.append((
+                obs[j], rewards[j], terminations[j], truncations[j], infos["next_obs"][j]
+            ))
+
+    obs = next_obs
+

However, over time, the development team has recognized the inefficiency of this approach (primarily due to the extensive use of a Python dictionary) and the annoyance of having to extract the final observation to train agents correctly, for example. Therefore, in v1.0.0, we are modifying autoreset to align with specialized vector-only projects like EnvPool and SampleFactory where the sub-environment's doesn't reset until the next step. As a result, the following changes are required when sampling:

+
replay_buffer = []
+obs, _ = envs.reset()
+autoreset = np.zeros(envs.num_envs)
+for _ in range(total_timesteps):
+    next_obs, rewards, terminations, truncations, _ = envs.step(envs.action_space.sample())
+
+    for j in range(envs.num_envs):
+        if not autoreset[j]:
+            replay_buffer.append((
+                obs[j], rewards[j], terminations[j], truncations[j], next_obs[j]
+            ))
+
+    obs = next_obs
+    autoreset = np.logical_or(terminations, truncations)
+

For on-policy rollout, to account for the autoreset requires masking the error for the first observation in a new episode (done[t+1]) to prevent computing the error between the last and first observations of episodes.

+

Finally, we have improved AsyncVectorEnv.set_attr and SyncVectorEnv.set_attr functions to use the Wrapper.set_wrapper_attr to allow users to set variables anywhere in the environment stack if it already exists. Previously, this was not possible and users could only modify the variable in the "top" wrapper on the environment stack, importantly not the actual environment itself.

+

Wrappers

+

Previously, some wrappers could support both environment and vector environments, however, this was not standardized, and was unclear which wrapper did and didn't support vector environments. For v1.0.0, with separating Env and VectorEnv to no longer inherit from each other (read more in the vector section), the wrappers in gymnasium.wrappers will only support standard environments and wrappers in gymnasium.wrappers.vector contains the provided specialized vector wrappers (most but not all wrappers are supported, please raise a feature request if you require it).

+

In v0.29, we deprecated the Wrapper.__getattr__ function to be replaced by Wrapper.get_wrapper_attr, providing access to variables anywhere in the environment stack. In v1.0.0, we have added Wrapper.set_wrapper_attr as an equivalent function for setting a variable anywhere in the environment stack if it already exists; otherwise the variable is assigned to the top wrapper.

+

Most significantly, we have removed, renamed, and added several wrappers listed below.

+ +

For all wrappers, we have added example code documentation and a changelog to help future researchers understand any changes made. See the following page for an example.

+

Functional Environments

+

One of the substantial advantages of Gymnasium's Env is it generally requires minimal information about the underlying environment specifications; however, this can make applying such environments to planning, search algorithms, and theoretical investigations more difficult. We are proposing FuncEnv as an alternative definition to Env which is closer to a Markov Decision Process definition, exposing more functions to the user, including the observation, reward, and termination functions along with the environment's raw state as a single object.

+
from typing import Any
+import gymnasium as gym
+from gymnasium.functional import StateType, ObsType, ActType, RewardType, TerminalType, Params
+
+class ExampleFuncEnv(gym.functional.FuncEnv):
+  def initial(self, rng: Any, params: Params | None = None) -> StateType:
+    ...
+  def transition(self, state: StateType, action: ActType, rng: Any, params: Params | None = None) -> StateType:
+    ...
+  def observation(self, state: StateType, rng: Any, params: Params | None = None) -> ObsType:
+    ...
+  def reward(
+      self, state: StateType, action: ActType, next_state: StateType, rng: Any, params: Params | None = None
+  ) -> RewardType:
+    ...
+  def terminal(self, state: StateType, rng: Any, params: Params | None = None) -> TerminalType:
+    ...
+

FuncEnv requires that initial and transition functions return a new state given its inputs as a partial implementation of Env.step and Env.reset. As a result, users can sample (and save) the next state for a range of inputs to use with planning, searching, etc. Given a state, observation, reward, and terminal provide users explicit definitions to understand how each can affect the environment's output.

+

Collecting Seeding Values

+

It was possible to seed with both environments and spaces with None to use a random initial seed value, however it wouldn't be possible to know what these initial seed values were. We have addressed this for Space.seed and reset.seed in #1033 and #889. Additionally, for Space.seed, we have changed the return type to be specialized for each space such that the following code will work for all spaces.

+
seeded_values = space.seed(None)
+initial_samples = [space.sample() for _ in range(10)]
+
+reseed_values = space.seed(seeded_values)
+reseed_samples = [space.sample() for _ in range(10)]
+
+assert seeded_values == reseed_values
+assert initial_samples == reseed_samples
+

Additionally, for environments, we have added a new np_random_seed attribute that will store the most recent np_random seed value from reset(seed=seed).

+

Environment Version Changes

+ +

Additional Bug Fixes

+ +

Additional new features

+ +

Deprecation

+ +

Documentation changes

+ +

Full Changelog: v0.29.1...v1.0.0

v1.0.0a2: v1.0.0 alpha 2

Released on 2024-05-21 - GitHub - PyPI

@@ -1119,6 +1415,7 @@

v0.26.0: Initial Release
  • Gymnasium Release Notes
      +
    • v1.0.0
    • v1.0.0a2: v1.0.0 alpha 2
    • v1.0.0a1: v1.0.0 alpha1
    • v0.29.1
    • diff --git a/main/introduction/migration_guide/index.html b/main/introduction/migration_guide/index.html index f0920bbf3..9675b648e 100644 --- a/main/introduction/migration_guide/index.html +++ b/main/introduction/migration_guide/index.html @@ -375,7 +375,7 @@

      Migration Guide - v0.21 to v1.0.0

      -

      Gymnasium is a fork of OpenAI Gym v0.26, which introduced a large breaking change from Gym v0.21. In this guide, we briefly outline the API changes from Gym v0.21 - which a number of tutorials have been written for - to Gym v0.26. For environments still stuck in the v0.21 API, see the guide

      +

      Gymnasium is a fork of OpenAI Gym v0.26, which introduced a large breaking change from Gym v0.21.In this guide, we briefly outline the API changes from Gym v0.21 - which a number of tutorials have been written for - to Gym v0.26 (and later, including 1.0.0). For environments still stuck in the v0.21 API, see the guide

      Example code for v0.21

      import gym
      @@ -394,8 +394,8 @@ 

      Example code for v0.21

      -
      -

      Example code for v0.26

      +
      +

      Example code for v0.26 and later, including v1.0.0

      import gym
       env = gym.make("LunarLander-v3", render_mode="human")
       observation, info = env.reset(seed=123, options={})
      @@ -418,7 +418,7 @@ 

      Seed and random number generator

      Environment Reset

      -

      In v0.26, reset() takes two optional parameters and returns one value. This contrasts to v0.21 which takes no parameters and returns None. The two parameters are seed for setting the random number generator and options which allows additional data to be passed to the environment on reset. For example, in classic control, the options parameter now allows users to modify the range of the state bound. See the original PR for more details.

      +

      In v0.26+, reset() takes two optional parameters and returns one value. This contrasts to v0.21 which takes no parameters and returns None. The two parameters are seed for setting the random number generator and options which allows additional data to be passed to the environment on reset. For example, in classic control, the options parameter now allows users to modify the range of the state bound. See the original PR for more details.

      reset() further returns info, similar to the info returned by step(). This is important because info can include metrics or valid action mask that is used or saved in the next step.

      To update older environments, we highly recommend that super().reset(seed=seed) is called on the first line of reset(). This will automatically update the np_random with the seed value.

      @@ -430,11 +430,11 @@

      Environment Step

      TimeLimit Wrapper

      In v0.21, the TimeLimit wrapper added an extra key in the info dictionary TimeLimit.truncated whenever the agent reached the time limit without reaching a terminal state.

      -

      In v0.26, this information is instead communicated through the truncated return value described in the previous section, which is True if the agent reaches the time limit, whether or not it reaches a terminal state. The old dictionary entry is equivalent to truncated and not terminated

      +

      In v0.26+, this information is instead communicated through the truncated return value described in the previous section, which is True if the agent reaches the time limit, whether or not it reaches a terminal state. The old dictionary entry is equivalent to truncated and not terminated

      Environment Render

      -

      In v0.26, a new render API was introduced such that the render mode is fixed at initialisation as some environments don’t allow on-the-fly render mode changes. Therefore, users should now specify the render_mode within gym.make as shown in the v0.26 example code above.

      +

      In v0.26, a new render API was introduced such that the render mode is fixed at initialisation as some environments don’t allow on-the-fly render mode changes. Therefore, users should now specify the render_mode within gym.make as shown in the v0.26+ example code above.

      For a more complete explanation of the changes, please refer to this summary.

      @@ -519,7 +519,7 @@

      Removed codeMigration Guide - v0.21 to v1.0.0
      • Example code for v0.21
      • -
      • Example code for v0.26
      • +
      • Example code for v0.26 and later, including v1.0.0
      • Seed and random number generator
      • Environment Reset
      • Environment Step
      • diff --git a/main/searchindex.js b/main/searchindex.js index 7ef8eb94a..e4f865476 100644 --- a/main/searchindex.js +++ b/main/searchindex.js @@ -1 +1 @@ -Search.setIndex({"alltitles": {"0.12.5": [[154, "release-0-12-5"]], "0.19.0": [[154, "release-0-19-0"]], "0.22.0": [[154, "release-0-22-0"]], "0.23.0": [[154, "release-0-23-0"]], "0.23.1": [[154, "release-0-23-1"]], "0.24.0": [[154, "release-0-24-0"]], "0.24.1": [[154, "release-0-24-1"]], "0.25.0": [[154, "release-0-25-0"]], "0.25.1": [[154, "release-0-25-1"]], "0.25.2": [[154, "release-0-25-2"]], "0.26.0": [[154, "release-0-26-0"]], "0.26.1": [[154, "release-0-26-1"]], "0.26.2": [[154, "release-0-26-2"]], "404 - Page Not Found": [[0, null]], "Acrobot": [[131, null]], "Action Space": [[127, "action-space"], [128, "action-space"], [129, "action-space"], [131, "action-space"], [132, "action-space"], [133, "action-space"], [134, "action-space"], [135, "action-space"], [137, "action-space"], [138, "action-space"], [139, "action-space"], [140, "action-space"], [141, "action-space"], [142, "action-space"], [143, "action-space"], [144, "action-space"], [145, "action-space"], [146, "action-space"], [147, "action-space"], [150, "action-space"], [151, "action-space"], [152, "action-space"], [153, "action-space"]], "Action Wrappers": [[16, null]], "Action and observation spaces": [[157, "action-and-observation-spaces"]], "Adding a new environment": [[1, "adding-a-new-environment"]], "Additional Methods": [[10, "additional-methods"], [11, "additional-methods"], [12, "additional-methods"]], "Additional functions": [[4, "additional-functions"]], "Advantage Actor-Critic (A2C)": [[170, "advantage-actor-critic-a2c"]], "Adventure": [[22, null]], "AgileRL": [[172, "agilerl"]], "Air Raid": [[23, null]], "Algorithmic heuristics": [[162, "algorithmic-heuristics"]], "Alien": [[24, null]], "Alternative solutions": [[165, "alternative-solutions"]], "Amidar": [[25, null]], "Ant": [[137, null]], "Arguments": [[127, "arguments"], [128, "arguments"], [129, "arguments"], [131, "arguments"], [132, "arguments"], [133, "arguments"], [134, "arguments"], [135, "arguments"], [137, "arguments"], [138, "arguments"], [139, "arguments"], [140, "arguments"], [141, "arguments"], [142, "arguments"], [143, "arguments"], [144, "arguments"], [145, "arguments"], [146, "arguments"], [147, "arguments"], [150, "arguments"], [151, "arguments"], [152, "arguments"], [153, "arguments"]], "Assault": [[26, null]], "Asterix": [[27, null]], "Asteroids": [[28, null]], "AsyncVectorEnv": [[11, null]], "Atari": [[21, null]], "Atlantis": [[29, null]], "Atlantis2": [[30, null]], "Attributes": [[2, "attributes"], [5, "attributes"], [10, "attributes"], [15, "attributes"]], "Autonomous Driving environments": [[148, "autonomous-driving-environments"], [148, "id2"]], "Available Action Wrappers": [[16, "available-action-wrappers"]], "Backgammon": [[31, null]], "Bank Heist": [[32, null]], "Base Class": [[16, "base-class"]], "Basic Math": [[33, null]], "Basic Usage": [[157, null]], "Battle Zone": [[34, null]], "Beam Rider": [[35, null]], "Berzerk": [[36, null]], "Biological / Medical environments": [[148, "biological-medical-environments"]], "Bipedal Walker": [[127, null]], "Blackjack": [[37, null], [150, null]], "Bowling": [[38, null]], "Box2D": [[126, null]], "Boxing": [[39, null]], "Breakout": [[40, null]], "Build the Documentation": [[1, "build-the-documentation"]], "Building an agent": [[163, "building-an-agent"], [175, "building-an-agent"], [177, "building-an-agent"]], "Car Racing": [[128, null]], "Carnival": [[41, null]], "Cart Pole": [[132, null]], "Casino": [[42, null]], "Centipede": [[43, null]], "Chopper Command": [[44, null]], "Classic Control": [[130, null]], "Cliff Walking": [[151, null]], "Close": [[165, "close"]], "Common Wrappers": [[17, "common-wrappers"]], "Comparing training performance across versions": [[136, "comparing-training-performance-across-versions"]], "Compatibility with Gym": [[159, null]], "Composite Spaces": [[5, "composite-spaces"], [6, null]], "Constructing Observations": [[158, "constructing-observations"]], "Constructing Observations From Environment States": [[165, "constructing-observations-from-environment-states"]], "Converting Jax-based Functional environments to standard Env": [[3, "converting-jax-based-functional-environments-to-standard-env"]], "Core variables": [[4, "core-variables"]], "Crazy Climber": [[45, null]], "Create a Custom Environment": [[158, null]], "Creating Environment Instances": [[165, "creating-environment-instances"]], "Creating a Package": [[165, "creating-a-package"]], "Creating environments": [[2, "creating-environments"]], "Creating the Q-table": [[173, "creating-the-q-table"]], "Credits": [[127, "credits"], [128, "credits"], [129, "credits"]], "Crossbow": [[46, null]], "Darkchambers": [[47, null]], "Data Conversion Wrappers": [[17, "data-conversion-wrappers"]], "Declaration and Initialization": [[165, "declaration-and-initialization"]], "Defender": [[48, null]], "Demon Attack": [[49, null]], "Dependencies": [[173, "dependencies"]], "Description": [[127, "description"], [128, "description"], [129, "description"], [131, "description"], [132, "description"], [133, "description"], [134, "description"], [135, "description"], [137, "description"], [138, "description"], [139, "description"], [140, "description"], [141, "description"], [142, "description"], [143, "description"], [144, "description"], [145, "description"], [146, "description"], [147, "description"], [150, "description"], [151, "description"], [152, "description"], [153, "description"]], "Domain Randomization": [[170, "domain-randomization"]], "Donkey Kong": [[50, null]], "Double Dunk": [[51, null]], "Earthworld": [[52, null]], "Economic / Financial environments": [[148, "economic-financial-environments"]], "Editing an environment page": [[1, "editing-an-environment-page"]], "Electrical / Energy environments": [[148, "electrical-energy-environments"]], "Elevator Action": [[53, null]], "Enduro": [[54, null]], "Entombed": [[55, null]], "Env": [[2, null]], "Environment Checking": [[9, "environment-checking"]], "Environment Render": [[160, "environment-render"]], "Environment Reset": [[160, "environment-reset"]], "Environment Step": [[160, "environment-step"]], "Environment __init__": [[158, "environment-init"]], "Environment pickling": [[9, "environment-pickling"]], "Epilogue": [[169, "epilogue"]], "Episode End": [[131, "episode-end"], [132, "episode-end"], [133, "episode-end"], [134, "episode-end"], [137, "episode-end"], [138, "episode-end"], [139, "episode-end"], [140, "episode-end"], [141, "episode-end"], [142, "episode-end"], [143, "episode-end"], [144, "episode-end"], [145, "episode-end"], [146, "episode-end"], [147, "episode-end"], [150, "episode-end"], [151, "episode-end"], [152, "episode-end"], [153, "episode-end"]], "Episode Termination": [[127, "episode-termination"], [128, "episode-termination"], [129, "episode-termination"]], "Episode Truncation": [[135, "episode-truncation"]], "Et": [[56, null]], "Exact reproducibility": [[136, "exact-reproducibility"]], "Example code for v0.21": [[160, "example-code-for-v0-21"]], "Example code for v0.26": [[160, "example-code-for-v0-26"]], "Executing an action": [[163, "executing-an-action"], [175, "executing-an-action"]], "Explaining the code": [[157, "explaining-the-code"]], "External Environments": [[148, null]], "First-Party Environments": [[148, "first-party-environments"]], "Fishing Derby": [[57, null]], "Flag Capture": [[58, null]], "Freeway": [[59, null]], "Frogger": [[60, null]], "Frostbite": [[61, null]], "Frozen Lake": [[152, null]], "Frozenlake benchmark": [[173, null]], "Functional Env": [[3, null]], "Fundamental Spaces": [[5, "fundamental-spaces"], [7, null]], "Galaxian": [[62, null]], "Game environments": [[148, "game-environments"]], "Generate your environment": [[165, "generate-your-environment"]], "Gopher": [[63, null]], "Gravitar": [[64, null]], "Gym Release Notes": [[154, null]], "Gym v0.21 Environment Compatibility": [[159, "gym-v0-21-environment-compatibility"]], "Gymnasium Basics": [[164, null], [168, null], [171, "gymnasium-basics"]], "Gymnasium Release Notes": [[155, null]], "Gymnasium-docs": [[1, null]], "Gymnasium.vector.VectorEnv": [[10, "gymnasium-vector-vectorenv"]], "Half Cheetah": [[138, null]], "Handling Time Limits": [[166, null]], "Hangman": [[65, null]], "Haunted House": [[66, null]], "Hero": [[67, null]], "Hopper": [[139, null]], "Human Cannonball": [[68, null]], "Humanoid": [[140, null]], "Humanoid Standup": [[141, null]], "Ice Hockey": [[69, null]], "Implemented Action wrappers": [[14, "implemented-action-wrappers"]], "Implemented Data Conversion wrappers": [[14, "implemented-data-conversion-wrappers"]], "Implemented Observation wrappers": [[14, "implemented-observation-wrappers"]], "Implemented Reward wrappers": [[14, "implemented-reward-wrappers"]], "Implemented Wrappers": [[18, "implemented-wrappers"], [19, "implemented-wrappers"]], "Implementing Custom Wrappers": [[167, null]], "Implementing environments": [[2, "implementing-environments"]], "Importance in learning code": [[166, "importance-in-learning-code"]], "Imports and Environment Setup": [[175, "imports-and-environment-setup"]], "Information": [[150, "information"], [151, "information"], [152, "information"], [153, "information"]], "Inheriting from gymnasium.ActionWrapper": [[167, "inheriting-from-gymnasium-actionwrapper"]], "Inheriting from gymnasium.ObservationWrapper": [[167, "inheriting-from-gymnasium-observationwrapper"]], "Inheriting from gymnasium.RewardWrapper": [[167, "inheriting-from-gymnasium-rewardwrapper"]], "Inheriting from gymnasium.Wrapper": [[167, "inheriting-from-gymnasium-wrapper"]], "Initializing Environments": [[157, "initializing-environments"]], "Instructions for modifying environment pages": [[1, "instructions-for-modifying-environment-pages"]], "Interacting with the Environment": [[157, "interacting-with-the-environment"]], "Introduction": [[170, "introduction"]], "Inverted Double Pendulum": [[142, null]], "Inverted Pendulum": [[143, null]], "Jamesbond": [[70, null]], "Journey Escape": [[71, null]], "Kaboom": [[72, null]], "Kangaroo": [[73, null]], "Keystone Kapers": [[74, null]], "King Kong": [[75, null]], "Klax": [[76, null]], "Koolaid": [[77, null]], "Krull": [[78, null]], "Kung Fu Master": [[79, null]], "Laser Gates": [[80, null]], "List of Wrappers": [[20, null]], "Load custom quadruped robot environments": [[169, null]], "Loading OpenAI Gym environments": [[159, "loading-openai-gym-environments"]], "Lost Luggage": [[81, null]], "Lunar Lander": [[129, null]], "Make and register": [[4, null]], "Make your own custom environment": [[165, null]], "Making Vector Environments": [[10, "making-vector-environments"]], "Map size: 11 \\times 11": [[173, "map-size-11-times-11"]], "Map size: 4 \\times 4": [[173, "map-size-4-times-4"]], "Map size: 7 \\times 7": [[173, "map-size-7-times-7"]], "Map size: 9 \\times 9": [[173, "map-size-9-times-9"]], "Mario Bros": [[82, null]], "Mathematics / Computational": [[148, "mathematics-computational"]], "Methods": [[2, "methods"], [5, "methods"], [10, "methods"], [15, "methods"]], "Migration Guide - v0.21 to v1.0.0": [[160, null]], "Miniature Golf": [[83, null]], "Misc Wrappers": [[17, null]], "Miscellaneous": [[13, "miscellaneous"]], "Modifying the environment": [[157, "modifying-the-environment"]], "Montezuma Revenge": [[84, null]], "More information": [[157, "more-information"], [161, "more-information"]], "Mountain Car": [[133, null]], "Mountain Car Continuous": [[134, null]], "Mr Do": [[85, null]], "Ms Pacman": [[86, null]], "MuJoCo": [[136, null]], "Name This Game": [[87, null]], "Notes": [[129, "notes"], [146, "notes"]], "Notice": [[170, "notice"]], "Observation Space": [[127, "observation-space"], [128, "observation-space"], [129, "observation-space"], [131, "observation-space"], [132, "observation-space"], [133, "observation-space"], [134, "observation-space"], [135, "observation-space"], [137, "observation-space"], [138, "observation-space"], [139, "observation-space"], [140, "observation-space"], [141, "observation-space"], [142, "observation-space"], [143, "observation-space"], [144, "observation-space"], [145, "observation-space"], [146, "observation-space"], [147, "observation-space"], [150, "observation-space"], [151, "observation-space"], [152, "observation-space"], [153, "observation-space"]], "Observation Wrappers": [[18, null]], "Observing the environment": [[175, "observing-the-environment"]], "Old to New Step API Compatibility": [[9, "old-to-new-step-api-compatibility"]], "Optimizing training": [[162, "optimizing-training"]], "Othello": [[88, null]], "Other": [[148, "other"]], "Other environments": [[148, "other-environments"]], "Other steps": [[1, "other-steps"]], "Pacman": [[89, null]], "Parameters we\u2019ll use": [[173, "parameters-we-ll-use"]], "Pendulum": [[135, null]], "Performance Analysis of Synchronous and Asynchronous Vectorized Environments": [[170, "performance-analysis-of-synchronous-and-asynchronous-vectorized-environments"]], "Phoenix": [[90, null]], "Pitfall": [[91, null]], "Pitfall2": [[92, null]], "Plot learning curve": [[177, "plot-learning-curve"]], "Plotting": [[170, "plotting"]], "Policy Network": [[177, "policy-network"]], "Pong": [[93, null]], "Pooyan": [[94, null]], "Private Eye": [[95, null]], "Pusher": [[144, null]], "Qbert": [[96, null]], "Reacher": [[145, null]], "Recommended solution": [[165, "recommended-solution"]], "Recording Agents": [[161, null]], "Recording Every Episode": [[161, "recording-every-episode"]], "Recording the Agent during Training": [[161, "recording-the-agent-during-training"]], "References": [[128, "references"], [131, "references"], [150, "references"], [151, "references"], [153, "references"], [170, "references"], [173, "references"], [177, "references"]], "Registering Envs": [[165, "registering-envs"]], "Registering and making the environment": [[158, "registering-and-making-the-environment"]], "Removed code": [[160, "removed-code"]], "Rendering": [[165, "rendering"]], "Rendering Arguments": [[136, "rendering-arguments"]], "Rendering Backend": [[136, "rendering-backend"]], "Reset": [[165, "reset"]], "Reset Arguments": [[128, "reset-arguments"]], "Reset function": [[158, "reset-function"]], "Reward": [[134, "reward"], [151, "reward"]], "Reward Wrappers": [[19, null]], "Reward:": [[133, "reward"]], "Rewards": [[127, "rewards"], [128, "rewards"], [129, "rewards"], [131, "rewards"], [132, "rewards"], [135, "rewards"], [137, "rewards"], [138, "rewards"], [139, "rewards"], [140, "rewards"], [141, "rewards"], [142, "rewards"], [143, "rewards"], [144, "rewards"], [145, "rewards"], [146, "rewards"], [147, "rewards"], [150, "rewards"], [152, "rewards"], [153, "rewards"]], "Riverraid": [[97, null]], "Road Runner": [[98, null]], "Robotank": [[99, null]], "Robotics environments": [[148, "robotics-environments"], [148, "id1"]], "Running the environment": [[173, "running-the-environment"]], "Runtime Performance benchmark": [[9, "runtime-performance-benchmark"]], "Save Rendering Videos": [[9, "save-rendering-videos"]], "Saving/ Loading Weights": [[170, "saving-loading-weights"]], "Seaquest": [[100, null]], "Seed and random number generator": [[160, "seed-and-random-number-generator"]], "Seeding": [[9, "seeding"]], "Setup": [[165, "setup"], [169, "setup"], [170, "setup"]], "Shared Memory for a Space": [[13, "shared-memory-for-a-space"]], "Showcase the Agent": [[170, "showcase-the-agent"]], "Sir Lancelot": [[101, null]], "Skiing": [[102, null]], "Solaris": [[103, null]], "Solution": [[166, "solution"]], "Solving Blackjack with Q-Learning": [[175, null]], "Space Invaders": [[104, null]], "Space War": [[105, null]], "Spaces": [[5, null]], "Spaces Utils": [[8, null]], "Speeding Up Training": [[162, null]], "Star Gunner": [[106, null]], "Starting State": [[127, "starting-state"], [128, "starting-state"], [129, "starting-state"], [131, "starting-state"], [132, "starting-state"], [133, "starting-state"], [134, "starting-state"], [135, "starting-state"], [137, "starting-state"], [138, "starting-state"], [139, "starting-state"], [140, "starting-state"], [141, "starting-state"], [142, "starting-state"], [143, "starting-state"], [144, "starting-state"], [145, "starting-state"], [146, "starting-state"], [147, "starting-state"], [150, "starting-state"], [151, "starting-state"], [152, "starting-state"], [153, "starting-state"]], "Step": [[165, "step"]], "Step 0.1 - Download a Robot Model": [[169, "step-0-1-download-a-robot-model"]], "Step 1 - Load the model": [[169, "step-1-load-the-model"]], "Step 2 - Tweaking the Environment Parameters": [[169, "step-2-tweaking-the-environment-parameters"]], "Step 2.1 - Tweaking the Environment Simulation Parameters": [[169, "step-2-1-tweaking-the-environment-simulation-parameters"]], "Step 2.2 - Tweaking the Environment Termination Parameters": [[169, "step-2-2-tweaking-the-environment-termination-parameters"]], "Step 2.3 - Tweaking the Environment Reward Parameters": [[169, "step-2-3-tweaking-the-environment-reward-parameters"]], "Step 2.4 - Tweaking the Environment Observation Parameters": [[169, "step-2-4-tweaking-the-environment-observation-parameters"]], "Step 3 - Train your Agent": [[169, "step-3-train-your-agent"]], "Step API Compatibility": [[159, "step-api-compatibility"]], "Step function": [[158, "step-function"]], "Subclassing gymnasium.Env": [[165, "subclassing-gymnasium-env"]], "Superman": [[107, null]], "Surround": [[108, null]], "Swimmer": [[146, null]], "SyncVectorEnv": [[12, null]], "Taxi": [[153, null]], "Telecommunication Systems environments": [[148, "telecommunication-systems-environments"]], "Tennis": [[109, null]], "Termination": [[137, "termination"], [138, "termination"], [139, "termination"], [140, "termination"], [141, "termination"], [142, "termination"], [143, "termination"], [144, "termination"], [145, "termination"], [146, "termination"], [147, "termination"], [166, "termination"]], "Tetris": [[110, null]], "The FrozenLake environment": [[173, "the-frozenlake-environment"]], "The requested page could not be found.": [[0, "the-requested-page-could-not-be-found"]], "Think you can do better?": [[175, "think-you-can-do-better"]], "Third-Party Environments using Gym": [[148, "third-party-environments-using-gym"]], "Third-Party Tutorials": [[172, null]], "Third-Party tutorials about gymnasium": [[172, "third-party-tutorials-about-gymnasium"]], "Third-Party tutorials about libraries that use gymnasium": [[172, "third-party-tutorials-about-libraries-that-use-gymnasium"]], "Third-party environments with Gymnasium": [[148, "third-party-environments-with-gymnasium"]], "Tic Tac Toe 3D": [[111, null]], "Time Pilot": [[112, null]], "TimeLimit Wrapper": [[160, "timelimit-wrapper"]], "Toy Text": [[149, null]], "Training A2C with Vector Envs and Domain Randomization": [[170, null]], "Training Agents": [[171, "training-agents"], [174, null], [176, null]], "Training an Agent": [[163, null]], "Training the A2C Agent": [[170, "training-the-a2c-agent"]], "Training the agent": [[163, "training-the-agent"]], "Training using REINFORCE for Mujoco": [[177, null]], "Transition Dynamics:": [[133, "transition-dynamics"], [134, "transition-dynamics"]], "Trondead": [[113, null]], "Truncation": [[137, "truncation"], [138, "truncation"], [139, "truncation"], [140, "truncation"], [141, "truncation"], [142, "truncation"], [143, "truncation"], [144, "truncation"], [145, "truncation"], [146, "truncation"], [147, "truncation"], [166, "truncation"]], "Try playing the environment yourself": [[170, "try-playing-the-environment-yourself"]], "Turmoil": [[114, null]], "Tutankham": [[115, null]], "Tutorials": [[171, null]], "Uncommon Wrappers": [[17, "uncommon-wrappers"]], "Up N Down": [[116, null]], "Using Vectorized Environments": [[170, "using-vectorized-environments"]], "Using Wrappers": [[158, "using-wrappers"], [165, "using-wrappers"]], "Utility functions": [[5, "utility-functions"], [9, null], [13, null]], "Vector Only wrappers": [[14, "vector-only-wrappers"]], "Vector Utility functions": [[5, "vector-utility-functions"]], "Vector only Wrappers": [[20, "vector-only-wrappers"]], "Vectorize": [[10, null]], "Vectorized Common wrappers": [[14, "vectorized-common-wrappers"]], "Vectorized environment": [[132, "vectorized-environment"]], "Vectorized environments": [[162, "vectorized-environments"]], "Vectorizing Spaces": [[13, "vectorizing-spaces"]], "Venture": [[117, null]], "Version History": [[127, "version-history"], [128, "version-history"], [129, "version-history"], [131, "version-history"], [132, "version-history"], [133, "version-history"], [134, "version-history"], [135, "version-history"], [137, "version-history"], [138, "version-history"], [139, "version-history"], [140, "version-history"], [141, "version-history"], [142, "version-history"], [143, "version-history"], [144, "version-history"], [145, "version-history"], [146, "version-history"], [147, "version-history"], [150, "version-history"], [151, "version-history"], [152, "version-history"], [153, "version-history"]], "Versions": [[136, "versions"]], "Video Checkers": [[118, null]], "Video Chess": [[119, null]], "Video Cube": [[120, null]], "Video Game environments": [[148, "video-game-environments"]], "Video Pinball": [[121, null]], "Visualising the policy": [[163, "visualising-the-policy"], [175, "visualising-the-policy"]], "Visualization": [[9, "visualization"], [173, "visualization"]], "Visualizing the training": [[175, "visualizing-the-training"]], "Walker2D": [[147, null]], "Wizard Of Wor": [[122, null]], "Word Zapper": [[123, null]], "Wrappers": [[14, null], [15, null]], "Writing Tutorials": [[1, "writing-tutorials"]], "Yars Revenge": [[124, null]], "Zaxxon": [[125, null]], "v0.20.0": [[154, "release-v0-20-0"]], "v0.21.0": [[154, "release-v0-21-0"]], "v0.26.0: Initial Release": [[155, "release-v0-26-0"]], "v0.26.1": [[155, "release-v0-26-1"]], "v0.26.2 ": [[155, "release-v0-26-2"]], "v0.26.3": [[155, "release-v0-26-3"]], "v0.27.0": [[155, "release-v0-27-0"]], "v0.27.1": [[155, "release-v0-27-1"]], "v0.28.0": [[155, "release-v0-28-0"]], "v0.28.1": [[155, "release-v0-28-1"]], "v0.29.0": [[155, "release-v0-29-0"]], "v0.29.1 ": [[155, "release-v0-29-1"]], "v0.7.3": [[154, "release-v0-7-3"]], "v0.7.4": [[154, "release-v0-7-4"]], "v0.9.5": [[154, "release-v0-9-5"]], "v0.9.6": [[154, "release-v0-9-6"]], "v1.0.0a1: v1.0.0 alpha1 ": [[155, "release-v1-0-0a1"]], "v1.0.0a2: v1.0.0 alpha 2": [[155, "release-v1-0-0a2"]]}, "docnames": ["404", "README", "api/env", "api/functional", "api/registry", "api/spaces", "api/spaces/composite", "api/spaces/fundamental", "api/spaces/utils", "api/utils", "api/vector", "api/vector/async_vector_env", "api/vector/sync_vector_env", "api/vector/utils", "api/vector/wrappers", "api/wrappers", "api/wrappers/action_wrappers", "api/wrappers/misc_wrappers", "api/wrappers/observation_wrappers", "api/wrappers/reward_wrappers", "api/wrappers/table", "environments/atari", "environments/atari/adventure", "environments/atari/air_raid", "environments/atari/alien", "environments/atari/amidar", "environments/atari/assault", "environments/atari/asterix", "environments/atari/asteroids", "environments/atari/atlantis", "environments/atari/atlantis2", "environments/atari/backgammon", "environments/atari/bank_heist", "environments/atari/basic_math", "environments/atari/battle_zone", "environments/atari/beam_rider", "environments/atari/berzerk", "environments/atari/blackjack", "environments/atari/bowling", "environments/atari/boxing", "environments/atari/breakout", "environments/atari/carnival", "environments/atari/casino", "environments/atari/centipede", "environments/atari/chopper_command", "environments/atari/crazy_climber", "environments/atari/crossbow", "environments/atari/darkchambers", "environments/atari/defender", "environments/atari/demon_attack", "environments/atari/donkey_kong", "environments/atari/double_dunk", "environments/atari/earthworld", "environments/atari/elevator_action", "environments/atari/enduro", "environments/atari/entombed", "environments/atari/et", "environments/atari/fishing_derby", "environments/atari/flag_capture", "environments/atari/freeway", "environments/atari/frogger", "environments/atari/frostbite", "environments/atari/galaxian", "environments/atari/gopher", "environments/atari/gravitar", "environments/atari/hangman", "environments/atari/haunted_house", "environments/atari/hero", "environments/atari/human_cannonball", "environments/atari/ice_hockey", "environments/atari/jamesbond", "environments/atari/journey_escape", "environments/atari/kaboom", "environments/atari/kangaroo", "environments/atari/keystone_kapers", "environments/atari/king_kong", "environments/atari/klax", "environments/atari/koolaid", "environments/atari/krull", "environments/atari/kung_fu_master", "environments/atari/laser_gates", "environments/atari/lost_luggage", "environments/atari/mario_bros", "environments/atari/miniature_golf", "environments/atari/montezuma_revenge", "environments/atari/mr_do", "environments/atari/ms_pacman", "environments/atari/name_this_game", "environments/atari/othello", "environments/atari/pacman", "environments/atari/phoenix", "environments/atari/pitfall", "environments/atari/pitfall2", "environments/atari/pong", "environments/atari/pooyan", "environments/atari/private_eye", "environments/atari/qbert", "environments/atari/riverraid", "environments/atari/road_runner", "environments/atari/robotank", "environments/atari/seaquest", "environments/atari/sir_lancelot", "environments/atari/skiing", "environments/atari/solaris", "environments/atari/space_invaders", "environments/atari/space_war", "environments/atari/star_gunner", "environments/atari/superman", "environments/atari/surround", "environments/atari/tennis", "environments/atari/tetris", "environments/atari/tic_tac_toe_3d", "environments/atari/time_pilot", "environments/atari/trondead", "environments/atari/turmoil", "environments/atari/tutankham", "environments/atari/up_n_down", "environments/atari/venture", "environments/atari/video_checkers", "environments/atari/video_chess", "environments/atari/video_cube", "environments/atari/video_pinball", "environments/atari/wizard_of_wor", "environments/atari/word_zapper", "environments/atari/yars_revenge", "environments/atari/zaxxon", "environments/box2d", "environments/box2d/bipedal_walker", "environments/box2d/car_racing", "environments/box2d/lunar_lander", "environments/classic_control", "environments/classic_control/acrobot", "environments/classic_control/cart_pole", "environments/classic_control/mountain_car", "environments/classic_control/mountain_car_continuous", "environments/classic_control/pendulum", "environments/mujoco", "environments/mujoco/ant", "environments/mujoco/half_cheetah", "environments/mujoco/hopper", "environments/mujoco/humanoid", "environments/mujoco/humanoid_standup", "environments/mujoco/inverted_double_pendulum", "environments/mujoco/inverted_pendulum", "environments/mujoco/pusher", "environments/mujoco/reacher", "environments/mujoco/swimmer", "environments/mujoco/walker2d", "environments/third_party_environments", "environments/toy_text", "environments/toy_text/blackjack", "environments/toy_text/cliff_walking", "environments/toy_text/frozen_lake", "environments/toy_text/taxi", "gym_release_notes/index", "gymnasium_release_notes/index", "index", "introduction/basic_usage", "introduction/create_custom_env", "introduction/gym_compatibility", "introduction/migration_guide", "introduction/record_agent", "introduction/speed_up_env", "introduction/train_agent", "tutorials/gymnasium_basics/README", "tutorials/gymnasium_basics/environment_creation", "tutorials/gymnasium_basics/handling_time_limits", "tutorials/gymnasium_basics/implementing_custom_wrappers", "tutorials/gymnasium_basics/index", "tutorials/gymnasium_basics/load_quadruped_model", "tutorials/gymnasium_basics/vector_envs_tutorial", "tutorials/index", "tutorials/third-party-tutorials", "tutorials/training_agents/FrozenLake_tuto", "tutorials/training_agents/README", "tutorials/training_agents/blackjack_tutorial", "tutorials/training_agents/index", "tutorials/training_agents/reinforce_invpend_gym_v26"], "envversion": {"sphinx": 62, "sphinx.domains.c": 3, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 9, "sphinx.domains.index": 1, "sphinx.domains.javascript": 3, "sphinx.domains.math": 2, "sphinx.domains.python": 4, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx.ext.viewcode": 1}, "filenames": ["404.md", "README.md", "api/env.md", "api/functional.md", "api/registry.md", "api/spaces.md", "api/spaces/composite.md", "api/spaces/fundamental.md", "api/spaces/utils.md", "api/utils.md", "api/vector.md", "api/vector/async_vector_env.md", "api/vector/sync_vector_env.md", "api/vector/utils.md", "api/vector/wrappers.md", "api/wrappers.md", "api/wrappers/action_wrappers.md", "api/wrappers/misc_wrappers.md", "api/wrappers/observation_wrappers.md", "api/wrappers/reward_wrappers.md", "api/wrappers/table.md", "environments/atari.md", "environments/atari/adventure.md", "environments/atari/air_raid.md", "environments/atari/alien.md", "environments/atari/amidar.md", "environments/atari/assault.md", "environments/atari/asterix.md", "environments/atari/asteroids.md", "environments/atari/atlantis.md", "environments/atari/atlantis2.md", "environments/atari/backgammon.md", "environments/atari/bank_heist.md", "environments/atari/basic_math.md", "environments/atari/battle_zone.md", "environments/atari/beam_rider.md", "environments/atari/berzerk.md", "environments/atari/blackjack.md", "environments/atari/bowling.md", "environments/atari/boxing.md", "environments/atari/breakout.md", "environments/atari/carnival.md", "environments/atari/casino.md", "environments/atari/centipede.md", "environments/atari/chopper_command.md", "environments/atari/crazy_climber.md", "environments/atari/crossbow.md", "environments/atari/darkchambers.md", "environments/atari/defender.md", "environments/atari/demon_attack.md", "environments/atari/donkey_kong.md", "environments/atari/double_dunk.md", "environments/atari/earthworld.md", "environments/atari/elevator_action.md", "environments/atari/enduro.md", "environments/atari/entombed.md", "environments/atari/et.md", "environments/atari/fishing_derby.md", "environments/atari/flag_capture.md", "environments/atari/freeway.md", "environments/atari/frogger.md", "environments/atari/frostbite.md", "environments/atari/galaxian.md", "environments/atari/gopher.md", "environments/atari/gravitar.md", "environments/atari/hangman.md", "environments/atari/haunted_house.md", "environments/atari/hero.md", "environments/atari/human_cannonball.md", "environments/atari/ice_hockey.md", "environments/atari/jamesbond.md", "environments/atari/journey_escape.md", "environments/atari/kaboom.md", "environments/atari/kangaroo.md", "environments/atari/keystone_kapers.md", "environments/atari/king_kong.md", "environments/atari/klax.md", "environments/atari/koolaid.md", "environments/atari/krull.md", "environments/atari/kung_fu_master.md", "environments/atari/laser_gates.md", "environments/atari/lost_luggage.md", "environments/atari/mario_bros.md", "environments/atari/miniature_golf.md", "environments/atari/montezuma_revenge.md", "environments/atari/mr_do.md", "environments/atari/ms_pacman.md", "environments/atari/name_this_game.md", "environments/atari/othello.md", "environments/atari/pacman.md", "environments/atari/phoenix.md", "environments/atari/pitfall.md", "environments/atari/pitfall2.md", "environments/atari/pong.md", "environments/atari/pooyan.md", "environments/atari/private_eye.md", "environments/atari/qbert.md", "environments/atari/riverraid.md", "environments/atari/road_runner.md", "environments/atari/robotank.md", "environments/atari/seaquest.md", "environments/atari/sir_lancelot.md", "environments/atari/skiing.md", "environments/atari/solaris.md", "environments/atari/space_invaders.md", "environments/atari/space_war.md", "environments/atari/star_gunner.md", "environments/atari/superman.md", "environments/atari/surround.md", "environments/atari/tennis.md", "environments/atari/tetris.md", "environments/atari/tic_tac_toe_3d.md", "environments/atari/time_pilot.md", "environments/atari/trondead.md", "environments/atari/turmoil.md", "environments/atari/tutankham.md", "environments/atari/up_n_down.md", "environments/atari/venture.md", "environments/atari/video_checkers.md", "environments/atari/video_chess.md", "environments/atari/video_cube.md", "environments/atari/video_pinball.md", "environments/atari/wizard_of_wor.md", "environments/atari/word_zapper.md", "environments/atari/yars_revenge.md", "environments/atari/zaxxon.md", "environments/box2d.md", "environments/box2d/bipedal_walker.md", "environments/box2d/car_racing.md", "environments/box2d/lunar_lander.md", "environments/classic_control.md", "environments/classic_control/acrobot.md", "environments/classic_control/cart_pole.md", "environments/classic_control/mountain_car.md", "environments/classic_control/mountain_car_continuous.md", "environments/classic_control/pendulum.md", "environments/mujoco.md", "environments/mujoco/ant.md", "environments/mujoco/half_cheetah.md", "environments/mujoco/hopper.md", "environments/mujoco/humanoid.md", "environments/mujoco/humanoid_standup.md", "environments/mujoco/inverted_double_pendulum.md", "environments/mujoco/inverted_pendulum.md", "environments/mujoco/pusher.md", "environments/mujoco/reacher.md", "environments/mujoco/swimmer.md", "environments/mujoco/walker2d.md", "environments/third_party_environments.md", "environments/toy_text.md", "environments/toy_text/blackjack.md", "environments/toy_text/cliff_walking.md", "environments/toy_text/frozen_lake.md", "environments/toy_text/taxi.md", "gym_release_notes/index.md", "gymnasium_release_notes/index.md", "index.md", "introduction/basic_usage.md", "introduction/create_custom_env.md", "introduction/gym_compatibility.md", "introduction/migration_guide.md", "introduction/record_agent.md", "introduction/speed_up_env.md", "introduction/train_agent.md", "tutorials/gymnasium_basics/README.rst", "tutorials/gymnasium_basics/environment_creation.rst", "tutorials/gymnasium_basics/handling_time_limits.rst", "tutorials/gymnasium_basics/implementing_custom_wrappers.rst", "tutorials/gymnasium_basics/index.rst", "tutorials/gymnasium_basics/load_quadruped_model.md", "tutorials/gymnasium_basics/vector_envs_tutorial.rst", "tutorials/index.rst", "tutorials/third-party-tutorials.md", "tutorials/training_agents/FrozenLake_tuto.rst", "tutorials/training_agents/README.rst", "tutorials/training_agents/blackjack_tutorial.rst", "tutorials/training_agents/index.rst", "tutorials/training_agents/reinforce_invpend_gym_v26.rst"], "indexentries": {"action() (gymnasium.actionwrapper method)": [[16, "gymnasium.ActionWrapper.action", false]], "action_space (gymnasium.env attribute)": [[2, "gymnasium.Env.action_space", false]], "action_space (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.action_space", false]], "action_space (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.action_space", false]], "actions() (gymnasium.vector.vectoractionwrapper method)": [[14, "gymnasium.vector.VectorActionWrapper.actions", false]], "actionwrapper (class in gymnasium)": [[16, "gymnasium.ActionWrapper", false]], "addrenderobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.AddRenderObservation", false]], "asyncvectorenv (class in gymnasium.vector)": [[11, "gymnasium.vector.AsyncVectorEnv", false]], "ataripreprocessing (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.AtariPreprocessing", false]], "autoreset (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.Autoreset", false]], "batch_space() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.batch_space", false]], "benchmark_init() (in module gymnasium.utils.performance)": [[9, "gymnasium.utils.performance.benchmark_init", false]], "benchmark_render() (in module gymnasium.utils.performance)": [[9, "gymnasium.utils.performance.benchmark_render", false]], "benchmark_step() (in module gymnasium.utils.performance)": [[9, "gymnasium.utils.performance.benchmark_step", false]], "box (class in gymnasium.spaces)": [[7, "gymnasium.spaces.Box", false]], "call() (gymnasium.vector.asyncvectorenv method)": [[11, "gymnasium.vector.AsyncVectorEnv.call", false]], "call() (gymnasium.vector.syncvectorenv method)": [[12, "gymnasium.vector.SyncVectorEnv.call", false]], "callback() (gymnasium.utils.play.playplot method)": [[9, "gymnasium.utils.play.PlayPlot.callback", false]], "capped_cubic_video_schedule() (in module gymnasium.utils.save_video)": [[9, "gymnasium.utils.save_video.capped_cubic_video_schedule", false]], "check_env() (in module gymnasium.utils.env_checker)": [[9, "gymnasium.utils.env_checker.check_env", false]], "clear_mpi_env_vars() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.clear_mpi_env_vars", false]], "clipaction (class in gymnasium.wrappers)": [[16, "gymnasium.wrappers.ClipAction", false]], "clipaction (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.ClipAction", false]], "clipreward (class in gymnasium.wrappers)": [[19, "gymnasium.wrappers.ClipReward", false]], "clipreward (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.ClipReward", false]], "close() (gymnasium.env method)": [[2, "gymnasium.Env.close", false]], "close() (gymnasium.vector.asyncvectorenv method)": [[11, "gymnasium.vector.AsyncVectorEnv.close", false]], "close() (gymnasium.vector.syncvectorenv method)": [[12, "gymnasium.vector.SyncVectorEnv.close", false]], "close() (gymnasium.vector.vectorenv method)": [[10, "gymnasium.vector.VectorEnv.close", false]], "close() (gymnasium.vector.vectorwrapper method)": [[14, "gymnasium.vector.VectorWrapper.close", false]], "close() (gymnasium.wrapper method)": [[15, "gymnasium.Wrapper.close", false]], "closed (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.closed", false]], "cloudpicklewrapper() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.CloudpickleWrapper", false]], "concatenate() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.concatenate", false]], "contains() (gymnasium.spaces.space method)": [[5, "gymnasium.spaces.Space.contains", false]], "convert_to_done_step_api() (in module gymnasium.utils.step_api_compatibility)": [[9, "gymnasium.utils.step_api_compatibility.convert_to_done_step_api", false]], "convert_to_terminated_truncated_step_api() (in module gymnasium.utils.step_api_compatibility)": [[9, "gymnasium.utils.step_api_compatibility.convert_to_terminated_truncated_step_api", false]], "create_empty_array() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.create_empty_array", false]], "create_shared_memory() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.create_shared_memory", false]], "current_namespace (gymnasium.envs.registration attribute)": [[4, "gymnasium.envs.registration.current_namespace", false]], "delayobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.DelayObservation", false]], "dict (class in gymnasium.spaces)": [[6, "gymnasium.spaces.Dict", false]], "dictinfotolist (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.DictInfoToList", false]], "discrete (class in gymnasium.spaces)": [[7, "gymnasium.spaces.Discrete", false]], "dtype (gymnasium.spaces.space property)": [[5, "gymnasium.spaces.Space.dtype", false]], "dtypeobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.DtypeObservation", false]], "dtypeobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.DtypeObservation", false]], "env (class in gymnasium)": [[2, "gymnasium.Env", false]], "env (gymnasium.wrapper attribute)": [[15, "gymnasium.Wrapper.env", false]], "envspec (class in gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.EnvSpec", false]], "ezpickle (class in gymnasium.utils.ezpickle)": [[9, "gymnasium.utils.ezpickle.EzPickle", false]], "filterobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.FilterObservation", false]], "filterobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.FilterObservation", false]], "find_highest_version() (in module gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.find_highest_version", false]], "flatdim() (in module gymnasium.spaces.utils)": [[8, "gymnasium.spaces.utils.flatdim", false]], "flatten() (in module gymnasium.spaces.utils)": [[8, "gymnasium.spaces.utils.flatten", false]], "flatten_space() (in module gymnasium.spaces.utils)": [[8, "gymnasium.spaces.utils.flatten_space", false]], "flattenobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.FlattenObservation", false]], "flattenobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.FlattenObservation", false]], "framestackobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.FrameStackObservation", false]], "from_jsonable() (gymnasium.spaces.space method)": [[5, "gymnasium.spaces.Space.from_jsonable", false]], "funcenv (class in gymnasium.experimental.functional)": [[3, "gymnasium.experimental.functional.FuncEnv", false]], "functionaljaxenv (class in gymnasium.envs.functional_jax_env)": [[3, "gymnasium.envs.functional_jax_env.FunctionalJaxEnv", false]], "get_attr() (gymnasium.vector.asyncvectorenv method)": [[11, "gymnasium.vector.AsyncVectorEnv.get_attr", false]], "get_attr() (gymnasium.vector.syncvectorenv method)": [[12, "gymnasium.vector.SyncVectorEnv.get_attr", false]], "get_env_id() (in module gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.get_env_id", false]], "get_wrapper_attr() (gymnasium.wrapper method)": [[15, "gymnasium.Wrapper.get_wrapper_attr", false]], "graph (class in gymnasium.spaces)": [[6, "gymnasium.spaces.Graph", false]], "grayscaleobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.GrayscaleObservation", false]], "grayscaleobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.GrayscaleObservation", false]], "gymnasium.spaces": [[5, "module-gymnasium.spaces", false]], "gymnasium.wrappers": [[15, "module-gymnasium.wrappers", false]], "humanrendering (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.HumanRendering", false]], "initial() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.initial", false]], "is_bounded() (gymnasium.spaces.box method)": [[7, "gymnasium.spaces.Box.is_bounded", false]], "is_np_flattenable (gymnasium.spaces.space property)": [[5, "gymnasium.spaces.Space.is_np_flattenable", false]], "iterate() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.iterate", false]], "jaxtonumpy (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.JaxToNumpy", false]], "jaxtonumpy (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.JaxToNumpy", false]], "jaxtotorch (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.JaxToTorch", false]], "jaxtotorch (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.JaxToTorch", false]], "load_env_creator() (in module gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.load_env_creator", false]], "make() (in module gymnasium)": [[4, "gymnasium.make", false]], "make_vec() (in module gymnasium)": [[4, "gymnasium.make_vec", false]], "maxandskipobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.MaxAndSkipObservation", false]], "metadata (gymnasium.env attribute)": [[2, "gymnasium.Env.metadata", false]], "metadata (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.metadata", false]], "metadata (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.metadata", false]], "module": [[5, "module-gymnasium.spaces", false], [15, "module-gymnasium.wrappers", false]], "multibinary (class in gymnasium.spaces)": [[7, "gymnasium.spaces.MultiBinary", false]], "multidiscrete (class in gymnasium.spaces)": [[7, "gymnasium.spaces.MultiDiscrete", false]], "namespace() (in module gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.namespace", false]], "normalizeobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.NormalizeObservation", false]], "normalizeobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.NormalizeObservation", false]], "normalizereward (class in gymnasium.wrappers)": [[19, "gymnasium.wrappers.NormalizeReward", false]], "normalizereward (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.NormalizeReward", false]], "np_random (gymnasium.env property)": [[2, "gymnasium.Env.np_random", false]], "np_random (gymnasium.spaces.space property)": [[5, "gymnasium.spaces.Space.np_random", false]], "np_random (gymnasium.vector.asyncvectorenv property)": [[11, "gymnasium.vector.AsyncVectorEnv.np_random", false]], "np_random (gymnasium.vector.syncvectorenv property)": [[12, "gymnasium.vector.SyncVectorEnv.np_random", false]], "np_random (gymnasium.vector.vectorenv property)": [[10, "gymnasium.vector.VectorEnv.np_random", false]], "np_random (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.np_random", false]], "np_random() (in module gymnasium.utils.seeding)": [[9, "gymnasium.utils.seeding.np_random", false]], "np_random_seed (gymnasium.env property)": [[2, "gymnasium.Env.np_random_seed", false]], "np_random_seed (gymnasium.vector.asyncvectorenv property)": [[11, "gymnasium.vector.AsyncVectorEnv.np_random_seed", false]], "np_random_seed (gymnasium.vector.syncvectorenv property)": [[12, "gymnasium.vector.SyncVectorEnv.np_random_seed", false]], "np_random_seed (gymnasium.vector.vectorenv property)": [[10, "gymnasium.vector.VectorEnv.np_random_seed", false]], "np_random_seed (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.np_random_seed", false]], "num_envs (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.num_envs", false]], "numpytotorch (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.NumpyToTorch", false]], "numpytotorch (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.NumpyToTorch", false]], "observation() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.observation", false]], "observation() (gymnasium.observationwrapper method)": [[18, "gymnasium.ObservationWrapper.observation", false]], "observation_space (gymnasium.env attribute)": [[2, "gymnasium.Env.observation_space", false]], "observation_space (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.observation_space", false]], "observation_space (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.observation_space", false]], "observations() (gymnasium.vector.vectorobservationwrapper method)": [[14, "gymnasium.vector.VectorObservationWrapper.observations", false]], "observationwrapper (class in gymnasium)": [[18, "gymnasium.ObservationWrapper", false]], "oneof (class in gymnasium.spaces)": [[6, "gymnasium.spaces.OneOf", false]], "orderenforcing (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.OrderEnforcing", false]], "parse_env_id() (in module gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.parse_env_id", false]], "passiveenvchecker (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.PassiveEnvChecker", false]], "play() (in module gymnasium.utils.play)": [[9, "gymnasium.utils.play.play", false]], "playablegame (class in gymnasium.utils.play)": [[9, "gymnasium.utils.play.PlayableGame", false]], "playplot (class in gymnasium.utils.play)": [[9, "gymnasium.utils.play.PlayPlot", false]], "pprint_registry() (in module gymnasium)": [[4, "gymnasium.pprint_registry", false]], "process_event() (gymnasium.utils.play.playablegame method)": [[9, "gymnasium.utils.play.PlayableGame.process_event", false]], "read_from_shared_memory() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.read_from_shared_memory", false]], "recordepisodestatistics (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.RecordEpisodeStatistics", false]], "recordepisodestatistics (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.RecordEpisodeStatistics", false]], "recordvideo (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.RecordVideo", false]], "register() (in module gymnasium)": [[4, "gymnasium.register", false]], "registry (gymnasium.envs.registration attribute)": [[4, "gymnasium.envs.registration.registry", false]], "render() (gymnasium.env method)": [[2, "gymnasium.Env.render", false]], "render() (gymnasium.envs.functional_jax_env.functionaljaxenv method)": [[3, "gymnasium.envs.functional_jax_env.FunctionalJaxEnv.render", false]], "render() (gymnasium.vector.vectorenv method)": [[10, "gymnasium.vector.VectorEnv.render", false]], "render() (gymnasium.vector.vectorwrapper method)": [[14, "gymnasium.vector.VectorWrapper.render", false]], "render() (gymnasium.wrapper method)": [[15, "gymnasium.Wrapper.render", false]], "render_close() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.render_close", false]], "render_image() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.render_image", false]], "render_init() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.render_init", false]], "render_mode (gymnasium.env attribute)": [[2, "gymnasium.Env.render_mode", false]], "render_mode (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.render_mode", false]], "rendercollection (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.RenderCollection", false]], "rescaleaction (class in gymnasium.wrappers)": [[16, "gymnasium.wrappers.RescaleAction", false]], "rescaleaction (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.RescaleAction", false]], "rescaleobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.RescaleObservation", false]], "rescaleobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.RescaleObservation", false]], "reset() (gymnasium.env method)": [[2, "gymnasium.Env.reset", false]], "reset() (gymnasium.envs.functional_jax_env.functionaljaxenv method)": [[3, "gymnasium.envs.functional_jax_env.FunctionalJaxEnv.reset", false]], "reset() (gymnasium.vector.asyncvectorenv method)": [[11, "gymnasium.vector.AsyncVectorEnv.reset", false]], "reset() (gymnasium.vector.syncvectorenv method)": [[12, "gymnasium.vector.SyncVectorEnv.reset", false]], "reset() (gymnasium.vector.vectorenv method)": [[10, "gymnasium.vector.VectorEnv.reset", false]], "reset() (gymnasium.vector.vectorwrapper method)": [[14, "gymnasium.vector.VectorWrapper.reset", false]], "reset() (gymnasium.wrapper method)": [[15, "gymnasium.Wrapper.reset", false]], "reshapeobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.ReshapeObservation", false]], "reshapeobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.ReshapeObservation", false]], "resizeobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.ResizeObservation", false]], "resizeobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.ResizeObservation", false]], "reward() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.reward", false]], "reward() (gymnasium.rewardwrapper method)": [[19, "gymnasium.RewardWrapper.reward", false]], "rewards() (gymnasium.vector.vectorrewardwrapper method)": [[14, "gymnasium.vector.VectorRewardWrapper.rewards", false]], "rewardwrapper (class in gymnasium)": [[19, "gymnasium.RewardWrapper", false]], "sample() (gymnasium.spaces.box method)": [[7, "gymnasium.spaces.Box.sample", false]], "sample() (gymnasium.spaces.dict method)": [[6, "gymnasium.spaces.Dict.sample", false]], "sample() (gymnasium.spaces.discrete method)": [[7, "gymnasium.spaces.Discrete.sample", false]], "sample() (gymnasium.spaces.graph method)": [[6, "gymnasium.spaces.Graph.sample", false]], "sample() (gymnasium.spaces.multibinary method)": [[7, "gymnasium.spaces.MultiBinary.sample", false]], "sample() (gymnasium.spaces.multidiscrete method)": [[7, "gymnasium.spaces.MultiDiscrete.sample", false]], "sample() (gymnasium.spaces.oneof method)": [[6, "gymnasium.spaces.OneOf.sample", false]], "sample() (gymnasium.spaces.sequence method)": [[6, "gymnasium.spaces.Sequence.sample", false]], "sample() (gymnasium.spaces.space method)": [[5, "gymnasium.spaces.Space.sample", false]], "sample() (gymnasium.spaces.text method)": [[7, "gymnasium.spaces.Text.sample", false]], "sample() (gymnasium.spaces.tuple method)": [[6, "gymnasium.spaces.Tuple.sample", false]], "save_video() (in module gymnasium.utils.save_video)": [[9, "gymnasium.utils.save_video.save_video", false]], "seed() (gymnasium.spaces.box method)": [[7, "gymnasium.spaces.Box.seed", false]], "seed() (gymnasium.spaces.dict method)": [[6, "gymnasium.spaces.Dict.seed", false]], "seed() (gymnasium.spaces.discrete method)": [[7, "gymnasium.spaces.Discrete.seed", false]], "seed() (gymnasium.spaces.graph method)": [[6, "gymnasium.spaces.Graph.seed", false]], "seed() (gymnasium.spaces.multibinary method)": [[7, "gymnasium.spaces.MultiBinary.seed", false]], "seed() (gymnasium.spaces.multidiscrete method)": [[7, "gymnasium.spaces.MultiDiscrete.seed", false]], "seed() (gymnasium.spaces.oneof method)": [[6, "gymnasium.spaces.OneOf.seed", false]], "seed() (gymnasium.spaces.sequence method)": [[6, "gymnasium.spaces.Sequence.seed", false]], "seed() (gymnasium.spaces.space method)": [[5, "gymnasium.spaces.Space.seed", false]], "seed() (gymnasium.spaces.text method)": [[7, "gymnasium.spaces.Text.seed", false]], "seed() (gymnasium.spaces.tuple method)": [[6, "gymnasium.spaces.Tuple.seed", false]], "sequence (class in gymnasium.spaces)": [[6, "gymnasium.spaces.Sequence", false]], "set_attr() (gymnasium.vector.asyncvectorenv method)": [[11, "gymnasium.vector.AsyncVectorEnv.set_attr", false]], "set_attr() (gymnasium.vector.syncvectorenv method)": [[12, "gymnasium.vector.SyncVectorEnv.set_attr", false]], "set_wrapper_attr() (gymnasium.wrapper method)": [[15, "gymnasium.Wrapper.set_wrapper_attr", false]], "shape (gymnasium.spaces.space property)": [[5, "gymnasium.spaces.Space.shape", false]], "single_action_space (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.single_action_space", false]], "single_observation_space (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.single_observation_space", false]], "space (class in gymnasium.spaces)": [[5, "gymnasium.spaces.Space", false]], "spec (gymnasium.env attribute)": [[2, "gymnasium.Env.spec", false]], "spec (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.spec", false]], "spec (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.spec", false]], "spec() (in module gymnasium)": [[4, "gymnasium.spec", false]], "state_info() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.state_info", false]], "step() (gymnasium.env method)": [[2, "gymnasium.Env.step", false]], "step() (gymnasium.envs.functional_jax_env.functionaljaxenv method)": [[3, "gymnasium.envs.functional_jax_env.FunctionalJaxEnv.step", false]], "step() (gymnasium.vector.asyncvectorenv method)": [[11, "gymnasium.vector.AsyncVectorEnv.step", false]], "step() (gymnasium.vector.syncvectorenv method)": [[12, "gymnasium.vector.SyncVectorEnv.step", false]], "step() (gymnasium.vector.vectorenv method)": [[10, "gymnasium.vector.VectorEnv.step", false]], "step() (gymnasium.vector.vectorwrapper method)": [[14, "gymnasium.vector.VectorWrapper.step", false]], "step() (gymnasium.wrapper method)": [[15, "gymnasium.Wrapper.step", false]], "step_api_compatibility() (in module gymnasium.utils.step_api_compatibility)": [[9, "gymnasium.utils.step_api_compatibility.step_api_compatibility", false]], "stickyaction (class in gymnasium.wrappers)": [[16, "gymnasium.wrappers.StickyAction", false]], "syncvectorenv (class in gymnasium.vector)": [[12, "gymnasium.vector.SyncVectorEnv", false]], "terminal() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.terminal", false]], "text (class in gymnasium.spaces)": [[7, "gymnasium.spaces.Text", false]], "timeawareobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.TimeAwareObservation", false]], "timelimit (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.TimeLimit", false]], "to_jsonable() (gymnasium.spaces.space method)": [[5, "gymnasium.spaces.Space.to_jsonable", false]], "transform() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.transform", false]], "transformaction (class in gymnasium.wrappers)": [[16, "gymnasium.wrappers.TransformAction", false]], "transformaction (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.TransformAction", false]], "transformobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.TransformObservation", false]], "transformobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.TransformObservation", false]], "transformreward (class in gymnasium.wrappers)": [[19, "gymnasium.wrappers.TransformReward", false]], "transformreward (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.TransformReward", false]], "transition() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.transition", false]], "transition_info() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.transition_info", false]], "tuple (class in gymnasium.spaces)": [[6, "gymnasium.spaces.Tuple", false]], "unflatten() (in module gymnasium.spaces.utils)": [[8, "gymnasium.spaces.utils.unflatten", false]], "unwrapped (gymnasium.env property)": [[2, "gymnasium.Env.unwrapped", false]], "unwrapped (gymnasium.vector.vectorenv property)": [[10, "gymnasium.vector.VectorEnv.unwrapped", false]], "unwrapped (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.unwrapped", false]], "vectoractionwrapper (class in gymnasium.vector)": [[14, "gymnasium.vector.VectorActionWrapper", false]], "vectorenv (class in gymnasium.vector)": [[10, "gymnasium.vector.VectorEnv", false]], "vectorizetransformaction (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.VectorizeTransformAction", false]], "vectorizetransformobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.VectorizeTransformObservation", false]], "vectorizetransformreward (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.VectorizeTransformReward", false]], "vectorobservationwrapper (class in gymnasium.vector)": [[14, "gymnasium.vector.VectorObservationWrapper", false]], "vectorrewardwrapper (class in gymnasium.vector)": [[14, "gymnasium.vector.VectorRewardWrapper", false]], "vectorwrapper (class in gymnasium.vector)": [[14, "gymnasium.vector.VectorWrapper", false]], "wrapper (class in gymnasium)": [[15, "gymnasium.Wrapper", false]], "wrapper_spec() (gymnasium.wrapper class method)": [[15, "gymnasium.Wrapper.wrapper_spec", false]], "wrapperspec (class in gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.WrapperSpec", false]], "write_to_shared_memory() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.write_to_shared_memory", false]]}, "objects": {"gymnasium": [[16, 0, 1, "", "ActionWrapper"], [2, 0, 1, "", "Env"], [18, 0, 1, "", "ObservationWrapper"], [19, 0, 1, "", "RewardWrapper"], [15, 0, 1, "", "Wrapper"], [4, 4, 1, "", "make"], [4, 4, 1, "", "make_vec"], [4, 4, 1, "", "pprint_registry"], [4, 4, 1, "", "register"], [5, 5, 0, "-", "spaces"], [4, 4, 1, "", "spec"], [15, 5, 0, "-", "wrappers"]], "gymnasium.ActionWrapper": [[16, 1, 1, "", "action"]], "gymnasium.Env": [[2, 2, 1, "", "action_space"], [2, 1, 1, "", "close"], [2, 2, 1, "", "metadata"], [2, 3, 1, "", "np_random"], [2, 3, 1, "", "np_random_seed"], [2, 2, 1, "", "observation_space"], [2, 1, 1, "", "render"], [2, 2, 1, "", "render_mode"], [2, 1, 1, "", "reset"], [2, 2, 1, "", "spec"], [2, 1, 1, "", "step"], [2, 3, 1, "", "unwrapped"]], "gymnasium.ObservationWrapper": [[18, 1, 1, "", "observation"]], "gymnasium.RewardWrapper": [[19, 1, 1, "", "reward"]], "gymnasium.Wrapper": [[15, 3, 1, "", "action_space"], [15, 1, 1, "", "close"], [15, 2, 1, "", "env"], [15, 1, 1, "", "get_wrapper_attr"], [15, 3, 1, "", "metadata"], [15, 3, 1, "", "np_random"], [15, 3, 1, "", "np_random_seed"], [15, 3, 1, "", "observation_space"], [15, 1, 1, "", "render"], [15, 1, 1, "", "reset"], [15, 1, 1, "", "set_wrapper_attr"], [15, 3, 1, "", "spec"], [15, 1, 1, "", "step"], [15, 3, 1, "", "unwrapped"], [15, 1, 1, "", "wrapper_spec"]], "gymnasium.envs.functional_jax_env": [[3, 0, 1, "", "FunctionalJaxEnv"]], "gymnasium.envs.functional_jax_env.FunctionalJaxEnv": [[3, 1, 1, "", "render"], [3, 1, 1, "", "reset"], [3, 1, 1, "", "step"]], "gymnasium.envs.registration": [[4, 0, 1, "", "EnvSpec"], [4, 0, 1, "", "WrapperSpec"], [4, 2, 1, "", "current_namespace"], [4, 4, 1, "", "find_highest_version"], [4, 4, 1, "", "get_env_id"], [4, 4, 1, "", "load_env_creator"], [4, 4, 1, "", "namespace"], [4, 4, 1, "", "parse_env_id"], [4, 2, 1, "", "registry"]], "gymnasium.experimental.functional": [[3, 0, 1, "", "FuncEnv"]], "gymnasium.experimental.functional.FuncEnv": [[3, 1, 1, "", "initial"], [3, 1, 1, "", "observation"], [3, 1, 1, "", "render_close"], [3, 1, 1, "", "render_image"], [3, 1, 1, "", "render_init"], [3, 1, 1, "", "reward"], [3, 1, 1, "", "state_info"], [3, 1, 1, "", "terminal"], [3, 1, 1, "", "transform"], [3, 1, 1, "", "transition"], [3, 1, 1, "", "transition_info"]], "gymnasium.spaces": [[7, 0, 1, "", "Box"], [6, 0, 1, "", "Dict"], [7, 0, 1, "", "Discrete"], [6, 0, 1, "", "Graph"], [7, 0, 1, "", "MultiBinary"], [7, 0, 1, "", "MultiDiscrete"], [6, 0, 1, "", "OneOf"], [6, 0, 1, "", "Sequence"], [5, 0, 1, "", "Space"], [7, 0, 1, "", "Text"], [6, 0, 1, "", "Tuple"]], "gymnasium.spaces.Box": [[7, 1, 1, "", "is_bounded"], [7, 1, 1, "", "sample"], [7, 1, 1, "", "seed"]], "gymnasium.spaces.Dict": [[6, 1, 1, "", "sample"], [6, 1, 1, "", "seed"]], "gymnasium.spaces.Discrete": [[7, 1, 1, "", "sample"], [7, 1, 1, "", "seed"]], "gymnasium.spaces.Graph": [[6, 1, 1, "", "sample"], [6, 1, 1, "", "seed"]], "gymnasium.spaces.MultiBinary": [[7, 1, 1, "", "sample"], [7, 1, 1, "", "seed"]], "gymnasium.spaces.MultiDiscrete": [[7, 1, 1, "", "sample"], [7, 1, 1, "", "seed"]], "gymnasium.spaces.OneOf": [[6, 1, 1, "", "sample"], [6, 1, 1, "", "seed"]], "gymnasium.spaces.Sequence": [[6, 1, 1, "", "sample"], [6, 1, 1, "", "seed"]], "gymnasium.spaces.Space": [[5, 1, 1, "", "contains"], [5, 3, 1, "", "dtype"], [5, 1, 1, "", "from_jsonable"], [5, 3, 1, "", "is_np_flattenable"], [5, 3, 1, "", "np_random"], [5, 1, 1, "", "sample"], [5, 1, 1, "", "seed"], [5, 3, 1, "", "shape"], [5, 1, 1, "", "to_jsonable"]], "gymnasium.spaces.Text": [[7, 1, 1, "", "sample"], [7, 1, 1, "", "seed"]], "gymnasium.spaces.Tuple": [[6, 1, 1, "", "sample"], [6, 1, 1, "", "seed"]], "gymnasium.spaces.utils": [[8, 4, 1, "", "flatdim"], [8, 4, 1, "", "flatten"], [8, 4, 1, "", "flatten_space"], [8, 4, 1, "", "unflatten"]], "gymnasium.utils.env_checker": [[9, 4, 1, "", "check_env"]], "gymnasium.utils.ezpickle": [[9, 0, 1, "", "EzPickle"]], "gymnasium.utils.performance": [[9, 4, 1, "", "benchmark_init"], [9, 4, 1, "", "benchmark_render"], [9, 4, 1, "", "benchmark_step"]], "gymnasium.utils.play": [[9, 0, 1, "", "PlayPlot"], [9, 0, 1, "", "PlayableGame"], [9, 4, 1, "", "play"]], "gymnasium.utils.play.PlayPlot": [[9, 1, 1, "", "callback"]], "gymnasium.utils.play.PlayableGame": [[9, 1, 1, "", "process_event"]], "gymnasium.utils.save_video": [[9, 4, 1, "", "capped_cubic_video_schedule"], [9, 4, 1, "", "save_video"]], "gymnasium.utils.seeding": [[9, 4, 1, "", "np_random"]], "gymnasium.utils.step_api_compatibility": [[9, 4, 1, "", "convert_to_done_step_api"], [9, 4, 1, "", "convert_to_terminated_truncated_step_api"], [9, 4, 1, "", "step_api_compatibility"]], "gymnasium.vector": [[11, 0, 1, "", "AsyncVectorEnv"], [12, 0, 1, "", "SyncVectorEnv"], [14, 0, 1, "", "VectorActionWrapper"], [10, 0, 1, "", "VectorEnv"], [14, 0, 1, "", "VectorObservationWrapper"], [14, 0, 1, "", "VectorRewardWrapper"], [14, 0, 1, "", "VectorWrapper"]], "gymnasium.vector.AsyncVectorEnv": [[11, 1, 1, "", "call"], [11, 1, 1, "", "close"], [11, 1, 1, "", "get_attr"], [11, 3, 1, "", "np_random"], [11, 3, 1, "", "np_random_seed"], [11, 1, 1, "", "reset"], [11, 1, 1, "", "set_attr"], [11, 1, 1, "", "step"]], "gymnasium.vector.SyncVectorEnv": [[12, 1, 1, "", "call"], [12, 1, 1, "", "close"], [12, 1, 1, "", "get_attr"], [12, 3, 1, "", "np_random"], [12, 3, 1, "", "np_random_seed"], [12, 1, 1, "", "reset"], [12, 1, 1, "", "set_attr"], [12, 1, 1, "", "step"]], "gymnasium.vector.VectorActionWrapper": [[14, 1, 1, "", "actions"]], "gymnasium.vector.VectorEnv": [[10, 2, 1, "", "action_space"], [10, 1, 1, "", "close"], [10, 2, 1, "", "closed"], [10, 2, 1, "", "metadata"], [10, 3, 1, "", "np_random"], [10, 3, 1, "", "np_random_seed"], [10, 2, 1, "", "num_envs"], [10, 2, 1, "", "observation_space"], [10, 1, 1, "", "render"], [10, 2, 1, "", "render_mode"], [10, 1, 1, "", "reset"], [10, 2, 1, "", "single_action_space"], [10, 2, 1, "", "single_observation_space"], [10, 2, 1, "", "spec"], [10, 1, 1, "", "step"], [10, 3, 1, "", "unwrapped"]], "gymnasium.vector.VectorObservationWrapper": [[14, 1, 1, "", "observations"]], "gymnasium.vector.VectorRewardWrapper": [[14, 1, 1, "", "rewards"]], "gymnasium.vector.VectorWrapper": [[14, 1, 1, "", "close"], [14, 1, 1, "", "render"], [14, 1, 1, "", "reset"], [14, 1, 1, "", "step"]], "gymnasium.vector.utils": [[13, 4, 1, "", "CloudpickleWrapper"], [13, 4, 1, "", "batch_space"], [13, 4, 1, "", "clear_mpi_env_vars"], [13, 4, 1, "", "concatenate"], [13, 4, 1, "", "create_empty_array"], [13, 4, 1, "", "create_shared_memory"], [13, 4, 1, "", "iterate"], [13, 4, 1, "", "read_from_shared_memory"], [13, 4, 1, "", "write_to_shared_memory"]], "gymnasium.wrappers": [[18, 0, 1, "", "AddRenderObservation"], [17, 0, 1, "", "AtariPreprocessing"], [17, 0, 1, "", "Autoreset"], [16, 0, 1, "", "ClipAction"], [19, 0, 1, "", "ClipReward"], [18, 0, 1, "", "DelayObservation"], [18, 0, 1, "", "DtypeObservation"], [18, 0, 1, "", "FilterObservation"], [18, 0, 1, "", "FlattenObservation"], [18, 0, 1, "", "FrameStackObservation"], [18, 0, 1, "", "GrayscaleObservation"], [17, 0, 1, "", "HumanRendering"], [17, 0, 1, "", "JaxToNumpy"], [17, 0, 1, "", "JaxToTorch"], [18, 0, 1, "", "MaxAndSkipObservation"], [18, 0, 1, "", "NormalizeObservation"], [19, 0, 1, "", "NormalizeReward"], [17, 0, 1, "", "NumpyToTorch"], [17, 0, 1, "", "OrderEnforcing"], [17, 0, 1, "", "PassiveEnvChecker"], [17, 0, 1, "", "RecordEpisodeStatistics"], [17, 0, 1, "", "RecordVideo"], [17, 0, 1, "", "RenderCollection"], [16, 0, 1, "", "RescaleAction"], [18, 0, 1, "", "RescaleObservation"], [18, 0, 1, "", "ReshapeObservation"], [18, 0, 1, "", "ResizeObservation"], [16, 0, 1, "", "StickyAction"], [18, 0, 1, "", "TimeAwareObservation"], [17, 0, 1, "", "TimeLimit"], [16, 0, 1, "", "TransformAction"], [18, 0, 1, "", "TransformObservation"], [19, 0, 1, "", "TransformReward"]], "gymnasium.wrappers.vector": [[14, 0, 1, "", "ClipAction"], [14, 0, 1, "", "ClipReward"], [14, 0, 1, "", "DictInfoToList"], [14, 0, 1, "", "DtypeObservation"], [14, 0, 1, "", "FilterObservation"], [14, 0, 1, "", "FlattenObservation"], [14, 0, 1, "", "GrayscaleObservation"], [14, 0, 1, "", "JaxToNumpy"], [14, 0, 1, "", "JaxToTorch"], [14, 0, 1, "", "NormalizeObservation"], [14, 0, 1, "", "NormalizeReward"], [14, 0, 1, "", "NumpyToTorch"], [14, 0, 1, "", "RecordEpisodeStatistics"], [14, 0, 1, "", "RescaleAction"], [14, 0, 1, "", "RescaleObservation"], [14, 0, 1, "", "ReshapeObservation"], [14, 0, 1, "", "ResizeObservation"], [14, 0, 1, "", "TransformAction"], [14, 0, 1, "", "TransformObservation"], [14, 0, 1, "", "TransformReward"], [14, 0, 1, "", "VectorizeTransformAction"], [14, 0, 1, "", "VectorizeTransformObservation"], [14, 0, 1, "", "VectorizeTransformReward"]]}, "objnames": {"0": ["py", "class", "Python class"], "1": ["py", "method", "Python method"], "2": ["py", "attribute", "Python attribute"], "3": ["py", "property", "Python property"], "4": ["py", "function", "Python function"], "5": ["py", "module", "Python module"]}, "objtypes": {"0": "py:class", "1": "py:method", "2": "py:attribute", "3": "py:property", "4": "py:function", "5": "py:module"}, "terms": {"": [1, 2, 3, 5, 6, 7, 9, 10, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 161, 163, 165, 166, 167, 170, 173, 175, 177], "0": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 157, 158, 161, 163, 165, 166, 167, 170, 173, 175, 177], "00": [2, 10, 18], "00000000e": [10, 18], "00048403": 14, "000635": [10, 14], "00080468": 14, "0008876301247721108": 19, "001": [133, 135, 140, 163, 170, 175], "00105192": 14, "00127": 14, "00133522": 14, "00142543": 14, "0015": 134, "002": [18, 139, 147, 169], "00209899": 14, "00214607": 11, "0025": [133, 134], "00266526": 14, "003": 140, "00372536": 14, "00395268": 14, "005": [144, 145, 170], "00529102": 14, "00545912": 14, "0059888": 18, "00611216": [10, 18], "00614867": 14, "00615723": 14, "00624371": 14, "00708": 6, "00711833": 6, "00715587": 14, "007812": 14, "008": [139, 147], "00848456": 10, "00942293": 10, "01": [2, 10, 14, 16, 18, 129, 137, 138, 141, 142, 143, 144, 145, 146, 154, 155, 163, 169, 170, 175], "010162116476634746": 19, "01098382": [10, 14], "01162461": 10, "0123456789": 7, "0123456789abcdefghijklmnopqrstuvwxyzabcdefghijklmnopqrstuvwxyz": 7, "01431748": 10, "015": 140, "01522993": 10, "01734283": [10, 16], "01823519": [10, 14, 16, 18], "0197368": [10, 18], "02": [10, 18, 145, 154, 155], "0203607": 16, "0219676": 14, "02418869": 10, "024251968": 14, "02438": 170, "02480598": [10, 14], "02727336": [10, 18], "02728892": 16, "0273956": [10, 18], "02796401": [10, 14, 16, 18], "02852531": [10, 14], "02858594": [10, 14], "02859527": [10, 16], "029028230434438706": 14, "02909703": 10, "03": [14, 129, 154, 155], "03156282": [10, 14, 16, 18], "03203924": [10, 14], "0333221090036294": 14, "03332211": 14, "03359492141887935": 14, "03392126": 10, "03516225": 10, "03517495": [10, 14], "03525399": 16, "03585979": [10, 18], "03625453": [10, 18], "03647037": 132, "03774345": 10, "03812904": 16, "03822722": 10, "04": [14, 16, 146, 154], "0446179": [10, 14, 16, 18], "04562247": 10, "0469136": [10, 14], "0469184": 10, "04731862": 10, "04740972": 10, "04794393": 16, "04799704": 10, "05": [14, 132, 137, 138, 141, 144, 154, 155, 169], "055928": 14, "05592803": 132, "06": 154, "06296527": 14, "06296527291998574": 14, "0631256": 14, "06312564": 132, "0640786": 14, "06540678": 18, "06670535": 16, "07": [133, 134, 154, 155], "07287608": 131, "07422512": 18, "08": [154, 155], "0808398": 11, "08227695": 18, "0892358": [14, 132], "09": [154, 155], "09417735": 13, "09613613": 18, "0_": [137, 138, 142, 144], "0_2": [144, 145], "0_3": 144, "0a2": 132, "0th": 17, "0x7f04efcb8850": 157, "0x7fbb5efd0490": 15, "1": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 157, 158, 163, 165, 166, 167, 170, 173, 175, 177], "10": [6, 7, 14, 17, 18, 127, 129, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 150, 151, 153, 154, 155, 158, 165, 170, 175, 177], "100": [6, 8, 14, 17, 127, 128, 129, 131, 134, 139, 144, 148, 151, 152, 158], "1000": [9, 17, 128, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 156, 169, 170, 177], "1006": 155, "1009": 155, "100_000": [163, 175], "1013": 155, "10172354684460168": 14, "10172355": 14, "1018": 155, "1019": 155, "1027": 155, "1033": 155, "105": [137, 155], "107": 137, "10_000": 161, "10x": [163, 175], "11": [14, 137, 138, 139, 140, 141, 144, 147, 150, 151, 154, 170, 175], "110": 155, "11185605": 131, "112": 155, "11448676": 14, "117": 155, "12": [8, 16, 17, 18, 129, 131, 132, 137, 138, 139, 140, 141, 144, 147, 151, 155, 170, 175], "120": 170, "12224312": [11, 12], "123": [2, 6, 10, 11, 14, 16, 17, 18, 19, 131, 132, 133, 134, 135, 155, 160, 173], "124": 155, "125": 155, "125637": 6, "12625128": 131, "128": 170, "1280": 136, "13": [137, 138, 140, 141, 144, 147, 153, 154, 155], "130": [140, 141, 155], "132": [151, 155], "136": 155, "138": 155, "139": 155, "14": [137, 138, 140, 141, 144, 147, 154, 155, 175], "141": 155, "1415927": 127, "143": 155, "145": 137, "1455": 17, "146": 155, "14995256": [11, 12], "15": [8, 16, 17, 18, 19, 129, 137, 138, 140, 141, 144, 147, 152, 170, 173], "150": 9, "1500": 154, "1506": 170, "15089367": [10, 16], "1511158": 18, "155": 155, "1598639586606745": 14, "16": [135, 137, 138, 140, 141, 144, 147, 152, 154, 155, 175, 177], "160": 155, "1600": 127, "1613": 153, "164": 155, "16717631": 10, "17": [137, 138, 140, 141, 144, 147, 150, 154], "171": 155, "1710671": 10, "172": 155, "173": 155, "174": 155, "17752565": 131, "178": 155, "179": 155, "18": [18, 137, 138, 140, 141, 144, 147, 154], "1800": 173, "183": 155, "184": 155, "1851753": 11, "186": 155, "1878752": 12, "18847767": 10, "18879032e": [10, 18], "1887903e": 2, "18933342": 6, "19": [137, 140, 141, 144, 170], "190": 155, "19049619": 6, "1938739": 14, "195": [155, 169], "19559774": 10, "1989": 163, "199": 9, "1990": [133, 134], "1995": 131, "1996": 131, "1_": 145, "1b15": 154, "1d": 177, "1e": [14, 18, 19, 139, 140, 141, 146, 147, 177], "2": [2, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 158, 163, 165, 167, 170, 173, 175, 177], "20": [129, 137, 140, 141, 144, 153, 155, 170, 173, 175], "200": [9, 17, 129, 131, 132, 133, 135, 152, 153, 155], "2000": [9, 127, 153, 173], "2004": 177, "200th": 17, "2014": 128, "2016": 170, "2017": 154, "20172954": 18, "2018": [16, 17, 131, 154], "2019": 154, "2020": [148, 150, 151], "2021": [136, 148, 154], "2022": [136, 154, 155], "2023": 155, "2024": 155, "205": 155, "206": 155, "208": 155, "20946532": 14, "2095": 132, "21": [18, 19, 137, 140, 141, 144, 150, 155, 157, 163, 170, 175], "210x180": 17, "215": 155, "216": 155, "21649833": 6, "218": 155, "219": 155, "21944423": 14, "22": [17, 137, 140, 141, 144, 155, 175], "220": 175, "221": 155, "223": 155, "224": 155, "225": 155, "227": 153, "229": 177, "23": [137, 140, 141, 144, 155], "230": 155, "234": 155, "235": 155, "235794": 6, "2359734": 14, "238": 155, "24": [14, 17, 18, 127, 128, 129, 132, 137, 140, 141, 155, 175], "24002443": 10, "241": 155, "244": 155, "24503504": 14, "2456": 154, "246": 155, "247": 155, "2487226": 10, "249": 155, "25": [2, 8, 10, 17, 19, 137, 139, 140, 141, 147, 153, 155, 169], "250": [155, 161], "252": 155, "2524": 154, "254": 155, "255": [17, 18, 128, 155, 165], "25533703": 18, "256": 177, "257": 155, "25715804": 10, "258": 155, "259": 155, "25g": 137, "26": [2, 17, 137, 140, 141, 157, 159, 166, 177], "26141977": 10, "264": 155, "267": 155, "2671": 154, "2678": 154, "268": 155, "2684": 154, "27": [9, 19, 140, 141], "270": 155, "272": 155, "2736044": 135, "274": [131, 155], "274334": 131, "27648": [14, 18, 157], "277": 155, "278": 155, "27800309628058434": 14, "279": 155, "28": [14, 131, 140, 141], "281": 155, "282": 155, "2831855": 129, "284": 155, "29": [17, 140, 141, 154], "2906": 154, "292": 155, "2d": [136, 145, 148, 158, 167], "2gb": 154, "2nd": 163, "3": [2, 4, 5, 6, 7, 8, 9, 10, 12, 13, 14, 15, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 157, 158, 159, 163, 165, 170, 173, 175, 177], "30": [2, 9, 17, 140, 141, 155], "300": [127, 153, 165, 173], "3000": 9, "3016": 154, "303": 153, "3034542e": 14, "3036": 154, "3037": 154, "304": 155, "3040": 154, "3041": 154, "3044": 154, "3072": 154, "3076": 154, "3080": 154, "3083": 154, "31": [140, 141, 155], "3110827": 10, "3118435e": 14, "3152": 19, "3154": 19, "32": [18, 140, 141, 150, 170, 177], "321": 155, "323": [144, 155], "32351476": 18, "327": [137, 155], "329": 155, "33": [140, 141], "330": 155, "331": 155, "3319138": 10, "332": 155, "33293587": [10, 16], "333": 155, "337": 155, "338": 155, "339": 155, "34": [140, 141], "340": 155, "341": 155, "342": 155, "34641072": 16, "34756234": 16, "348": [140, 141], "35": [140, 141, 155], "350": [140, 141], "355": 155, "36": [6, 140, 141, 151], "37": [140, 141, 155], "373": 155, "38": [140, 141], "39": [140, 141, 155], "3991573": 6, "3d": [136, 137, 140, 141, 148, 158, 175], "3rd": 136, "3v3": 148, "4": [2, 6, 7, 8, 9, 10, 14, 16, 17, 18, 127, 128, 129, 131, 132, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 155, 158, 161, 165, 167, 175, 177], "40": [140, 141], "400": [18, 153], "403": 155, "404": [153, 155], "405": 155, "407": 155, "41": [140, 141, 155], "4123625": 135, "418": [132, 155], "41887903": [10, 18, 132], "4189": 6, "42": [6, 7, 10, 11, 12, 13, 16, 18, 140, 141, 155, 156, 170], "424": 155, "42683297": 14, "42884544": 14, "429": 155, "43": [140, 141], "433": 155, "435": 155, "4351738": 14, "4354835e": 14, "43887845": 13, "44": [6, 140, 141], "440": 155, "444": 155, "44504836": 14, "44799727": 14, "4488689e": 14, "45": [134, 144], "450": 155, "451": 155, "459": 155, "4624777": 14, "46352962": [14, 133], "46515748": 14, "46543318": 14, "46546045": 14, "46553135": 14, "47": 151, "47037": 6, "47377947": 14, "474": 155, "47599354": 14, "48": 151, "480": 136, "4808415e": 16, "482": 155, "48468155": 14, "48657528": 14, "487": 155, "491": 155, "495": 155, "497": 155, "498371": 14, "4x12": 151, "4x4": 152, "4\u03c0": 131, "5": [6, 7, 8, 9, 10, 13, 14, 16, 17, 18, 19, 127, 128, 129, 131, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 153, 158, 163, 165, 170, 173, 175, 177], "50": [17, 129, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 177], "500": [17, 18, 131, 132, 153, 154, 170, 175], "503": 155, "51": [6, 155], "510": 155, "512": [155, 165], "515": 155, "516": 155, "518": 155, "520": 155, "522": 155, "524": 155, "525": 155, "5281": 155, "529": 155, "533": 155, "5342437e": 14, "535": 155, "536": 155, "538": 155, "54": 6, "540": 155, "5420062": 16, "548": 155, "55": [6, 155], "553": 155, "554": 155, "557": 155, "558": 155, "561": 155, "5635296": 134, "564": 155, "566371": 131, "567": 131, "5676788": 18, "569": 155, "57": 170, "572": 155, "573": 155, "576": 155, "5760367": [11, 12], "58": 155, "580": 155, "581": 155, "583": 155, "5836242": 6, "588": 155, "589": 155, "590": 155, "594091": 6, "5e": [137, 139, 140, 141, 147, 169], "5e3": 177, "5gb": 154, "5x": 155, "5x5": 153, "6": [8, 14, 16, 17, 18, 129, 131, 133, 134, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 151, 153, 155, 177], "60": [8, 155], "600": [18, 170], "604": 155, "608": 155, "61": 155, "6102389": 12, "612": 155, "6120394": 18, "616": 155, "6193494": 11, "62": [6, 11, 12], "62259156": 14, "63": [6, 7], "63214064": 6, "634": 155, "6343064e": 14, "6344974": 16, "635": 155, "6382770e": 16, "639": 153, "6397772e": 16, "64": [6, 155], "6608303": 16, "666": 129, "669": 6, "6822636": 6, "683": 155, "68512": 6, "695": 155, "697368": 13, "7": [9, 129, 134, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 148, 155, 170], "70": 155, "708": 155, "709": 155, "714599": 11, "7183299": 18, "72": 155, "720": 136, "7257502": 6, "729": 9, "73": 155, "731": 155, "732": 128, "74": 155, "748": 155, "75": [16, 137, 169], "75cm": 137, "7619033": 14, "762": 155, "7695615": 12, "77": 6, "77395606": 13, "774": 155, "78": [137, 140, 141, 155], "7851154": 11, "788": 155, "789": 155, "791": 155, "79221743": 12, "8": [6, 9, 10, 14, 18, 19, 128, 129, 131, 132, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 152, 154, 170, 173, 177], "80": 152, "800": [155, 173], "8000": 1, "80000019": [10, 18], "80000019e": [10, 18], "8000002e": 2, "801": 155, "81": [11, 12, 135, 155], "810": 155, "811": 155, "812": 155, "8127025": 155, "8127026": 155, "817": 155, "8174238": [11, 12], "81882": 6, "827": 155, "83": 155, "830": 155, "83506": 6, "838": 155, "84": [6, 17], "8498053": 12, "84x84": 17, "85859793": 13, "867": 155, "886": 155, "889": 155, "88g": 137, "89": [6, 155], "8914354": 18, "89235795": 135, "893": 155, "89345848": 14, "8934584807363618": 14, "8971417e": 14, "898": 155, "8f1d43620bc6bb580df6e80b0dc05c48": 131, "8gb": 154, "8x8": 152, "9": [11, 12, 14, 16, 131, 135, 137, 138, 139, 140, 141, 142, 144, 145, 147, 150, 155, 170, 175], "90": 175, "9053838": 6, "9061728": 14, "90849805": 14, "910": 137, "91101986": 135, "91244936": [11, 12], "915": 155, "91g": 137, "9216": 14, "924": 155, "926": [128, 155], "928024": 6, "9296501": 14, "932": 155, "934": 155, "9375233e": 14, "9380709": 16, "941": 155, "9416149": 14, "9428282": 14, "9429494": 14, "95": [128, 163, 170, 173, 175], "9503881": 14, "950912": 6, "952": 155, "9532328e": 14, "956": 155, "957": 155, "958": 155, "959": 155, "96": [14, 18, 128, 157], "9635296": 14, "96495728": 11, "965": 155, "96562607": 12, "96x96": 128, "97562236": 13, "977": 155, "978": 155, "97854": 6, "979": 155, "9808417e": 16, "982": 155, "98219293": 12, "98270553": 11, "9841162": 131, "9886932": [11, 12], "9898634e": 14, "99": [14, 19, 170, 177], "9944268": 18, "997341": 131, "999": [134, 170], "99902063": 12, "9999": 129, "9\u03c0": 131, "A": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 14, 16, 17, 18, 19, 20, 127, 128, 131, 132, 134, 136, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 148, 150, 151, 152, 154, 155, 158, 159, 165, 166, 167, 170, 175, 177], "And": [165, 169], "As": [2, 5, 10, 15, 17, 131, 132, 136, 149, 151, 153, 154, 155, 157, 158, 162, 167, 170, 173, 177], "At": [14, 17, 157], "Be": 155, "But": 137, "By": [2, 4, 17, 18, 131, 137, 138, 139, 140, 141, 146, 147, 148], "For": [2, 3, 5, 7, 8, 9, 10, 17, 18, 19, 128, 129, 136, 151, 152, 153, 154, 155, 157, 158, 159, 160, 161, 162, 163, 165, 167, 169, 170, 175], "If": [1, 2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 127, 128, 129, 132, 134, 137, 139, 140, 147, 148, 150, 151, 152, 154, 155, 156, 157, 159, 163, 165, 167, 169, 170, 172, 173, 175], "In": [2, 7, 9, 13, 14, 15, 16, 19, 128, 129, 130, 131, 132, 137, 138, 139, 140, 141, 145, 146, 147, 153, 154, 155, 157, 158, 160, 161, 162, 165, 166, 167, 169, 170, 173, 175, 177], "It": [2, 4, 6, 7, 9, 10, 11, 12, 17, 127, 136, 137, 140, 141, 144, 145, 146, 148, 154, 155, 158, 162, 163, 165, 175, 177], "Its": 148, "No": [16, 17, 18, 150, 154, 177], "Not": [17, 154], "On": [131, 132, 133, 134, 135, 154, 155, 158, 173], "One": [7, 137, 140, 141, 155, 157, 158], "Or": 153, "Such": [15, 167], "That": [3, 132], "The": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 163, 165, 166, 169, 170, 175, 177], "Then": [1, 15, 157, 165, 167], "There": [7, 15, 127, 129, 130, 133, 134, 136, 140, 142, 148, 151, 153, 154, 155, 157, 158, 165], "These": [9, 126, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 160, 163, 175, 177], "To": [1, 2, 4, 9, 10, 13, 15, 17, 18, 127, 128, 129, 131, 132, 144, 153, 154, 155, 157, 158, 159, 160, 163, 165, 167, 169, 170, 173, 175, 177], "Will": [9, 17, 20, 155], "With": [6, 14, 17, 148, 151, 155, 170, 173, 175], "_": [9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 128, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 156, 163, 167, 170, 173], "_2": [137, 138, 139, 140, 141, 144, 145, 146, 147], "__all__": 155, "__class__": 170, "__del__": 155, "__eq__": 154, "__future__": [170, 175, 177], "__get_attr__": 155, "__getattr__": 155, "__getitem__": 154, "__init__": [2, 9, 14, 15, 16, 18, 155, 163, 165, 167, 170, 173, 175, 177], "__len__": 154, "__main__": [167, 170], "__name__": [167, 170], "__setitem__": 154, "_action_to_direct": [158, 165], "_agent_loc": [158, 165], "_async_work": 11, "_bigint_from_byt": 154, "_build": 1, "_control": 141, "_cost": [140, 141], "_episod": [14, 17], "_final_info": 14, "_final_observ": 14, "_get_info": [158, 165], "_get_ob": [158, 165], "_int_list_from_bigint": 154, "_k": 14, "_modul": 18, "_nois": [137, 138, 139, 140, 141, 142, 143, 146, 147], "_np_random": [2, 5, 10, 154], "_np_random_se": [2, 10], "_rang": [140, 141], "_render_fram": 165, "_replac": 173, "_reward_ctrl": 14, "_reward_run": 14, "_scale": [137, 138, 139, 140, 141, 142, 143, 146, 147], "_script": 1, "_skip": 169, "_static": [1, 173], "_step_mujoco_simul": 155, "_target_loc": [158, 165], "_timelimit": 154, "_update_running_mean": [14, 19], "_worker": 11, "_x_posit": 14, "_x_veloc": 14, "a2c": [155, 168, 171], "a3c": 170, "a_": [166, 170], "a_0": 6, "a_i": 6, "a_max": 170, "a_min": 170, "a_n": 6, "a_t": 166, "aaronwalsman": 155, "ab": [128, 170], "abbeel": [137, 170], "abdomen": [140, 141], "abdomen_i": [140, 141], "abdomen_x": [140, 141], "abdomen_z": [140, 141], "abil": [155, 157, 162], "abl": [155, 166], "about": [2, 3, 9, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 155, 157, 158, 162, 163, 165, 166, 175, 177], "abov": [2, 6, 7, 9, 131, 132, 135, 136, 155, 157, 159, 160, 161, 170, 177], "absolut": [141, 143, 147, 169], "abstract": [155, 158, 165], "ac": [150, 175], "academ": 162, "academia": 155, "acceler": [128, 133, 134, 135, 148, 154, 155, 170], "accent_r": 175, "accept": [2, 5, 7, 9, 17, 154, 155, 158, 165, 169], "access": [14, 15, 17, 148, 154, 155, 160, 167], "accid": 154, "accident": 154, "accord": [4, 6, 7, 129, 170], "accordingli": 165, "account": [148, 154, 173], "accumul": 161, "accur": [136, 155], "accuraci": 136, "achiev": [2, 131, 150, 154, 161, 162, 170], "acquir": 136, "acrobot": [130, 154], "acrobotenv": 131, "acronym": 177, "across": 13, "act": [14, 155, 161, 162, 163, 167, 175], "action": [2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 15, 17, 20, 130, 136, 148, 149, 154, 155, 156, 158, 160, 161, 165, 167, 169, 170, 173, 177], "action_log_prob": 170, "action_logit": 170, "action_logits_vec": 170, "action_mask": 153, "action_mean": 177, "action_pd": 170, "action_s": 173, "action_shap": 170, "action_spac": [2, 5, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 153, 155, 156, 157, 158, 160, 161, 163, 165, 167, 173, 175, 177], "action_space_dim": 177, "action_stddev": 177, "action_valu": 175, "actionwrapp": [14, 15, 16], "actor": 177, "actor_lay": 170, "actor_loss": 170, "actor_losses_moving_averag": 170, "actor_lr": 170, "actor_optim": 170, "actor_weight": 170, "actor_weights_path": 170, "acttyp": [2, 3, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 155], "actual": [137, 153, 155, 158, 161, 165, 170, 173], "actuat": [131, 132, 140, 141, 155, 169], "acycl": 148, "ad": [10, 14, 16, 17, 18, 19, 129, 132, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 157, 158, 160, 165], "adamw": 177, "adapt": [132, 142, 143, 148, 151], "add": [1, 4, 5, 9, 11, 16, 17, 18, 19, 20, 128, 136, 137, 140, 141, 151, 153, 154, 155, 157, 158, 165, 167, 170, 175], "add_subplot": 175, "addit": [2, 5, 129, 130, 150, 153, 154, 155, 157, 160, 163, 165, 169, 170, 175, 177], "addition": [148, 154, 155, 158, 159], "additional_wrapp": [4, 155], "addrenderobserv": [18, 20, 155], "address": [155, 160], "adilzouitin": 154, "adjac": 6, "adjust": 169, "advanc": [11, 131, 155], "advantag": [137, 155, 165], "advis": 14, "affect": 155, "affin": [14, 16, 18, 20, 129, 157], "after": [2, 3, 5, 9, 10, 17, 129, 137, 138, 139, 140, 141, 144, 145, 146, 147, 150, 153, 154, 155, 157, 158, 163, 165, 166, 170, 175, 177], "again": [2, 129, 158, 165], "against": [148, 154], "agen": 158, "agent": [2, 4, 5, 14, 17, 20, 127, 128, 129, 133, 137, 138, 139, 140, 141, 142, 143, 145, 146, 147, 148, 154, 155, 157, 158, 160, 162, 165, 166, 167, 173], "agent_posit": 167, "agnost": [137, 138, 139, 140, 141, 146, 147], "ago": 155, "ahead": 154, "ahmedo42": 154, "ai": 148, "aim": [136, 139, 146, 147, 154, 155, 177], "air": [144, 155], "al": [4, 14, 16, 17, 19, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 154, 155, 163, 175], "ale_pi": [17, 155], "alexdluken": 155, "algorithm": [2, 5, 148, 149, 153, 154, 155, 163, 166, 169, 170, 173, 175, 177], "alias": 148, "align": 155, "aliv": [140, 147], "alive_bonu": 142, "all": [2, 4, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 126, 128, 129, 130, 131, 132, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 154, 155, 157, 158, 167, 169, 170, 171, 173, 175, 177], "all_act": 173, "all_stat": 173, "allow": [3, 4, 5, 6, 9, 11, 12, 14, 15, 17, 18, 19, 20, 131, 132, 133, 134, 135, 140, 141, 142, 143, 147, 148, 154, 155, 157, 158, 159, 160, 165, 167, 170, 177], "almost": 154, "along": [2, 9, 10, 15, 18, 128, 132, 133, 134, 142, 143, 146, 151, 154, 155, 157, 159], "alongsid": 157, "alphabet": 7, "alphanumer": 7, "alreadi": [2, 10, 15, 155, 157], "alreadypendingcallerror": 11, "also": [2, 7, 9, 10, 11, 12, 17, 18, 127, 128, 129, 130, 136, 137, 143, 148, 154, 155, 157, 158, 163, 165, 166, 167, 169, 170, 173, 175, 177], "alter": [2, 15, 157, 165], "altern": [1, 148, 155, 158], "although": [7, 137, 169], "altogeth": 169, "alwai": [7, 8, 17, 129, 133, 134, 137, 140, 141, 142, 145, 146, 150, 151, 152, 153, 155, 165], "among": [16, 130, 136], "amount": [9, 127, 132, 142, 143, 177], "amp": 155, "amtamasi": 154, "an": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 165, 166, 167, 169, 170, 172, 173], "analog": 2, "analogi": 144, "analysi": 148, "analyt": 142, "anderson": [132, 142, 143], "andrea": [155, 169, 173], "andrew": [133, 134, 155, 173], "andrewtanj": 154, "angl": [127, 129, 131, 132, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154], "angle_1": 137, "angle_2": 137, "angle_3": 137, "angle_4": 137, "angular": [127, 129, 131, 132, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "ani": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 129, 132, 137, 139, 140, 142, 143, 146, 147, 148, 152, 154, 155, 157, 158, 160, 163, 165, 166, 169, 173, 175, 177], "anim": [9, 136], "ankle_1": 137, "ankle_2": 137, "ankle_3": 137, "ankle_4": 137, "annot": [154, 155, 170, 173, 175, 177], "annot_kw": 173, "annoy": 155, "anoth": [14, 15, 137, 140, 141, 143, 147, 153, 154, 163, 166, 175, 177], "ansi": [2, 154, 155, 165], "ansi_list": 2, "answer": [148, 165], "ant": [136, 154, 155, 169], "anymor": [14, 18, 19, 154, 155], "anyth": [148, 169, 173], "anytrad": 148, "anywher": 155, "apach": 148, "apart": 165, "api": [2, 3, 17, 20, 148, 154, 155, 156, 157, 160, 165, 166, 169, 170], "apicompat": 155, "appear": [133, 134], "append": [14, 19, 155, 163, 170, 173, 175, 177], "appli": [2, 4, 6, 8, 9, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 127, 129, 130, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 154, 155, 157, 158, 161, 163, 165, 167, 175, 177], "applic": [5, 7, 146, 148, 157], "apply_along_axi": 175, "apply_api_compat": [4, 154, 155, 159], "apply_env_compat": 155, "appreci": 154, "approach": [19, 142, 154, 155, 158, 165, 175, 177], "appropri": [155, 158, 165, 169, 177], "approxim": [14, 19, 148], "ar": [2, 4, 5, 6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 157, 158, 159, 160, 161, 162, 163, 165, 166, 167, 169, 170, 172, 173, 175, 177], "araffin": 154, "arang": [170, 173, 175], "arbitrari": [2, 4, 7, 9, 148, 170], "arbitrarili": 155, "arcad": [17, 148], "archiv": 154, "area": 136, "aren": 154, "arg": [9, 11, 12, 163, 170, 175, 177], "argmax": [153, 163, 173, 175], "argument": [4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 126, 130, 149, 154, 155, 157, 158, 159, 165, 169, 175], "ariel": 155, "arjun": [154, 155], "arm": [136, 140, 141, 144, 145, 148], "around": [3, 14, 126, 148, 154], "arr": 175, "arrai": [2, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 20, 129, 131, 132, 133, 134, 135, 154, 155, 158, 165, 167, 170, 175, 177], "arrang": 4, "array_equ": [158, 165], "arraytyp": [10, 11, 12, 14], "arrow": [7, 173], "articl": 173, "articul": [146, 148], "artifici": 153, "arxiv": 170, "as_cmap": 173, "ascii": 154, "aspect": [154, 157, 166], "assert": [155, 165], "assess": 2, "asset": 152, "assign": [2, 132, 133, 134, 144, 145, 148, 173, 175], "assist": 148, "associ": [154, 155, 160], "assum": [7, 14, 18, 136, 165], "async": [4, 11, 155], "asynchron": [5, 10, 11, 12, 155], "asyncvectorenv": [10, 155, 162, 170], "atari": [9, 17, 20, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 154, 155, 157, 163, 175], "atari_wrapp": 18, "ataripreprocess": [17, 20], "attach": [129, 132, 135, 137, 142, 143, 147], "attempt": [8, 129, 136, 153, 170], "attitud": 148, "attr": 6, "attribut": [4, 6, 11, 12, 17, 18, 19, 155, 157, 160, 165, 167], "attributeerror": 155, "augment": [18, 20, 157], "author": [133, 134, 154, 155, 169, 170, 173, 175, 177], "auto": [21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 155], "autobuild": 1, "autom": [148, 154], "automat": [1, 2, 4, 10, 11, 12, 17, 18, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 154, 155, 159, 160, 165, 170], "automata": [148, 155], "autoreset": [4, 10, 17, 20, 155], "autoresetwrapp": [17, 154, 155], "autorom": 154, "aux_1": 137, "aux_2": 137, "aux_3": 137, "aux_4": 137, "auxiliari": [2, 158, 165], "avail": [4, 129, 148, 150, 151, 154, 155, 158, 161, 165], "averag": [9, 14, 19, 20, 166, 173, 175, 177], "avg_reward": 177, "avoid": [10, 13, 15, 151, 153, 157, 169], "awai": [137, 138, 139, 140, 144, 145, 146, 147, 167, 173], "awak": 129, "awar": [155, 160], "award": 132, "ax": [5, 7, 165, 170, 173, 175], "ax1": 175, "ax2": 175, "axb2035": 155, "axi": [17, 18, 133, 134, 137, 140, 141, 146, 173, 175], "b": [1, 7, 144, 145, 153, 154], "b2dtut": 128, "b5": 7, "back": [11, 15, 18, 20, 126, 136, 137, 138, 155, 169], "back_leg": 137, "backend": [154, 155, 165], "background": 128, "backlog": 155, "backport": 154, "backward": [138, 154, 155, 159, 170, 177], "bad": 13, "badia": 170, "balanc": [132, 142, 143, 170, 177], "bali": 155, "balisujohn": [154, 155], "bandit": 148, "bar": [9, 170], "bare": 15, "barto": [2, 131, 132, 142, 143, 150, 151, 154, 163, 173, 175], "base": [2, 4, 5, 6, 9, 10, 13, 14, 15, 17, 18, 19, 20, 126, 131, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 153, 154, 155, 157, 163, 165, 167, 169, 170, 175, 177], "base_env": 15, "baselin": 148, "baselines3": 18, "bash": 154, "basic": [155, 158, 163, 169], "basilisk": 148, "batch": [5, 10, 11, 12, 13, 14, 154, 170], "batch_spac": [5, 13, 155], "bbox_inch": 173, "bbox_to_anchor": 175, "beat": [148, 150], "beatabl": [163, 175], "becam": 154, "becaus": [136, 158, 160, 163, 165, 167, 169, 170, 175], "becom": [126, 154, 155, 166, 170], "been": [2, 9, 10, 129, 137, 140, 141, 148, 154, 155, 157, 158, 160, 165, 166, 170, 173], "befor": [2, 4, 9, 10, 14, 15, 16, 17, 18, 19, 20, 128, 137, 138, 139, 140, 144, 145, 146, 147, 154, 155, 158, 163, 165, 167, 169, 170, 177], "began": 155, "begin": [14, 17, 154, 158, 160, 165, 170, 175], "behav": 154, "behavior": [14, 15, 135, 136, 137, 138, 139, 140, 141, 146, 147, 154, 158, 165, 167, 169], "behaviour": [129, 136, 155, 157, 161], "behind": [2, 137, 140, 155], "being": [3, 4, 8, 17, 129, 133, 134, 135, 137, 140, 141, 144, 145, 154, 155, 158, 163, 169, 173, 175], "believ": 155, "belong": 6, "below": [7, 10, 20, 135, 147, 155, 157, 161, 166, 170, 175], "benchmark": [126, 148, 171, 176], "benchmark_init": 9, "benchmark_rend": 9, "benchmark_step": 9, "benefit": [17, 155], "bennet": 148, "besid": 136, "best": [163, 170, 173, 175], "better": [127, 154, 155, 170], "between": [3, 6, 7, 14, 17, 18, 19, 20, 129, 131, 132, 135, 136, 137, 138, 139, 140, 141, 142, 143, 145, 146, 147, 148, 154, 155, 158, 163, 165, 166, 170, 173, 175, 177], "beyond": 155, "bfoot": 138, "bia": [137, 138, 139, 140, 141, 146, 147, 170], "bias": 170, "big": [165, 173], "bigger": 173, "biggest": 173, "binari": [5, 7, 148, 157, 165], "bind": [126, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154], "biomechan": 136, "bipe": 136, "biped": [126, 140, 141, 147, 154, 169], "bipedal_walk": 127, "bipedalwalk": [127, 154], "biplan": 148, "bird": 148, "bit": 154, "bitcraz": 148, "bkrl": 154, "black": [131, 154, 155, 173, 175], "blackjack": [149, 154, 155, 163, 171, 176], "blackjack_tutori": 175, "blackjackag": [163, 175], "blit": 165, "blob": [151, 154], "block": 155, "blog": [154, 155, 160], "blue": [131, 153, 165, 173], "board": 148, "board_gam": 154, "bodi": [129, 136, 137, 138, 139, 140, 141, 142, 144, 146, 147, 169], "boilerpl": [15, 157, 158, 165, 167], "bonu": [147, 170], "book": [131, 150, 151, 154, 163, 175], "book_or_nip": 131, "bool": [2, 4, 5, 6, 7, 9, 10, 11, 12, 14, 15, 17, 18, 132, 136, 137, 138, 139, 140, 141, 146, 147, 160, 163, 165, 173, 175, 177], "bool8": 155, "bool_": 155, "boolean": [2, 5, 9, 14, 17, 129, 155, 160, 163, 175, 177], "booster": 129, "bootstrap": [2, 160, 166, 170], "both": [7, 10, 11, 12, 18, 127, 130, 131, 147, 148, 150, 151, 152, 153, 154, 155, 157, 158, 159, 162, 165, 166, 173, 177], "bother": 165, "bottom": [128, 133, 134, 139, 151, 155, 173], "bound": [2, 7, 14, 16, 17, 18, 19, 20, 129, 131, 132, 133, 134, 135, 154, 155, 157, 158, 160, 166], "boundari": 8, "bounded": 5, "box": [2, 5, 6, 7, 8, 10, 11, 13, 14, 15, 16, 17, 18, 20, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 157, 158, 165, 167], "box2d": [127, 128, 129, 154, 155, 157], "brake": 128, "brax": [148, 155], "break": [128, 154, 155, 160, 173], "bridg": 148, "briefli": 160, "bring": 155, "brittl": 177, "broken": [17, 154], "brown": 155, "browser": 1, "bshin": 138, "bsk": 148, "bsuit": 148, "bthigh": 138, "btjanaka": 155, "buffalo": 148, "buffer": [10, 14, 17, 128, 161], "buffer_length": [14, 17, 161], "bug": [17, 129, 136, 137, 139, 140, 142, 143, 144, 145, 147, 152, 154, 155], "build": [5, 126, 137, 138, 139, 140, 141, 142, 143, 146, 147, 155, 165, 167], "built": [13, 148, 154, 155, 160, 162], "bullet": 148, "bump": 154, "bunch": 173, "burden": 177, "bushi": 9, "bust": [150, 175], "button": [5, 7, 9, 157], "c": [9, 129, 131, 155, 165, 177], "calcul": [14, 18, 19, 135, 137, 142, 151, 152, 153, 169, 170, 177], "call": [2, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 137, 144, 145, 150, 152, 154, 155, 157, 158, 159, 160, 163, 165, 169, 170, 175], "call_async": 155, "call_wait": 155, "callabl": [3, 4, 9, 11, 12, 13, 14, 16, 17, 18, 19, 155], "callback": [9, 154], "cambridg": [133, 134], "camera": [136, 137, 138, 139, 140, 146, 147], "camera_id": 136, "camera_nam": 136, "campbel": 128, "can": [1, 2, 4, 5, 6, 7, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 126, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 165, 167, 169, 170, 173, 177], "cannot": [4, 8, 13, 17, 148, 151, 154, 155, 157, 158, 160, 165, 166, 172], "canva": 165, "capabl": [154, 156], "capped_cubic_video_schedul": [9, 17], "captur": [9, 17], "car": [126, 130, 148, 154, 157], "car_rac": [128, 154, 157], "card": [150, 163, 175], "care": [5, 148], "carl": 148, "carlo": [163, 170, 175, 177], "carloslui": 154, "carrac": [9, 14, 18, 128, 154, 157], "cart": [130, 142, 143, 177], "cartesian": [6, 7, 135], "cartpol": [2, 9, 10, 14, 16, 17, 18, 19, 130, 132, 136, 142, 143, 148, 154, 155, 157, 159, 161, 177], "cartpoleenv": [17, 132, 155], "cartpolejax": 17, "cartpolevectorenv": [132, 155], "casadi": 148, "case": [2, 3, 5, 6, 7, 13, 14, 16, 128, 137, 138, 139, 140, 141, 146, 147, 153, 155, 157, 158, 160, 163, 165, 166, 167, 169, 170, 177], "casino": [163, 175], "cast": 18, "cat": 138, "catch": 155, "categor": [7, 170], "categori": 1, "caught": 154, "caus": [140, 147, 153, 154, 155, 157, 162], "cbar": 175, "cc": 131, "cclauss": 154, "cd": 1, "cell": [148, 151, 158, 165], "cellular": [148, 155], "center": [14, 18, 20, 128, 129, 132, 135, 137, 140, 141], "centr": [137, 140, 141], "centric": 177, "certain": [2, 143, 145, 154, 155, 163, 170, 175], "cfrc_ext": [137, 140, 141], "chain": [15, 131, 146, 157], "challeng": [148, 163, 175], "chanc": 11, "chandrasekar": 177, "chang": [1, 2, 3, 9, 11, 12, 13, 14, 15, 16, 17, 18, 19, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 143, 146, 147, 148, 153, 154, 155, 157, 158, 159, 160, 161, 162, 163, 165, 167, 169, 175, 177], "changelog": [4, 154, 155], "channel": [14, 17, 18], "charact": [7, 9], "character": 148, "charg": [6, 148], "charlist": 7, "charraut": 155, "charset": 7, "check": [1, 2, 5, 7, 15, 17, 20, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 158, 165, 167, 169, 173, 175], "check_env": [9, 155], "check_environments_match": 155, "checker": [4, 20, 154, 155], "cheetah": 136, "child": 13, "children": 11, "choic": [170, 173, 177], "choos": [158, 165, 169, 173], "choose_act": 173, "chosen": [2, 7, 151, 158, 170, 173], "chri": 128, "christofkaufmann": 155, "chrisyeh96": 155, "chunk": 17, "ci": [9, 154, 155], "cinert": [140, 141], "circl": 165, "citat": 155, "clamp": [140, 141], "clarifi": [154, 155], "class": [2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 14, 15, 17, 18, 19, 136, 148, 154, 155, 157, 158, 162, 163, 165, 167, 169, 170, 173, 175, 177], "classic": [129, 131, 132, 133, 134, 135, 139, 143, 146, 147, 148, 154, 155, 157, 160, 166], "classic_control": [17, 154, 160], "classif": 148, "classmethod": 15, "clean": [2, 148], "cleaner": 153, "cleanup": 154, "clear": [13, 17, 155], "clear_mpi_env_var": 13, "clearer": [2, 154], "clearli": 5, "clementperroud": 155, "cliff": [149, 155], "cliff_walk": 151, "cliffwalk": [151, 154, 155], "climb": 130, "clip": [14, 16, 19, 20, 133, 134, 137, 155, 157, 158, 165, 167, 170], "clip_reward": 165, "clipact": [14, 16, 20, 157], "clipreward": [10, 14, 19, 20, 155, 167], "clobber": 154, "clock": 165, "clockwis": 135, "clone": [148, 169], "close": [2, 3, 7, 9, 10, 11, 12, 14, 15, 17, 19, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 136, 137, 139, 140, 145, 147, 154, 155, 156, 157, 160, 161, 167, 170, 173, 175], "close_extra": [10, 11, 12], "closer": [129, 150, 155], "closest": 127, "cloudpickl": 13, "cloudpicklewrapp": 13, "cluster": 148, "cmap": [173, 175], "cnt": 155, "co": [131, 133, 134, 135, 142, 145, 148], "coars": 131, "code": [2, 3, 5, 6, 9, 11, 14, 15, 17, 18, 19, 131, 148, 154, 155, 158, 162, 163, 165, 167, 170, 171, 173, 175, 177], "codebas": [155, 165], "coeffici": [146, 170], "coin": 7, "coincid": [158, 165], "col": [151, 152], "cola": 155, "colder": 154, "collect": [2, 10, 11, 12, 17, 20, 148, 155, 156, 161, 163, 169, 170, 175], "collid": 129, "collis": [133, 134, 154], "color": [2, 6, 153], "color_palett": 173, "colour": [128, 154], "column": [4, 173, 177], "com": [19, 140, 141, 151, 152, 154, 165, 169, 173, 177], "combin": [2, 5, 9, 15, 145, 157, 170], "come": [129, 137, 138, 139, 140, 146, 147], "comet": 155, "command": [136, 165], "comment": [136, 154, 155, 170], "commit": [155, 165], "common": [7, 15, 18, 20, 155, 157, 158, 166, 167, 177], "commonli": [20, 154, 157], "commonroad": 148, "commun": [11, 154, 160], "compar": [139, 140, 146, 147, 148, 163, 170, 173, 175], "compartment": 155, "compat": [18, 144, 148, 154, 155, 156, 157], "compil": [148, 154, 162], "compilergym": 148, "complement": 2, "complet": [1, 4, 11, 17, 128, 129, 155, 157, 158, 160, 163, 166, 167, 175], "complex": [5, 6, 136, 140, 141, 142, 143, 155, 162, 170, 177], "compli": 154, "complianc": 154, "compliant": [154, 159], "complic": [7, 15, 167], "compon": [18, 157, 158, 165], "compos": [9, 137, 138, 139, 140, 141, 143, 146, 147], "composit": 7, "compound": 8, "comprehens": [148, 154], "compris": 7, "comput": [2, 3, 9, 18, 19, 20, 136, 137, 154, 155, 158, 163, 165, 170, 175, 177], "computation": [148, 162], "compute_metr": 9, "concat": 173, "concaten": [5, 8, 11, 12, 13, 136, 155], "conceptu": 7, "concern": 177, "conda": [155, 165], "condit": [2, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 163, 166, 169, 175, 177], "conduct": 155, "config": 4, "configur": [126, 130, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 165, 170, 175], "confirm": [131, 154], "conflict": 9, "conjunct": [9, 13], "connect": [2, 6, 11, 131, 137, 138, 139, 140, 141, 145, 146, 147, 148], "connectionist": 177, "consecut": 129, "consequ": 154, "consid": [4, 128, 129, 130, 136, 137, 139, 140, 143, 147, 165, 167], "consist": [3, 7, 127, 129, 131, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 154, 155, 157, 158, 162, 165, 167, 169, 175, 177], "consol": 4, "consolid": 154, "constant": [3, 129, 134, 137, 140, 141, 142], "constantli": 145, "constitu": 6, "constraint": [140, 141, 142, 148], "construct": [7, 9, 131, 137, 138, 139, 140, 141, 146, 147, 148], "constructor": [4, 9, 15, 154, 155, 157, 165], "consult": 169, "consumpt": 2, "contact": [127, 129, 136, 137, 140, 141, 142, 154], "contact_cost": [137, 140], "contact_cost_rang": 140, "contact_cost_weight": [137, 140, 169], "contact_force_rang": 137, "contain": [1, 2, 5, 8, 10, 13, 17, 18, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 157, 158, 160, 163, 165, 169, 170, 172, 175, 177], "context": [4, 11, 13, 148, 177], "contextu": 148, "continu": [2, 5, 9, 14, 18, 19, 128, 129, 130, 133, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 157, 166, 170, 175, 177], "continuous_mountaincarenv": 134, "contrari": [14, 19], "contrast": 160, "contribut": [126, 148, 155, 172], "contributor": 155, "control": [2, 5, 7, 126, 128, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 157, 160, 167, 169, 170, 173], "conveni": [6, 15, 157, 158, 165], "convent": [1, 2], "converg": [163, 173, 175], "convers": [3, 148, 159], "convert": [1, 5, 6, 14, 18, 20, 128, 129, 148, 154, 155, 158, 159, 173, 175], "convert_to_done_step_api": 9, "convert_to_terminated_truncated_step_api": [9, 159], "convolv": [155, 170, 175], "cool": 155, "coordin": [7, 14, 127, 129, 135, 137, 138, 139, 140, 141, 144, 145, 146, 147, 148], "coordinatesof": 142, "copi": [10, 11, 12, 13, 165], "copier": 165, "core": [154, 155, 157], "correct": [4, 9, 16, 129, 148, 153, 154, 155, 165, 166, 177], "correctli": [2, 9, 14, 18, 19, 154, 155, 165], "correspond": [1, 2, 4, 11, 12, 15, 18, 129, 131, 132, 133, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 153, 158, 160, 165, 170, 173, 177], "correspondingli": 128, "cosin": [131, 142, 145], "cost": [127, 155], "could": [2, 9, 14, 137, 138, 139, 140, 141, 143, 146, 147, 155, 157, 165, 166, 167, 169, 170, 173, 175, 177], "coulom": 146, "count": [7, 127, 129, 150, 163, 175], "counter": [135, 155], "coupl": [157, 158, 170], "cours": [148, 173], "cover": [5, 12, 163, 175], "coverag": 148, "cpu": [129, 136, 162, 170], "craft": 129, "craftium": 148, "crash": [129, 157], "crazyfli": 148, "creat": [4, 5, 7, 9, 10, 11, 12, 13, 14, 17, 127, 128, 129, 145, 148, 154, 155, 157, 162, 167, 169, 170, 175, 177], "create_empty_arrai": [5, 13, 155], "create_grid": 175, "create_plot": 175, "create_se": 154, "create_shared_memori": [5, 13], "creation": [2, 4, 155, 158, 165], "creatur": 148, "critic": [2, 17, 155], "critic_lay": 170, "critic_loss": 170, "critic_losses_moving_averag": 170, "critic_lr": 170, "critic_optim": 170, "critic_weight": 170, "critic_weights_path": 170, "cross": [151, 152], "crucial": 5, "crypto": 148, "cstride": 175, "ctr_cost": 147, "ctrl_cost": [137, 138, 139, 140, 146, 147], "ctrl_cost_weight": [137, 138, 139, 140, 141, 146, 147, 169], "ctx": 13, "cube": 5, "cuda": [148, 170], "cum_reward": 173, "cumsum": 173, "cumul": [9, 14, 17, 20, 173], "cumulative_reward": 9, "current": [2, 4, 7, 9, 10, 17, 18, 128, 148, 150, 151, 152, 153, 154, 155, 158, 162, 163, 165, 170, 173, 175, 177], "current_col": [151, 152], "current_namespac": 4, "current_row": [151, 152], "curriculum": 154, "curv": 154, "custom": [2, 5, 10, 11, 12, 14, 15, 17, 18, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 152, 154, 155, 157, 160, 168, 171], "customis": 148, "customiz": 148, "customspaceerror": 13, "cvel": [140, 141], "cyaneu": 152, "cycl": 157, "cylind": 144, "cython": 155, "d": [7, 9, 15, 131, 148, 170, 172], "dac": 148, "dacbench": [148, 155], "daemon": 11, "dag": 148, "dai": 126, "darkgrid": 177, "data": [5, 8, 9, 10, 15, 148, 154, 155, 158, 160, 161, 162, 165, 167, 170, 173, 175, 177], "data_equival": 155, "data_equivil": 155, "databas": 2, "datafram": [173, 177], "datapoint": 8, "dataset": 155, "datatyp": 154, "david": 173, "davidslayback": 155, "de": 155, "dead": [154, 155], "deal": [6, 170], "dealer": [150, 154, 163, 175], "dealer_count": 175, "debug": [2, 5, 9, 148, 149, 154, 155], "decai": [163, 175], "decay_epsilon": [163, 175], "decent": [163, 175, 177], "decid": [150, 155, 173], "decis": [154, 155, 157, 160, 166], "deck": [150, 163, 175], "declar": 158, "decomposit": 153, "decreas": [129, 162, 173], "deep": [148, 154, 162, 166, 170, 173, 177], "deepmind": [136, 154, 169], "def": [9, 14, 155, 158, 163, 165, 167, 170, 173, 175, 177], "default": [2, 4, 5, 6, 7, 9, 11, 12, 13, 14, 15, 17, 18, 19, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 151, 154, 155, 157, 160, 165, 166, 169], "default_camera_config": [136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "default_rng": 173, "defaultdict": [163, 175], "defin": [2, 3, 5, 8, 11, 12, 14, 15, 16, 17, 18, 135, 154, 155, 157, 158, 165, 166, 167, 170, 173], "definit": [137, 139, 140, 142, 154, 155, 160, 166], "degre": [11, 140, 141, 142, 167, 170], "delai": [18, 20, 155, 165], "delayobserv": [18, 20, 155], "deleu": 155, "deliv": 153, "delta": [173, 177], "demand": 128, "demo": 1, "demonstr": [14, 127, 148], "dennybritz": 151, "denot": [2, 132], "densiti": [144, 155], "depend": [2, 6, 9, 10, 14, 18, 19, 126, 129, 130, 132, 136, 137, 138, 139, 140, 141, 146, 147, 152, 154, 155, 165, 170], "dependencynotinstal": [9, 17], "deprec": [2, 136, 154, 155], "depreci": 154, "deque_s": [14, 17, 163, 170, 175], "deriv": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 177], "derk": 148, "desc": [152, 173], "describ": [2, 5, 6, 131, 132, 146, 150, 154, 155, 157, 160, 167], "descript": [20, 136, 155, 165], "design": [131, 140, 141, 148, 149, 153, 165], "desir": [9, 14, 153, 169, 170], "despit": 155, "destin": 153, "destroi": [129, 155, 157], "detach": 170, "detail": [5, 6, 7, 9, 19, 131, 154, 155, 157, 158, 160, 165, 169, 170], "detect": [155, 165], "determin": [2, 17, 129, 131, 132, 133, 134, 135, 152, 154, 155, 157, 165, 166, 169], "determinist": [129, 131, 133, 134, 158, 165, 170], "dev": [2, 154, 177], "develop": [136, 148, 154, 155, 161, 177], "deviat": [14, 19, 146, 170, 177], "devic": [14, 17, 170], "devicearrai": 17, "dexter": 148, "dexteroushand": 148, "df1": 177, "diagnost": 2, "diagram": 135, "dict": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 18, 20, 136, 151, 152, 153, 154, 155, 157, 158, 160, 165, 177], "dict_info": 14, "dict_kei": 18, "dict_time_kei": 18, "dictat": [128, 129], "dictinfotolist": [10, 14, 20, 155], "dictionari": [2, 5, 6, 7, 9, 10, 14, 18, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 157, 158, 160, 163, 165, 167, 175, 177], "did": [155, 166], "didn": [155, 173], "die": 128, "dietterich": 153, "differ": [2, 5, 6, 9, 11, 12, 16, 17, 18, 128, 135, 136, 147, 148, 154, 155, 157, 158, 160, 161, 165, 166, 167, 169, 170, 173], "differenti": 166, "difficult": [132, 136, 142, 143, 155, 162, 173], "difficulti": 154, "digit": 7, "dim": [140, 141], "dimens": [5, 7, 8, 18, 148, 155, 177], "dimension": [5, 17, 129, 137, 138, 139, 146, 147, 155, 157, 158, 165, 170], "direct": [6, 131, 132, 133, 134, 137, 138, 139, 140, 142, 143, 146, 147, 148, 151, 152, 153, 158, 165, 173, 177], "directli": [2, 6, 10, 15, 17, 131, 136, 157, 158, 163, 165, 175, 177], "directori": [1, 154, 165], "dirhtml": 1, "disabl": [4, 17, 18, 19, 151, 152, 154, 155, 169], "disable_env_check": [4, 16, 17, 154], "disable_logg": 17, "disable_print": 4, "disable_render_order_enforc": [17, 154], "disallow": 153, "disc_to_cont": 167, "discard": 18, "discord": [148, 154, 155, 172], "discount": [14, 19, 163, 166, 170, 173, 175, 177], "discount_factor": [163, 175], "discourag": 154, "discov": [155, 169], "discret": [2, 5, 6, 7, 8, 11, 13, 18, 128, 129, 131, 132, 133, 134, 148, 149, 150, 151, 152, 153, 154, 155, 157, 158, 163, 165, 167, 175], "discrete_act": 165, "discreteact": 167, "discreteenv": 154, "discuss": [154, 155], "displac": [137, 138, 139, 140, 146, 147], "displai": [2, 4, 132, 154, 165], "dissect": 173, "dist": 144, "distanc": [138, 139, 141, 144, 147, 158, 165], "distance_from_origin": [137, 140], "distance_penalti": 142, "distinct": 166, "distinguish": [2, 165, 166], "distrib": 177, "distribut": [5, 6, 7, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 152, 154, 169, 170, 173, 177], "divers": 156, "divid": [14, 19], "do": [4, 13, 15, 17, 128, 129, 131, 154, 155, 158, 161, 165, 167, 169, 170, 173, 177], "doc": [9, 129, 154, 155, 167], "dockerfil": 154, "docstr": [1, 154, 155], "doctest": 155, "document": [5, 7, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 130, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 154, 155, 157, 158, 163, 165, 169, 170, 175], "doe": [2, 6, 8, 9, 10, 11, 12, 13, 14, 17, 18, 19, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 154, 155, 158, 165, 166, 167, 170], "doesn": [1, 2, 4, 15, 17, 129, 154, 155, 158, 166, 173], "dof": [137, 140, 141], "dog": 9, "doi": [153, 155], "domain": [16, 128, 131, 133, 134, 153, 167, 168, 171], "domain_random": 128, "don": [6, 14, 15, 128, 133, 154, 155, 158, 160, 165, 166, 170, 173], "done": [2, 9, 14, 154, 155, 157, 159, 160, 163, 165, 166, 169, 170, 173, 175, 177], "donesteptyp": 9, "doom": 148, "dot": [6, 7, 165], "doubl": 136, "doublependulum": 142, "down": [5, 7, 128, 141, 148, 150, 151, 152, 153, 155, 158, 163, 165, 166, 173, 175], "download": [157, 165, 166, 167, 170, 171, 173, 175, 177], "downward": 131, "dramat": [154, 155], "draw": [6, 150, 163, 165, 175], "drawn": [6, 7, 144, 146, 150], "drive": 128, "drone": 148, "drop": [153, 155], "drryanhuang": 155, "dsctt": 154, "dstack": 175, "dt": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 169], "dtype": [2, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 20, 129, 131, 132, 133, 134, 135, 154, 155, 158, 165, 173], "dtypelik": 5, "dtypeobserv": [14, 18, 20, 155], "dual": 148, "duburcqa": 154, "duckietown": 148, "due": [2, 129, 132, 136, 152, 154, 155, 160, 163, 166, 170], "dump": 155, "dunbar": 155, "dunder": 155, "duplic": [14, 155], "durat": [9, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 177], "dure": [2, 4, 6, 7, 10, 14, 17, 18, 19, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 158, 165, 167, 169], "dw": 9, "dx": [137, 138, 139, 140, 146, 147], "dylwil3": 155, "dynam": [2, 131, 135, 136, 142, 148, 154, 155], "e": [1, 2, 4, 5, 7, 8, 9, 11, 13, 14, 15, 17, 129, 137, 138, 139, 140, 141, 143, 144, 145, 146, 147, 148, 150, 152, 154, 155, 158, 163, 165, 167, 170, 173, 175, 177], "each": [2, 5, 6, 7, 9, 10, 11, 12, 13, 14, 16, 17, 18, 20, 126, 127, 129, 130, 131, 132, 133, 134, 136, 137, 140, 141, 142, 143, 144, 145, 148, 149, 151, 153, 154, 155, 157, 158, 161, 163, 165, 170, 173, 175], "earli": [126, 163, 175, 177], "earlier": [137, 140, 141, 158, 165], "earliest": 177, "easi": [148, 149, 154, 157, 173], "easier": [130, 154, 170], "easiest": [128, 154, 155], "easili": [6, 10, 15, 148, 154, 155, 162], "east": 153, "ed": 131, "edg": [5, 6, 8, 132, 154, 155, 157], "edge_link": [6, 8], "edge_mask": 6, "edge_spac": [6, 8], "edgecolor": 175, "edit": [148, 154, 165], "effect": [2, 8, 13, 15, 17, 129, 137, 141, 142, 143, 144, 145, 146, 153, 157, 163, 175, 177], "effector": [144, 145], "effici": [11, 133, 134, 136, 148, 155, 162, 170], "effort": [130, 155], "eg": 165, "egl": 136, "eight": 137, "einforc": 177, "either": [7, 8, 9, 12, 17, 133, 134, 148, 150, 152, 157, 159, 170, 173, 177], "ekalosak": [154, 155], "elaps": [14, 17], "elbow": 144, "electron": 148, "element": [2, 5, 6, 7, 10, 11, 13, 18, 132, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 157, 158, 165, 169, 170, 177], "elev": 169, "eleven": 136, "elf": 152, "elliottow": 155, "elo": 148, "els": [14, 18, 139, 147, 152, 155, 158, 163, 165, 170, 173, 175], "emerg": 155, "emit": 2, "emploi": 17, "empti": [5, 7, 13, 17, 137, 138, 139, 140, 141, 146, 147, 155, 163, 173, 175, 177], "emul": [154, 160], "en": [18, 177], "enabl": [4, 9, 17, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 128, 148, 154, 155], "enable_wind": [129, 170], "encapsul": 2, "encod": [8, 148, 153, 154, 158, 165], "encompass": 148, "encount": 170, "encourag": [154, 169, 170], "end": [2, 3, 10, 14, 17, 127, 135, 136, 154, 155, 156, 157, 158, 161, 163, 166, 169, 170, 173, 175, 177], "endeavor": 148, "enemi": 157, "energi": [127, 129, 132, 167], "enforc": [4, 17, 154], "engin": [129, 136, 148, 155], "english": 7, "engstrom": [14, 19], "enough": [169, 170], "ensur": [1, 2, 4, 6, 8, 9, 129, 148, 155, 157, 163, 165, 175], "ent_coef": 170, "enter": [2, 9, 129, 151], "entir": [9, 17, 139, 147, 154, 155, 170], "entiti": 5, "entrain": 148, "entri": [4, 154, 155, 158, 160], "entropi": [2, 170], "entropy_moving_averag": 170, "entry_point": [4, 154, 158, 165], "entrypoint": 165, "enum": [4, 165], "enumer": 173, "env": [4, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 163, 166, 167, 168, 169, 171, 173, 175, 177], "env_check": [9, 154, 155], "env_fn": [11, 12], "env_id": [4, 155, 159], "env_lambda": 9, "env_nam": [4, 159], "env_new": 9, "env_old": 9, "env_spec": 155, "env_typ": 1, "envapicompat": 154, "envcompat": [154, 159], "envcreat": 4, "environ": [4, 5, 7, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 126, 127, 128, 129, 130, 131, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 149, 150, 151, 152, 153, 154, 155, 156, 161, 163, 166, 167, 168, 171, 177], "environment_cr": 165, "environmentnam": 155, "envpool": [154, 155], "envs_wrapp": 170, "envspec": [2, 3, 4, 10, 15, 17, 155], "ep": [173, 177], "ep_action_log_prob": 170, "ep_reward": 170, "ep_value_pr": 170, "episod": [2, 4, 9, 10, 14, 17, 18, 19, 20, 154, 155, 156, 157, 158, 160, 163, 165, 166, 169, 170, 173, 175, 177], "episode_id": 9, "episode_index": 9, "episode_num": 161, "episode_ov": [157, 161], "episode_returns_moving_averag": 170, "episode_reward": [14, 19], "episode_trigg": [9, 17, 161], "episodestatisticsrecord": 155, "epoch": 170, "epsilon": [14, 18, 19, 163, 173, 175], "epsilon_decai": [163, 175], "epsilongreedi": 173, "equal": [11, 12, 129, 133, 134, 139, 144, 147, 150, 152, 158, 173], "equat": [131, 135], "equival": [4, 5, 8, 10, 14, 137, 140, 154, 155, 160, 170], "erez": [139, 140, 141], "ericonaldo": 154, "error": [2, 4, 7, 8, 9, 11, 17, 20, 154, 155, 166, 170, 175], "errorr": 154, "escap": 2, "especi": [5, 165, 167, 170], "essenti": [7, 169], "estim": [19, 137, 166, 170, 177], "et": [14, 16, 17, 19], "etc": [2, 5, 10, 137, 138, 139, 140, 146, 147, 154, 155, 157, 158, 165, 166, 167], "euclidean": [144, 145], "ev": 148, "ev2gym": 148, "eval": [161, 170], "evalu": [14, 17, 18, 19, 148, 155, 161, 163, 175], "evanmath": 154, "even": [2, 4, 137, 139, 140, 142, 143, 147, 154, 165, 175], "event": [9, 154, 165], "eventu": 13, "ever": [126, 154], "everi": [1, 5, 6, 9, 14, 17, 18, 19, 128, 129, 132, 137, 139, 140, 142, 143, 147, 155, 157, 165, 170, 177], "everyon": 136, "everyth": [17, 148], "evolv": 9, "exact": [150, 155, 175], "exactli": [8, 146], "exampl": [1, 2, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 128, 131, 136, 148, 150, 151, 152, 154, 155, 157, 158, 161, 163, 165, 166, 167, 170, 171, 175], "examplefuncenv": 155, "exce": [127, 150], "exceed": [2, 17, 20, 157, 163, 175], "excel": 169, "except": [2, 154, 155, 165], "exchang": 157, "exclud": [4, 17, 20, 137, 138, 139, 140, 141, 142, 145, 146, 147, 153], "exclude_current_positions_from_observ": [137, 138, 139, 140, 141, 146, 147, 169], "exclude_namespac": 4, "exclus": [6, 136, 140, 141, 155], "execut": [1, 9, 16, 153, 157, 165, 170, 177], "exhibit": 136, "exist": [2, 4, 5, 10, 11, 12, 15, 16, 17, 18, 19, 137, 138, 139, 140, 141, 143, 146, 147, 154, 155, 157, 162, 170, 173], "exist_ok": 173, "exit": [9, 10, 11, 12], "exp": 177, "expand": [5, 7, 160], "expect": [2, 4, 5, 6, 7, 154, 155, 157, 169, 170, 173, 177], "expens": [5, 154], "experi": [17, 136, 137, 140, 141, 142, 143, 148, 163, 170, 173, 175, 177], "experiment": [3, 131, 155], "explain": [154, 158], "explan": [157, 160, 163, 175, 177], "explanatori": [163, 175], "explicit": 155, "explicitli": [3, 165, 166, 169], "exploit": [163, 173, 175], "explor": [2, 5, 148, 155, 162, 163, 170, 173, 175], "explor_exploit_tradeoff": 173, "exponenti": [7, 14, 19, 20], "export": 155, "expos": [3, 155], "express": [5, 9, 137, 140, 141], "ext_control": 6, "extend": [2, 17, 137], "extens": [148, 155, 177], "extent": 152, "extern": [2, 137, 140, 141, 154, 155, 160, 166], "extra": [154, 155, 160, 163, 165, 175], "extract": [9, 155, 173], "extrem": [148, 149], "ez": 154, "ezpickl": [4, 9, 15], "f": [161, 170, 173, 175, 177], "f_": [137, 140, 141], "face": [137, 140, 141, 150, 163, 175], "facecolor": 175, "facedown": 150, "facilit": 136, "fact": 177, "factor": [14, 19, 129, 163, 170, 175, 177], "factori": 4, "fail": 155, "failur": 166, "fair": 7, "fairli": [154, 155, 177], "fall": [127, 140, 151, 152, 166, 169, 173], "fallen": [139, 169], "fals": [4, 6, 9, 10, 11, 12, 14, 16, 17, 18, 19, 128, 129, 132, 137, 138, 139, 140, 141, 146, 147, 150, 154, 155, 157, 158, 160, 161, 163, 165, 169, 170, 173, 175, 177], "familiar": 169, "famou": 163, "far": [127, 128, 141, 142, 144, 145, 152], "farama": [21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 148, 155, 163, 165, 170, 172, 175, 177], "fast": [128, 136, 138, 140, 142, 146, 148], "faster": [127, 129, 137, 140, 141, 142, 145, 155, 170], "favor": [2, 154, 155], "favorit": [163, 175], "favour": [2, 154, 155, 159, 160], "featur": [6, 7, 18, 136, 154, 155, 170], "feature_spac": 6, "fed": 170, "feed": 170, "feedback": 155, "feel": [169, 177], "feet": [138, 147], "few": [1, 131, 137, 138, 139, 140, 141, 143, 146, 147, 154, 155, 173, 175], "ffffffff": 152, "fffffhff": 152, "fffh": 152, "fffhffff": 152, "fffhfffg": 152, "ffoot": 138, "ffset": 177, "fhffhfhf": 152, "fhfh": 152, "fhhfffhf": 152, "fibonacci": 177, "fickl": 153, "fidel": 148, "field": [155, 162], "fieteo": 154, "fig": [170, 173, 175], "fig1": 175, "fig2": 175, "figaspect": 175, "fight": 148, "figsiz": [170, 173, 175, 177], "figur": [139, 155, 173, 175, 177], "file": [1, 131, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 165, 169], "filenam": [9, 17], "fill": 165, "filter": [14, 18, 20], "filter_kei": [14, 18], "filterobserv": [14, 18, 20], "filterobservationwrapp": 18, "final": [3, 9, 18, 154, 155, 161, 163, 165, 166, 169, 175], "final_epsilon": [163, 175], "final_info": [14, 17, 154, 155], "final_observ": [14, 17, 154, 155], "find": [2, 4, 155, 162, 165, 169, 173], "find_highest_vers": 4, "fine": [165, 170], "finetun": 148, "finfo": 173, "fingertip": [144, 145], "finish": [2, 128, 129, 154, 155, 165, 170], "finit": [6, 7, 137, 143, 147, 166, 167, 177], "fire": 129, "firefoxmetzg": 154, "firmwar": 148, "first": [2, 4, 6, 9, 10, 11, 12, 15, 129, 131, 133, 134, 137, 138, 139, 140, 141, 142, 145, 146, 147, 151, 153, 154, 155, 156, 157, 158, 160, 161, 163, 165, 173, 175, 177], "five": [130, 154, 177], "fix": [5, 6, 7, 17, 20, 129, 131, 132, 135, 136, 137, 138, 139, 140, 142, 143, 144, 145, 147, 148, 150, 152, 154, 155, 157, 158, 160, 165, 177], "fixtur": 145, "flag": [11, 133, 136, 148], "flake8": [154, 155], "flappi": 148, "flat": [5, 6, 7, 8], "flatdim": [5, 8, 154], "flatt": 170, "flatten": [5, 8, 14, 18, 20, 136, 154, 155, 158, 165, 170, 173, 175], "flatten_spac": [5, 8], "flattenobserv": [6, 14, 18, 20, 157, 158, 165], "flesh": 3, "flex": 144, "flexibl": [11, 148, 155], "flight": 148, "float": [4, 7, 8, 9, 14, 16, 17, 18, 19, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 163, 165, 170, 173, 175, 177], "float32": [2, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 18, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155], "float64": [8, 10, 14, 18, 19, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "flow": 148, "fluid": 146, "fly": [129, 160], "flycraft": 148, "fmt": 173, "fn": 13, "focu": [148, 170], "focus": 172, "foddi": 148, "folder": [1, 9, 17, 155, 161, 173], "follow": [1, 3, 4, 5, 9, 10, 14, 16, 17, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 129, 131, 132, 133, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 158, 161, 165, 166, 167, 169, 170, 173, 175, 177], "fontsiz": [173, 175], "foot": [11, 138, 139, 140, 141, 147], "foot_joint": [139, 147], "foot_left_joint": 147, "forc": [129, 132, 133, 134, 136, 137, 140, 141, 142, 143, 147, 154, 169, 177], "forcibli": 166, "forearm": [140, 141, 144], "forev": 166, "forex": 148, "forg": 165, "forget": [14, 15, 165], "fork": [1, 155, 156, 160], "form": [5, 6, 7, 17, 131, 146, 155, 165], "formal": 166, "format": [1, 4, 5, 9, 14, 155, 157, 169], "formerli": 155, "forum": [14, 19], "forward": [13, 19, 127, 137, 138, 139, 140, 141, 146, 147, 154, 170, 177], "forward_reward": [137, 138, 139, 140, 146, 147, 169], "forward_reward_weight": [137, 138, 139, 140, 146, 147, 169], "found": [20, 136, 142, 154, 155, 163, 175, 177], "foundat": [148, 155, 165, 172], "four": [128, 129, 137, 139, 148, 153, 155, 157, 158, 163, 175], "foxik": 154, "fp": [2, 9, 10, 17, 129, 155, 165], "frac": [137, 138, 139, 140, 146, 147], "frame": [2, 9, 10, 17, 18, 20, 128, 129, 144, 145, 154, 155, 169, 173], "frame_skip": [17, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 169], "framer": 165, "frameskip": 17, "frameskipobserv": 155, "framestack": [17, 18, 154, 155], "framestackobserv": [17, 18, 20, 155], "frametim": [137, 138, 139, 140, 141, 146, 147], "framework": [136, 148, 155, 169], "franuka": 152, "free": [131, 135, 136, 137, 140, 141, 142, 143, 148, 163, 169, 175, 177], "free_body_rot": 146, "freedom": [140, 141, 142, 167], "freejoint": [140, 141], "freez": [14, 18, 19], "frequenc": 17, "frequent": [154, 166], "friction": [127, 146, 147, 148], "frictionless": 132, "from": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 127, 128, 129, 131, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 157, 158, 160, 161, 162, 163, 166, 169, 170, 173, 175, 177], "from_json": [5, 155], "front": [137, 138, 146, 147], "front_left_leg": 137, "front_right_leg": 137, "frontpag": 155, "frozen": [149, 154, 155, 173], "frozen_lak": [152, 173], "frozenlak": [9, 152, 154, 155, 171, 176], "frozenlake4x4": 155, "frozenlake8x8": [152, 154, 155], "frozenlake_q_values_": 173, "frozenlake_states_actions_distrib_": 173, "frozenlake_steps_and_reward": 173, "frozenlake_tuto": 173, "frozenset": 7, "fshin": 138, "fthigh": 138, "fuel": 129, "full": [3, 4, 128, 129, 154, 155, 157, 158, 163, 175], "fulli": [2, 129, 137, 138, 139, 140, 141, 143, 146, 147, 148, 154], "fun": [175, 177], "func": [3, 14, 16, 18, 19, 155], "func_env": 3, "funcenv": [3, 155], "funcjaxenv": 155, "function": [2, 8, 10, 11, 12, 14, 16, 17, 18, 19, 20, 129, 132, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 152, 153, 154, 155, 157, 159, 160, 163, 166, 167, 169, 170, 172, 173, 175, 177], "functional_jax_env": 3, "functionaljaxenv": 3, "furcolor": 9, "further": [2, 5, 7, 129, 144, 145, 154, 160, 173, 177], "furthermor": [2, 9, 155, 157], "furthest": 127, "futur": [3, 154, 155], "future_q_valu": [163, 175], "g": [1, 2, 4, 5, 7, 8, 9, 11, 12, 13, 131, 135, 137, 138, 139, 140, 141, 143, 146, 147, 148, 152, 153, 154, 155, 158, 163, 165, 167, 170, 173, 175, 177], "ga": 128, "gae": 170, "gain": [155, 167], "galleri": [1, 155], "game": [7, 9, 17, 126, 150, 151, 152, 154, 155, 157, 158, 163, 165, 169, 175], "gamma": [14, 19, 166, 170, 173, 177], "gap": 148, "garbag": [10, 11, 12], "gather": 165, "gaussian": 136, "gcc": 148, "gen_gif": 1, "gen_md": 1, "gener": [1, 2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 13, 15, 17, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 161, 162, 170], "generalis": [2, 10], "generate_random_map": [152, 173], "geometr": [6, 136], "geometri": 155, "get": [2, 4, 11, 12, 14, 15, 18, 19, 127, 129, 136, 137, 139, 140, 142, 150, 154, 155, 157, 163, 165, 166, 169, 170, 173, 175], "get_act": [163, 175], "get_attr": [11, 12, 155], "get_env_id": 4, "get_loss": 170, "get_rect": 165, "get_stat": 154, "get_wrapper_attr": [15, 155], "gianluca": 155, "gianlucadecola": [154, 155], "gif": [1, 131, 155], "git": [165, 169], "github": [19, 129, 132, 136, 137, 139, 140, 141, 142, 143, 144, 145, 147, 151, 154, 155, 165, 169], "give": [2, 10, 129, 137, 150, 153, 154, 163, 170, 175, 177], "given": [3, 4, 7, 16, 127, 129, 130, 131, 132, 133, 134, 137, 139, 140, 142, 143, 147, 154, 155, 157, 161, 166, 175], "glfw": 136, "global": 4, "gnn": 148, "go": [2, 9, 128, 148, 150, 154, 158, 162, 163, 166, 169, 170, 175, 177], "go1": 169, "goal": [2, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 158, 165, 177], "goal_slidei": 144, "goal_slidex": 144, "goal_veloc": [133, 134], "goalenv": 160, "goe": [150, 154], "gone": 154, "good": [163, 170, 173, 175], "googl": [154, 169], "goul\u00e3o": 155, "gpu": [136, 148, 162, 170], "gracefulli": 154, "gradient": [170, 177], "grai": 17, "grammat": 155, "grant": 129, "graph": [5, 6, 8, 13, 148, 154, 155, 157, 170, 175], "graphic": [136, 154], "graphinst": [6, 8, 13, 154, 155], "grave": 170, "gravit": 129, "graviti": [129, 132, 133, 135, 142, 143, 148, 170, 177], "grayscal": [14, 17, 18, 20], "grayscale_env": 18, "grayscale_newaxi": 17, "grayscale_ob": 17, "grayscaleobserv": [14, 18, 20], "grd": 158, "great": [148, 167, 175], "greater": [129, 131, 132, 133, 134, 143, 150, 163, 175], "greedi": [163, 173, 175], "greedili": [163, 175], "green": [131, 153], "grei": 175, "greyscal": 17, "grid": [148, 151, 152, 153, 158, 165, 175], "grid_world": 165, "gridlin": 165, "gridworld": [2, 148, 151, 158, 165], "gridworldenv": [158, 165], "grip": [163, 175], "ground": [127, 129, 141, 148], "group": [129, 154], "gt": [154, 155], "guarante": [5, 6, 13, 17, 148, 172], "guess": 154, "guid": [148, 155, 157, 158, 169], "guidelin": 17, "guyazran": 155, "gym": [2, 4, 5, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 126, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 155, 156, 157, 158, 160, 161, 163, 165, 166, 167, 170, 173, 175, 177], "gymenv": 159, "gymfc": 148, "gymlibrari": 154, "gymnasium": [2, 3, 4, 5, 6, 7, 8, 9, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 156, 157, 158, 159, 160, 161, 162, 163, 166, 169, 170, 173, 175, 177], "gymnasium_env": [158, 165], "gymnax": 148, "gymv21compat": 155, "gymv21environ": 159, "gymv22compat": 155, "gymv22environ": 155, "gymv26environ": 159, "gyroscop": 128, "h5": 170, "ha": [1, 2, 4, 5, 7, 8, 9, 10, 13, 14, 17, 18, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 139, 140, 141, 144, 145, 146, 148, 150, 154, 155, 156, 157, 158, 160, 163, 165, 166, 169, 170, 173, 175, 177], "hacki": 13, "had": [2, 154, 155], "half": 136, "half_cheetah": 138, "halfcheetah": [14, 136, 138], "halt": 166, "hand": [148, 150, 177], "handl": [5, 10, 11, 12, 142, 150, 154, 155, 168, 171, 175], "handling_time_limit": 166, "hang": [13, 131, 154], "happen": [133, 134, 137, 139, 140, 142, 143, 147, 150, 152, 153, 154, 155, 158, 160, 166], "haracterist": 177, "hard": 173, "hardcor": 127, "harden": [154, 155], "harder": [129, 169], "hardwar": [148, 154, 155], "harlei": 170, "harm": 175, "has_autoreset": 10, "hasattr": 155, "hash_se": 154, "hasselmo": 131, "hatchl": 165, "have": [2, 4, 5, 7, 8, 10, 11, 13, 14, 15, 16, 17, 18, 19, 126, 127, 128, 131, 137, 140, 141, 142, 143, 144, 145, 147, 150, 152, 153, 154, 155, 157, 158, 159, 160, 163, 165, 166, 167, 169, 170, 173, 175, 177], "haven": 17, "head": [11, 138], "headless": [136, 154], "healthi": [137, 139, 140, 142, 143, 147], "healthy_angle_rang": [139, 147], "healthy_reward": [137, 139, 140, 142, 143, 147, 169], "healthy_state_rang": 139, "healthy_z_rang": [137, 139, 140, 147, 169], "heatmap": [173, 175], "height": [131, 136, 137, 139, 140, 142, 147, 154, 169], "held": 150, "hello": 7, "help": [2, 4, 5, 9, 154, 155, 157, 161, 163, 170, 175], "helper": 154, "helpingstar": 155, "henc": [137, 149, 177], "here": [3, 9, 131, 137, 140, 154, 155, 158, 163, 165, 169, 170], "heurist": [127, 129, 148, 155], "hffg": 152, "hidden": [2, 177], "hidden_space1": 177, "hidden_space2": 177, "hierarch": [5, 153], "high": [2, 6, 7, 8, 11, 12, 13, 14, 18, 131, 132, 134, 135, 137, 148, 154, 155, 157, 167, 169, 170], "higher": [19, 127, 144, 169, 170], "highest": 4, "highli": [5, 9, 126, 130, 148, 154, 160, 165], "highwayenv": [148, 155], "hill": [133, 134], "hing": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "hinge2": 142, "hint": [154, 155], "hinter": 154, "hip": [127, 137, 140, 141], "hip_1": 137, "hip_2": 137, "hip_3": 137, "hip_4": 137, "histor": 148, "histori": 136, "histplot": 173, "hit": [150, 163, 175], "hold": [5, 150, 163, 175], "hole": [152, 173], "homepag": 155, "hood": 155, "hook": [154, 155], "hooper": 155, "hop": [136, 139, 147], "hope": 155, "hopefulli": [154, 163, 175], "hopper": [15, 16, 136, 147, 155], "hopper_v4": 15, "hopperenv": 15, "horizon": [9, 139, 148, 166], "horizon_timestep": 9, "horizont": [127, 129, 131, 158, 165], "host": 155, "hot": 8, "hotter": 154, "how": [2, 5, 9, 11, 12, 14, 141, 142, 144, 145, 148, 155, 157, 158, 161, 163, 165, 166, 167, 169, 170, 175, 177], "howardh": 155, "howev": [2, 5, 6, 7, 8, 9, 10, 11, 13, 16, 17, 131, 137, 138, 139, 140, 141, 145, 146, 147, 154, 155, 157, 158, 160, 165, 166, 167, 170, 175], "hpo": 172, "html": [18, 163, 177], "http": [1, 2, 14, 18, 19, 128, 131, 150, 151, 152, 154, 155, 163, 165, 169, 170, 173, 175, 177], "hue": 173, "huge": [148, 154], "hull": 127, "human": [2, 6, 9, 17, 20, 128, 140, 141, 144, 148, 154, 155, 156, 157, 159, 160, 165, 170], "humanoid": [136, 155, 169], "humanoidstandup": [136, 141], "humanoidstandup_v4": 154, "humanrend": [17, 20, 155], "humnaoid": 169, "hundr": 161, "hundredth": 17, "hyperparam": 170, "hyperparamet": [163, 170, 172, 173, 175, 177], "i": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 126, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 165, 166, 167, 169, 170, 173, 175, 177], "i_": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "ianyfan": 155, "ic": 152, "ice1187": 154, "iclr": 170, "icml": 170, "icu": 148, "id": [4, 14, 17, 19, 136, 137, 140, 141, 152, 155, 158, 165], "idea": [163, 166, 175], "ident": [7, 11, 12, 136, 155], "identifi": 154, "idx": 173, "if__name__": 170, "iff": [9, 17, 165], "iforce2d": 128, "igibson": 148, "ignor": [1, 9, 150, 154, 163, 175], "ikamensh": [154, 155], "illeg": 153, "illustr": [14, 165, 166], "im": 143, "imag": [2, 5, 10, 11, 12, 14, 17, 18, 20, 128, 136], "imagesequenceclip": 9, "imagin": [148, 157, 158, 167], "img": 173, "img_titl": 173, "immedi": [9, 20, 137, 139, 140, 141, 147, 150, 158, 165], "immut": 5, "impact": [141, 173], "impact_cost": 141, "impact_cost_rang": 141, "impact_cost_weight": 141, "implement": [5, 6, 9, 10, 15, 16, 17, 20, 129, 132, 135, 142, 148, 149, 153, 154, 155, 157, 158, 159, 160, 161, 163, 165, 168, 170, 171, 177], "implementing_custom_wrapp": 167, "implicitli": 170, "import": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 163, 165, 167, 169, 170, 173, 177], "importantli": [15, 155, 157], "importerror": 155, "improv": [11, 154, 155, 162], "imshow": 173, "includ": [2, 4, 14, 17, 18, 20, 132, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 153, 154, 155, 157, 158, 159, 160, 165, 166, 169], "include_cfrc_ext_in_observ": [137, 140, 141, 169], "include_cinert_in_observ": [140, 141], "include_cvel_in_observ": [140, 141], "include_qfrc_actuator_in_observ": [140, 141], "inclus": [137, 154], "incompleteidea": [150, 151, 163], "incorrect": [153, 154, 155, 166], "incorrectli": [19, 155, 166], "increas": [129, 131, 132, 136, 139, 144, 146, 147, 155, 162, 163, 170, 173, 175], "incur": [131, 151, 153], "independ": [6, 7, 10, 129, 139, 146, 147, 155], "index": [1, 6, 9, 13, 14, 17, 18, 20, 132, 157], "indic": [2, 9, 14, 17, 128, 131, 132, 150, 151, 152, 153, 154, 160, 163, 166, 173, 175, 177], "individu": [2, 11, 12, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 154, 158, 161, 165, 167, 177], "indoril007": 154, "induc": [137, 138, 139, 140, 141, 146, 147], "induct": [137, 138, 139, 140, 141, 146, 147], "ineffici": [155, 162], "inelast": [133, 134], "inertia": [140, 141], "inf": [2, 10, 16, 18, 132, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 167, 169], "infam": [163, 175], "infeas": 7, "infer": [7, 154], "infinit": [16, 17, 18, 129, 139, 150, 154, 155, 163, 166, 175], "info": [2, 3, 9, 10, 11, 12, 14, 17, 18, 19, 20, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 156, 157, 158, 160, 161, 163, 165, 166, 167, 170, 173, 175, 177], "inform": [1, 2, 5, 6, 7, 9, 10, 12, 14, 18, 19, 20, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 158, 160, 163, 165, 166, 169, 175, 177], "infti": [7, 139], "inher": 166, "inherit": [4, 5, 15, 16, 18, 19, 154, 155, 158, 165], "init": [128, 165, 170], "initi": [2, 3, 4, 6, 7, 9, 10, 14, 15, 16, 17, 18, 19, 127, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 158, 163, 167, 169, 170, 173, 175, 177], "initial_epsilon": [163, 175], "initial_sampl": 155, "initialis": [2, 4, 10, 17, 150, 155, 156, 160], "inlin": 170, "inner": 11, "inner_st": 6, "inplac": 177, "input": [9, 10, 155, 157, 158, 163, 170, 175], "inregular": 17, "insert": [156, 177], "insid": [1, 6, 7, 17, 153, 158, 165], "inspect": 9, "inspir": [151, 155, 173], "instabl": 162, "instal": [1, 9, 17, 126, 130, 136, 154, 155, 157, 165, 169], "instanc": [2, 3, 4, 5, 6, 8, 9, 10, 11, 13, 17, 18, 155, 157, 158, 162, 167, 170], "instanti": [9, 14, 17, 18, 19, 158, 165, 173], "instead": [2, 10, 11, 14, 19, 127, 131, 137, 140, 147, 154, 157, 160, 165, 170], "institut": [133, 134], "instruct": [136, 155, 158, 165], "int": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 136, 137, 150, 151, 152, 153, 155, 158, 163, 165, 170, 173, 175, 177], "int23": 17, "int32": [6, 8, 10, 14, 18, 158], "int64": [6, 7, 8, 155, 158], "int8": [5, 7], "int_32": 17, "integ": [2, 6, 7, 8, 9, 14, 16, 17, 18, 154, 155, 158, 160, 165], "integr": [9, 155, 169], "intellig": [153, 154], "intend": [3, 14, 17, 148, 151, 152, 172, 173], "intens": 148, "interact": [5, 9, 14, 17, 20, 148, 163, 165, 170, 175], "interest": [18, 155, 161, 169], "interestingli": 173, "interfac": [9, 156, 157], "interlink": 157, "intermedi": [140, 141], "intermitt": 17, "intern": [2, 3, 10, 154, 155, 158, 161, 163], "internet": 148, "interoper": 148, "interv": [7, 137, 139, 140, 147], "intrins": 167, "introduc": [137, 140, 141, 148, 154, 155, 159, 160], "introduct": [131, 150, 151, 159, 173], "introductori": 155, "invalid": [2, 7, 155, 169, 175], "invers": 4, "invert": [135, 136, 177], "inverted_double_pendulum": 142, "inverted_pendulum": [143, 177], "inverteddoublependulum": [136, 142], "invertedpendulum": [136, 143, 177], "investig": 155, "involv": [6, 126, 142, 148, 151, 152, 153, 154, 155, 165, 177], "io": [18, 152], "ipynb": [165, 166, 167, 170, 173, 175, 177], "irrespect": 9, "is_avail": 170, "is_bound": 7, "is_np_flatten": [5, 155], "is_slipperi": [151, 152, 173], "is_slippi": 155, "is_vector_env": 9, "isaac": 148, "isinst": 155, "isn": [6, 9, 155, 158], "isort": 155, "issu": [17, 18, 129, 132, 136, 137, 139, 140, 141, 142, 143, 144, 145, 147, 154, 155, 157, 160, 165], "itch": 152, "item": [9, 13, 18, 170, 173, 175], "iter": [5, 6, 12, 13, 155], "its": [2, 9, 16, 17, 129, 135, 137, 139, 141, 144, 147, 148, 155, 157, 170, 173, 177], "itself": [3, 8, 10, 11, 12, 15, 17, 154, 157, 159, 170], "j": [155, 170, 177], "jack": 150, "jair": 153, "jamartinh": 155, "jax": [2, 14, 17, 20, 148, 155, 162], "jaxenv": [14, 17], "jaxtonumpi": [14, 17, 20, 155], "jaxtotorch": [14, 17, 20, 155], "jessefarebro": 154, "jet": 155, "jfpettit": 154, "jimini": 148, "jin": 155, "jit": 162, "jjshoot": [154, 155], "jkterry1": 154, "jnp": 17, "job": [148, 165], "job_statu": 6, "john": 155, "johnminelli": 154, "join": [5, 155], "joint": [127, 129, 131, 132, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "joint0": 145, "joint1": 145, "jordan": [137, 155, 170], "journal": 153, "journei": [154, 163, 175], "json": 155, "json_env_spec": 155, "jsonabl": 5, "jump": 169, "jumpi": 155, "jun": 155, "junk": 14, "jupyt": [1, 154, 165, 166, 167, 170, 171, 173, 175, 177], "just": [3, 13, 15, 143, 145, 154, 155, 157, 162, 165, 169, 170], "k": [8, 9, 14, 129, 146, 155, 170], "kad99kev": 155, "kallinteri": [155, 169], "karp": 170, "kavukcuoglu": 170, "kde": 173, "keep": [9, 14, 17, 18, 20, 132, 141, 143, 165], "keep_dim": [14, 18], "kei": [2, 4, 5, 6, 7, 9, 10, 14, 17, 18, 20, 137, 138, 139, 140, 141, 146, 147, 151, 152, 153, 154, 155, 157, 160, 163, 165, 166, 167, 173, 175], "kellycoinflip": 154, "kept": [10, 136, 169], "kexianshen": 154, "key_to_act": 9, "keyb0ardninja": 155, "keyboard": [7, 9, 154], "keys_to_act": [9, 154, 170], "keyword": [4, 6, 10, 11, 12, 14, 131, 132, 150, 154, 155, 157, 158, 165, 175], "kg": [154, 155], "khoda81": 155, "kind": 7, "king": 150, "kir0ul": [154, 155], "klimov": [126, 127, 128, 129], "knee": [127, 140, 141], "know": [2, 10, 155, 157, 161, 162, 165], "knowledg": 4, "known": 2, "krimmel": 155, "kvrban": 155, "kwarg": [3, 4, 9, 10, 11, 12, 14, 15, 137, 138, 139, 140, 146, 147, 155, 159, 165], "kwiatkowski": 155, "l": [14, 17, 146, 155], "l2r": 148, "lab_black": 173, "label": [173, 175], "laboratori": 148, "lack": 155, "ladder": 127, "lai": 141, "lake": [149, 154, 155], "lam": 170, "lambaactionv0": 155, "lambda": [11, 12, 14, 16, 17, 18, 19, 155, 161, 163, 170, 175], "lambdaact": 14, "land": [129, 157], "lander": [126, 154, 155], "lane": 148, "lap": [128, 154], "lap_complete_perc": 128, "larg": [11, 129, 134, 137, 138, 139, 140, 141, 144, 145, 146, 147, 148, 154, 155, 160, 162, 167, 169, 173], "larger": 170, "largest": 154, "last": [9, 13, 14, 17, 18, 20, 141, 144, 145, 154, 155, 165, 166, 173], "latent": 148, "later": [129, 170, 175, 177], "latest": [2, 162, 177], "latin": 7, "latter": 151, "lava": [2, 157], "layer": [3, 15, 157, 158, 177], "lazili": 5, "lazyfram": 18, "lead": 170, "learn": [2, 5, 6, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 138, 142, 143, 146, 148, 149, 150, 151, 153, 154, 155, 156, 157, 158, 162, 163, 165, 169, 170, 171, 173, 176], "learner": 173, "learning_r": [163, 173, 175, 177], "learnt": 4, "least": [7, 129, 136], "leav": [132, 142, 158, 165, 173], "left": [7, 127, 128, 129, 132, 133, 137, 140, 141, 142, 143, 147, 151, 152, 153, 158, 163, 165, 169, 173, 175, 177], "left_elbow": [140, 141], "left_elfbow": [140, 141], "left_foot": [140, 141], "left_hip_i": [140, 141], "left_hip_x": [140, 141], "left_hip_z": [140, 141], "left_hiz_z": [140, 141], "left_kne": [140, 141], "left_lower_arm": [140, 141], "left_shoulder1": [140, 141], "left_shoulder2": [140, 141], "left_sin": [140, 141], "left_thigh": [140, 141], "left_upper_arm": [140, 141], "leg": [127, 129, 137, 139, 140, 141, 147], "leg_joint": [139, 147], "leg_left_joint": 147, "legaci": 154, "legend": [173, 175], "legend_el": 175, "len": [6, 17, 167, 170, 175], "length": [4, 6, 7, 9, 11, 12, 14, 17, 20, 127, 131, 132, 133, 134, 146, 148, 152, 153, 154, 155, 157, 161, 170, 175], "length_mask": 6, "length_moving_averag": 175, "length_queu": [14, 17, 161, 175], "leq": 142, "less": [127, 129, 146, 147, 157, 173], "lesson": 173, "let": [148, 155, 163, 165, 167, 173, 175, 177], "letter": [153, 154], "level": [9, 15, 157, 169], "levin": [137, 170], "li": [141, 157], "lib": 151, "librari": [148, 155, 156, 160, 165, 169], "licens": [154, 155, 165, 170, 173, 175, 177], "lidar": 127, "life": [17, 154, 155, 177], "lift": 144, "light": 148, "lightgreen": 175, "lightweight": 148, "ligibl": 177, "like": [3, 5, 7, 9, 13, 14, 15, 16, 17, 18, 19, 20, 138, 143, 147, 148, 153, 154, 155, 157, 158, 165, 167, 169, 170, 172], "lillicrap": 170, "limit": [17, 20, 132, 136, 143, 154, 155, 157, 158, 160, 162, 163, 168, 170, 171, 175, 177], "lin826": 154, "linalg": [9, 158, 165], "line": [2, 5, 131, 153, 154, 155, 157, 158, 160, 165], "linear": [10, 129, 131, 142, 143, 146, 170, 177], "linearli": [14, 16, 18, 20, 131, 142, 143, 157, 177], "linecolor": 173, "lineplot": [173, 177], "linewidth": [173, 175], "link": [21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 131, 137, 145, 146, 155], "list": [2, 4, 5, 6, 7, 9, 10, 11, 12, 14, 15, 17, 18, 154, 155, 157, 158, 170, 173, 175], "list_info": 14, "listdir": 17, "liter": 155, "littl": [129, 155], "live": [1, 9, 163, 175], "ll": [154, 163, 170, 175], "llvm": 148, "load": [4, 154, 155], "load_env_cr": 4, "load_ext": 173, "load_state_dict": 170, "load_weight": 170, "loc": [153, 170], "local": 165, "localhost": 1, "locat": [4, 136, 148, 151, 152, 153, 158, 165], "log": [2, 9, 14, 16, 17, 18, 19, 161, 170, 173, 177], "log_prob": [170, 177], "log_prob_mean": 177, "logan": 155, "logger": [17, 154, 161], "logic": [11, 128, 154, 158, 165, 169], "logical_or": [10, 155], "logit": 170, "long": [9, 17, 132, 143, 148, 154, 155], "longer": [2, 136, 137, 139, 143, 147, 154, 155, 177], "look": [5, 9, 17, 157, 158, 165, 166, 167, 170, 173], "loop": [148, 157, 160, 163, 170, 175], "lose": [129, 150, 175], "loss": [17, 150, 170, 177], "lost": 17, "lot": [15, 157, 163, 167, 175, 177], "low": [2, 6, 7, 8, 12, 13, 14, 18, 131, 132, 134, 135, 154, 155, 157, 167, 170], "lower": [7, 14, 15, 19, 20, 140, 141, 154, 157, 163, 170, 175], "lower_waist": [140, 141], "lpizzinidev": 155, "lr": [163, 170, 173, 175, 177], "lt": [154, 155], "lucasalegr": 155, "luck": 163, "lunar": [126, 154, 155], "lunar_land": [129, 155], "lunarland": [17, 129, 154, 155, 156, 157, 160, 170], "lunarlandercontinu": 167, "lwaist": [140, 141], "lz4": 18, "m": [8, 131, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 170], "mab": 148, "machado": [16, 17], "machin": [154, 173, 177], "made": [1, 129, 137, 138, 139, 140, 141, 143, 146, 147, 154, 155, 169, 170], "magnitud": [9, 129, 134, 142, 143, 177], "mahiuchun": 154, "mai": [2, 5, 7, 8, 9, 11, 14, 15, 16, 18, 19, 151, 152, 157, 158, 161, 165, 169, 170], "main": [2, 129, 139, 147, 155, 169, 173], "main_bodi": [137, 169], "mainli": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "maintain": [136, 148, 154, 155, 156, 172], "mainten": 154, "major": [154, 155], "make": [1, 2, 6, 9, 11, 12, 15, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 163, 167, 168, 169, 170, 171, 173, 175, 177], "make_dict": 14, "make_vec": [2, 4, 9, 10, 11, 12, 14, 132, 155, 158, 162], "malici": 155, "manag": [4, 13, 148], "mandatori": [158, 165], "manhattan": [158, 165], "mani": [7, 14, 17, 136, 148, 154, 155, 157, 163, 165, 173, 175], "manipul": 148, "manner": [3, 7, 18, 20, 154], "manual": [9, 14, 157, 160, 170], "manual_se": 177, "manuel": 155, "map": [9, 152, 153, 155, 157, 158, 165, 175, 177], "map_nam": 152, "map_siz": 173, "mar": [148, 155], "marionj": 155, "mark": 155, "markdown": 1, "market": 148, "markov": [155, 157, 166], "markovian": 148, "marku": 155, "markus28": [154, 155], "marsexplor": 148, "mask": [5, 6, 7, 153, 154, 160, 170], "maskndarrai": 7, "mass": [129, 137, 140, 141, 146], "massiv": 154, "master": [18, 151, 154, 177], "match": [4, 8, 11, 12, 16, 132, 155], "math": [135, 148, 154], "mathbb": 7, "mathcal": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "mathemat": [5, 157, 177], "matplotlib": [9, 170, 173, 175, 177], "matpoliquin": 155, "matric": [5, 148], "matrix": [6, 140, 141, 148], "matter": [7, 154], "matyasch": 155, "max": [4, 7, 14, 16, 17, 18, 20, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 152, 155, 161, 163, 170, 173, 175, 177], "max_a": 166, "max_act": [14, 15, 16, 20], "max_episode_step": [4, 17, 155, 158, 165, 169, 170], "max_geom": 136, "max_id": 173, "max_length": 7, "max_ob": [14, 18, 20], "max_reward": [10, 14, 19, 155, 167], "max_time_step": [132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "maxandskipenv": 18, "maxandskipobserv": [18, 20, 155], "maxim": [170, 177], "maximis": 162, "maximum": [4, 7, 9, 14, 17, 18, 20, 129, 131, 135, 142, 155, 157, 165, 170, 173], "maxq": 153, "mayb": [2, 173], "mclean": 155, "md": [1, 154, 155, 165], "mdp": [2, 17, 133, 134, 148, 157], "mean": [2, 14, 18, 19, 20, 131, 139, 144, 154, 157, 160, 161, 165, 170, 173, 177], "meant": 3, "meantim": 155, "measur": [9, 127, 135, 141, 142, 144, 145, 154], "mechan": 155, "medium": 148, "mel": 152, "melbourn": 173, "melt": 177, "member": 5, "memori": [5, 17, 133, 134, 154, 155, 162], "menageri": 169, "mention": [155, 177], "mere": 165, "merg": 155, "meshcat": 148, "meshgrid": 175, "messag": [5, 154, 155], "meta": [148, 154], "metaclass": 154, "metadata": [2, 3, 9, 10, 15, 17, 154, 155, 165, 167], "metatrad": 148, "metaworld": 148, "meter": [135, 148], "meth": [7, 17, 158], "method": [3, 4, 6, 9, 14, 16, 17, 18, 19, 148, 154, 155, 158, 159, 165, 167, 170, 177], "methodologi": 148, "metric": [2, 9, 155, 160, 170], "mgoulao": 155, "micim": 154, "microgrid": 148, "middl": [139, 147], "might": [2, 5, 7, 8, 15, 16, 157, 160, 161, 162, 163, 165, 167, 170, 175, 177], "migrat": [155, 157], "mihaic": 155, "miller": 173, "min": [7, 14, 16, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "min_act": [14, 15, 16, 20], "min_length": 7, "min_ob": [14, 18, 20], "min_reward": [10, 14, 19, 155, 167], "minari": 148, "minatari": 148, "mind": 155, "minecraft": 148, "minetest": 148, "miniatur": 148, "minibatch": 170, "minigrid": [148, 155], "minim": [2, 148, 155, 165, 166, 170, 173], "minimalist": 148, "minimum": [7, 14, 18, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 157, 173], "miniwob": 148, "miniworld": 148, "minor": [154, 155], "mirza": 170, "misc": [154, 155], "miss": [131, 153, 154, 155, 157], "mission": 5, "mistak": 155, "mit": [131, 170, 173, 175, 177], "mitig": 170, "mixtur": 155, "mj_camera": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "mj_step": [137, 140], "mjcf": 169, "mjdata": 136, "mjtvisflag": 136, "mjvcamera": 136, "mjvis_contactforc": 136, "mjvis_contactpoint": 136, "mjvisual": 136, "mkdir": [170, 173], "ml": [148, 154], "mmcaulif": 155, "mnih": 170, "mo": 148, "moba": 148, "mobil": 148, "mode": [2, 4, 10, 11, 12, 14, 154, 155, 157, 160, 165, 170, 175], "model": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 155, 163, 172, 175], "modern": 148, "modif": 167, "modifi": [2, 4, 5, 10, 14, 15, 16, 18, 19, 20, 131, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 158, 160, 165, 167], "modul": [4, 5, 13, 154, 155, 161, 165, 167, 170, 177], "modular": [14, 15, 148, 157, 165, 167], "module_nam": 155, "moment": 3, "monitor": [17, 154, 155, 160], "monop": 136, "mont": [163, 170, 175, 177], "month": 155, "moon": 129, "moor": [133, 134], "moore90efficientmemori": [133, 134], "more": [1, 2, 5, 6, 7, 14, 15, 19, 20, 127, 129, 136, 137, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 158, 160, 162, 163, 165, 166, 167, 169, 170, 173, 175, 177], "moreov": [5, 14, 15, 17, 165, 167], "moritz": [137, 170], "most": [2, 5, 7, 13, 14, 15, 17, 18, 136, 145, 148, 154, 155, 157, 158, 160, 162, 163, 165, 167, 169, 175], "mostli": 154, "motion": 148, "motor": [127, 146, 148, 154, 169], "motor1_rot": 146, "motor2_rot": 146, "mountain": [130, 154, 157], "mountaincar": [14, 133], "mountaincarcontinu": [14, 16, 19, 134], "mountaincarenv": 133, "move": [2, 14, 17, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 127, 129, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 151, 152, 153, 154, 155, 157, 158, 160, 165, 166, 169, 173, 177], "movement": [158, 165], "moviepi": [9, 17, 154, 155], "mozer": 131, "mp": 13, "mp4": 161, "mpi": 13, "mpi4pi": 13, "mpi_init": 13, "mt19937": 154, "mtsim": 148, "much": [154, 155, 165, 166, 169], "mujoco": [1, 9, 15, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 157, 167, 169, 171, 176], "mujoco_env": 154, "mujoco_gl": 136, "mujoco_menageri": 169, "mujoco_rend": 154, "mujocoenv": 155, "mujocorend": 155, "mujocorender": 155, "multi": [2, 7, 136, 140, 141, 144, 148], "multiag": 148, "multibinari": [5, 6, 7, 8, 13, 18, 155, 157], "multidiscret": [5, 6, 7, 8, 10, 13, 18, 154, 155, 157, 165], "multimod": 148, "multipl": [5, 6, 7, 10, 11, 12, 13, 154, 155, 158, 162, 170], "multipli": [6, 129, 134, 144, 145, 170], "multiprocess": [5, 11, 13, 170], "multivari": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "must": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 128, 139, 147, 154, 155, 157, 160, 165, 166], "mutual": 136, "my": 173, "mypi": [2, 155], "n": [4, 5, 6, 7, 8, 11, 12, 13, 17, 18, 20, 128, 131, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 157, 161, 163, 170, 173, 175, 177], "n_action": 170, "n_env": 170, "n_episod": [163, 175], "n_featur": 170, "n_run": 173, "n_showcase_episod": 170, "n_steps_per_upd": 170, "n_updat": 170, "na": 145, "name": [1, 3, 4, 11, 12, 14, 15, 19, 20, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 158, 165], "name_prefix": [9, 17, 161], "namedtupl": [155, 173], "namespac": [4, 158, 165], "nanoquadrotor": 148, "narrow": 173, "nativ": 17, "natur": [148, 150, 152, 175], "navig": [148, 153, 158, 165, 167], "nbodi": [137, 140, 141], "nchain": 154, "ncol": [151, 152, 170, 173, 175], "ncrement": 177, "ndarrai": [2, 3, 6, 7, 8, 13, 14, 16, 17, 18, 19, 131, 132, 133, 134, 135, 153, 170, 177], "neccari": 9, "necessari": [2, 5, 126, 154, 155], "necessarili": 8, "need": [2, 5, 6, 9, 14, 15, 16, 18, 127, 129, 132, 136, 154, 155, 157, 158, 160, 161, 162, 163, 165, 166, 167, 169, 170, 173, 175], "neg": [2, 7, 9, 134, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 153, 157, 177], "neither": [7, 17, 150, 153], "nerdyespresso": 155, "nervous": 127, "nest": [6, 13, 148], "net": [14, 19, 128, 150, 151, 163, 170, 177], "network": [5, 8, 10, 13, 146, 148, 157, 170], "neural": [5, 8, 10, 13, 131, 146, 148, 170, 177], "neurip": 131, "neuro": 148, "neuroflight": 148, "neuronlik": [132, 142, 143], "never": [2, 138, 141, 144, 145, 146, 155], "nevertheless": 167, "new": [2, 5, 14, 15, 16, 17, 18, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 146, 147, 153, 154, 155, 156, 157, 158, 159, 160, 162, 165, 166, 167, 169, 170, 173, 175], "new_action_spac": 14, "new_obs_spac": 14, "new_spac": 14, "new_stat": 173, "new_throughput": 9, "newer": [136, 154], "newli": [14, 18, 19], "newlin": 2, "next": [2, 3, 10, 13, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 155, 156, 157, 158, 160, 163, 169, 177], "next_ob": [155, 163, 175, 177], "next_stat": [3, 155, 175], "nice": 154, "nich": 157, "nidhish": 155, "nine": 137, "nintendo": 7, "nip": 131, "nlp": 148, "nlpgym": 148, "nn": [170, 177], "no_grad": 170, "node": [5, 6, 8, 154, 157], "node_spac": [6, 8], "nois": [130, 136], "non": [2, 5, 8, 9, 17, 129, 132, 137, 138, 139, 140, 141, 146, 147, 148, 152, 154, 155, 165, 166, 177], "non_random_ob": 128, "nonbreak": 154, "nondeterminist": [4, 165], "none": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 136, 152, 154, 155, 158, 160, 165, 170, 173, 175], "nonlinear": 139, "noop": [7, 9, 17, 153, 170], "noop_max": 17, "nor": [7, 17, 150, 151, 153], "norm": [9, 144, 145, 158, 165], "normal": [2, 4, 7, 10, 13, 14, 17, 18, 19, 20, 127, 128, 135, 137, 138, 142, 154, 155, 158, 161, 162, 170, 177], "normalis": 155, "normalize_tim": 18, "normalizeobserv": [14, 18, 20, 154, 155], "normalizereward": [14, 19, 20, 154], "normalizex": 155, "north": 153, "notabl": [154, 155, 166], "note": [1, 5, 9, 13, 14, 17, 18, 19, 131, 132, 136, 137, 139, 140, 141, 142, 144, 147, 153, 157, 162, 163, 166, 169, 170, 175, 177], "notebook": [1, 154, 165, 166, 167, 170, 171, 173, 175, 177], "noth": [3, 128, 129, 155, 177], "notic": 155, "notif": 155, "notimplementederror": 8, "nov": 153, "now": [2, 127, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 158, 160, 165, 166, 169, 173, 175, 177], "np": [2, 3, 5, 6, 7, 9, 10, 12, 13, 14, 16, 18, 19, 129, 133, 140, 141, 153, 154, 155, 158, 163, 165, 167, 169, 170, 173, 175, 177], "np_random": [2, 5, 9, 10, 11, 12, 15, 154, 155, 158, 160, 165], "np_random_se": [2, 10, 11, 12, 15, 155], "npt": 5, "nrow": [152, 170, 173], "nstep": 155, "num": [14, 17, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "num_col": 4, "num_edg": 6, "num_env": [4, 10, 11, 12, 13, 14, 132, 155, 158, 170], "num_eval_episod": 161, "num_nod": 6, "num_training_episod": 161, "number": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 17, 18, 20, 128, 129, 131, 136, 139, 140, 141, 146, 147, 148, 150, 152, 154, 155, 157, 158, 159, 161, 162, 165, 169, 170, 173, 177], "numer": [142, 143, 150, 154, 155, 167, 177], "numpi": [2, 5, 7, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 154, 155, 158, 160, 163, 165, 167, 169, 170, 173, 175, 177], "numpytotorch": [14, 17, 20, 155], "nutshel": [14, 19], "nv": [140, 141], "nvec": 7, "nvidia": 148, "o": [17, 170, 177], "o_": 166, "o_t": 166, "ob": [9, 12, 14, 16, 17, 18, 128, 153, 154, 155, 161, 163, 167, 175, 177], "obj_slidei": 144, "obj_slidex": 144, "object": [2, 5, 9, 11, 12, 13, 14, 15, 136, 144, 145, 148, 155, 157, 175, 177], "obs0": 18, "obs1": 18, "obs2": 18, "obs3": 18, "obs4": 18, "obs_kei": 18, "obs_shap": 170, "obs_space_dim": 177, "obs_t": 9, "obs_tp": 9, "obs_tp1": 9, "observ": [2, 3, 4, 5, 6, 9, 10, 11, 12, 13, 15, 17, 19, 20, 148, 154, 155, 156, 160, 162, 163, 166, 167, 173, 177], "observation_mod": [11, 12], "observation_spac": [2, 5, 6, 7, 9, 10, 11, 12, 13, 14, 15, 18, 157, 158, 165, 167, 173, 177], "observation_structur": [137, 138, 139, 140, 141, 143, 146, 147], "observationwrapp": [14, 15, 18, 165, 169], "obstyp": [2, 3, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 155, 160, 177], "obswrapp": 154, "obtain": [15, 17, 135, 150, 158, 166], "obviou": 166, "obvious": 155, "occur": [2, 10, 131, 132, 165, 170], "octob": 136, "off": [17, 128, 129, 151, 153, 154, 155, 160, 163, 173], "offer": 148, "offici": 3, "offlin": [148, 155], "offscreen": [154, 155], "offset": [129, 140, 141], "offworld": 148, "often": [2, 5, 148, 158, 165, 173], "oftentim": [158, 165], "old": [10, 16, 154, 155, 156, 159, 160], "old_spac": 14, "old_throughput": 9, "older": [136, 155, 160], "oldv21env": 159, "oleg": [126, 127, 128, 129], "olimoyo": 154, "olipinski": [154, 155], "omar": 155, "omega_1": 142, "omega_2": 142, "omesa": 136, "omg": 148, "omit": [137, 138, 139, 140, 141, 146, 147], "omniisaacgymenv": 148, "omnivers": 148, "onc": [1, 17, 153, 154, 158, 165, 170, 175], "one": [2, 5, 6, 7, 8, 9, 11, 15, 131, 132, 133, 134, 135, 137, 139, 142, 143, 146, 150, 153, 154, 155, 158, 159, 160, 163, 166, 167, 169, 170, 173, 175, 177], "oneof": [5, 6, 8, 13, 155], "ones": [14, 130, 154, 170, 175], "ongo": 170, "onli": [2, 4, 5, 6, 7, 8, 9, 17, 18, 131, 132, 133, 134, 137, 138, 139, 140, 141, 142, 143, 145, 146, 147, 154, 155, 158, 160, 161, 163, 165, 167, 169, 170, 173, 175], "onlin": [140, 141, 150, 151], "onward": 166, "op": [17, 155], "open": [1, 17, 136, 148, 154, 155, 165], "openai": [2, 19, 126, 154, 155, 156, 157, 160, 166, 175, 177], "opencv": [14, 17, 18, 20, 154], "opengl": [136, 155], "openmodelica": 148, "openreview": [14, 19], "oper": [8, 14, 136, 154], "opportun": 148, "opt": 169, "optim": [13, 14, 17, 127, 129, 140, 141, 148, 155, 163, 170, 173, 175, 177], "optimis": [10, 148, 154, 172], "option": [2, 3, 4, 5, 6, 7, 10, 11, 12, 14, 15, 18, 128, 131, 132, 133, 134, 135, 136, 148, 154, 155, 157, 158, 160, 165], "ord": [9, 158, 165], "order": [4, 5, 6, 15, 17, 129, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 157, 161, 162, 165, 173], "order_enforc": [4, 165], "ordereddict": 155, "orderenforc": [4, 15, 17, 20, 127, 128, 129, 131, 132, 133, 134, 135, 154, 155, 157, 158, 165], "orderli": 154, "org": [21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 155, 163, 170, 175, 177], "orient": [129, 137, 140, 141], "origin": [8, 10, 14, 16, 17, 18, 128, 131, 132, 142, 154, 155, 160], "orsini": 154, "other": [14, 16, 129, 135, 136, 138, 140, 142, 146, 147, 152, 153, 154, 155, 157, 158, 159, 161, 163, 165, 167, 169, 170, 173, 175, 177], "otherwis": [2, 4, 6, 7, 9, 11, 12, 14, 17, 18, 139, 154, 155, 158, 163, 173, 175], "our": [155, 157, 158, 163, 165, 169, 170, 173, 175], "out": [2, 3, 13, 14, 15, 148, 154, 155, 158, 165, 166, 167, 172, 177], "outcom": [150, 173], "outer": 131, "outermost": 14, "outlin": [150, 154, 157, 158, 160, 163, 175], "output": [1, 9, 10, 13, 17, 154, 155, 157, 158, 165, 170, 175, 177], "output_truncation_bool": 9, "outsid": [2, 14, 18, 128, 129, 154, 155, 166], "outweigh": 155, "over": [5, 9, 13, 17, 138, 140, 150, 152, 154, 155, 161, 163, 167, 169, 170, 175], "overcom": 173, "overfit": [169, 170], "overflow": 155, "overhaul": 155, "overhead": 129, "overrid": [4, 11, 14, 155, 167], "overview": [163, 165, 177], "overwrit": [16, 18, 19], "overwritten": [15, 154, 155], "own": [11, 15, 148, 155, 158, 160, 167, 168, 169, 171], "p": [138, 148, 151, 152, 153, 154, 170, 173], "p_": [144, 145], "p_0": 148, "packag": [1, 10, 17, 126, 148, 154, 155, 173], "package_data": 154, "pad": [18, 129, 155], "padding_typ": [18, 155], "page": [16, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 151, 155, 157, 158, 162, 163, 172], "painlessli": 5, "pair": [136, 140, 141], "palett": 177, "pan": 144, "panda": [148, 173, 177], "paper": [131, 142, 148, 153, 155, 162], "paradigm": 2, "parallel": [10, 11, 12, 14, 148, 155, 158, 161, 162, 170], "param": [3, 7, 9, 155, 173], "paramet": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 157, 158, 160, 165, 167, 170, 177], "parameter": [2, 170, 177], "parameter_tun": 154, "parametr": [5, 177], "parent": [13, 173], "pars": 4, "parse_env_id": 4, "parser": 155, "part": [17, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 154, 155, 158, 166, 169, 173], "parti": [136, 137, 138, 139, 140, 141, 143, 146, 147, 154, 155, 158, 165], "partial": [2, 148, 154, 155], "particular": [5, 6, 9, 20, 129, 155, 157, 159, 162, 163, 166, 169, 173], "particularli": [9, 17, 18, 132, 155], "pass": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 128, 137, 138, 139, 140, 141, 146, 147, 154, 155, 157, 159, 160, 165, 167, 170], "passeng": 153, "passenger_loc": 153, "passidx": 153, "passiv": [17, 20, 154, 155], "passiveenvcheck": [4, 15, 17, 20, 127, 128, 129, 131, 132, 133, 134, 135, 155, 157, 158], "past": [14, 18, 19, 154], "patch": [154, 175], "path": [4, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 152, 155, 165, 170, 173], "pathlib": 173, "patienc": 154, "paulmest": 155, "paw": 138, "pcg64": 154, "pd": [173, 177], "pdf": [131, 150, 151], "peg": 154, "pelvi": [140, 141], "penal": [137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 167], "penalis": [133, 134], "penalti": 153, "pend": [11, 154], "pendulum": [11, 12, 18, 130, 131, 132, 136, 154, 157, 177], "pendulumenv": 135, "per": [7, 9, 10, 17, 129, 132, 153, 154, 162, 166, 169, 170], "percentag": [128, 163, 175], "peretz": 155, "perez": 155, "perf_count": 154, "perfect": 157, "perfectli": 165, "perform": [2, 5, 14, 145, 154, 155, 157, 159, 162, 177], "period": [139, 161], "periodiclli": 161, "perman": [144, 145, 158], "permiss": [154, 155], "perpendicular": [151, 152, 173], "perpendicularli": 142, "person": 131, "perspect": 148, "perturb": [137, 138, 139, 140, 141, 142, 143, 146, 147], "pettingzoo": [2, 148, 154], "pge": 148, "phase": [148, 170], "phd": [133, 134, 146], "philosoph": 9, "photorealist": 148, "phys2d": 17, "physic": [2, 126, 136, 142, 143, 144, 145, 148, 154, 155, 177], "pi": [129, 131, 133, 135], "pick": [153, 163, 169, 170, 173, 175], "pickl": [1, 13, 154, 155], "pickled_env_spec": 155, "pickup": 153, "pictur": 157, "piec": 165, "pierrecounath": 155, "pierremardon": 155, "pierr\u00e9": [155, 173], "pinocchio": 148, "pip": [1, 126, 130, 136, 154, 155, 157, 165, 169], "pip3": 154, "pipe": [11, 154], "pipx": 165, "pirata": 154, "pitfal": 127, "pix_square_s": 165, "pixel": [2, 18, 128, 148, 165], "pixelobservationwrapp": [18, 154, 155], "pixelobswrapp": 154, "pixels3d": 165, "place": [9, 17, 132, 133, 134, 155, 158, 165, 170], "plai": [9, 128, 148, 154, 155, 163, 175], "plan": [148, 154, 155], "platform": 148, "playablegam": 9, "player": [148, 150, 151, 152, 153, 163, 173, 175], "player_count": 175, "playfield": 128, "playplot": [9, 155], "pleas": [2, 5, 7, 18, 148, 154, 155, 158, 160, 163, 172], "plot": [1, 9, 173, 175], "plot_nam": 9, "plot_q_values_map": 173, "plot_states_actions_distribut": 173, "plot_steps_and_reward": 173, "plot_surfac": 175, "plotter": 9, "plt": [170, 173, 175, 177], "plu": [7, 151], "plugin": [154, 155], "pm": 129, "png": 173, "point": [4, 8, 9, 15, 127, 128, 129, 131, 132, 135, 136, 145, 150, 154, 155, 158, 161, 166], "pole": [2, 130, 136, 142, 143, 148, 177], "pole2": 142, "poli": 148, "polici": [2, 14, 18, 19, 130, 136, 137, 138, 139, 140, 141, 146, 147, 148, 154, 155, 156, 157, 160, 166, 169, 170, 173], "policy_grid": 175, "policy_mean_net": 177, "policy_network": 177, "policy_stddev_net": 177, "polygon": 154, "pomdp": 3, "pong": [17, 155], "pontryagin": 129, "pool": [17, 146], "pop": [2, 154, 155], "pop_fram": 17, "popular": [126, 148, 155, 163, 175], "pose": 148, "posit": [2, 6, 7, 8, 9, 11, 13, 17, 127, 128, 129, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 151, 152, 153, 155, 157, 158, 165, 167, 175], "position_fingertip": 145, "position_target": 145, "possibl": [2, 5, 6, 7, 8, 14, 129, 131, 132, 133, 134, 136, 138, 140, 143, 146, 151, 152, 153, 154, 155, 157, 158, 161, 163, 165, 169, 173, 175], "possibli": [7, 13, 15, 17, 157, 170], "post": [154, 155, 160, 173], "postprocess": 173, "potenti": 155, "pow": 170, "power": [17, 128, 129, 134, 142, 143, 148, 177], "pp": 153, "ppo": 172, "pprint": [14, 155], "pprint_registri": [2, 4, 155, 157, 158], "pr": [140, 148, 154, 155, 160], "practic": [148, 154, 166, 175], "pre": [4, 5, 152, 155, 165], "precis": [6, 154], "predefin": 165, "predict": [139, 148, 170, 177], "predominantli": 3, "prefix": 161, "preload": 152, "premad": 148, "prematur": 2, "prepend": [9, 17], "preprocess": [9, 17, 20, 154, 155], "present": [131, 148, 154, 166, 175], "preserv": 166, "press": [7, 9, 128, 131, 157], "pretti": [4, 155, 173], "prevent": [3, 7, 11, 154, 158], "previou": [17, 155, 160, 166, 167], "previous": [4, 9, 18, 137, 138, 139, 140, 141, 143, 146, 147, 154, 155, 159], "primari": 160, "primarili": [148, 155], "principl": 129, "print": [4, 155, 161, 165, 167, 170, 173, 177], "print_registri": 4, "prior": 4, "priori": 148, "privat": 165, "prng": [2, 5, 6, 7], "proabil": [151, 153], "prob": [170, 177], "proba_frozen": 173, "probabl": [5, 16, 20, 151, 152, 153, 155, 163, 166, 170, 173, 175, 177], "problem": [17, 129, 132, 135, 137, 140, 141, 142, 143, 146, 148, 150, 153, 156, 161, 162, 166, 169, 170], "procedur": 177, "proceed": 131, "process": [9, 10, 11, 13, 14, 18, 131, 148, 155, 157, 165, 166, 170], "process_ev": 9, "produc": [2, 17, 20, 154, 155, 165], "product": [6, 7, 177], "program": [10, 11, 12, 148, 155, 157], "progress": [6, 9, 155, 170], "project": [9, 148, 154, 155, 157, 158, 160, 163, 165, 175], "prompt": 155, "propag": 173, "proper": 154, "properli": [1, 5, 154], "properti": [2, 5, 10, 11, 12, 14, 15, 18, 19, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 166], "proport": 170, "propos": [16, 155], "protocol": 17, "prototyp": 148, "prove": [148, 163], "prover": 148, "provid": [2, 5, 6, 7, 9, 10, 11, 14, 16, 17, 18, 20, 126, 127, 131, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 157, 158, 159, 161, 162, 163, 165, 172, 175], "psc": 154, "pseudo": [154, 155, 170], "pseudorandom": [5, 7], "publicli": 155, "publish": 155, "pull": [18, 19, 154], "pump": 165, "purpos": [3, 5, 154, 158, 177], "push": [129, 132, 136, 142, 143, 148, 155, 177], "pusher": [136, 155], "pusher_v5": 144, "put": [154, 165, 173], "puzzl": 148, "py": [1, 9, 126, 127, 128, 129, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 151, 154, 155, 165, 166, 167, 170, 173, 175, 177], "pybullet": 148, "pycharm": 155, "pydoc": 154, "pyflyt": [148, 155], "pygam": [2, 9, 126, 154, 155, 160, 165], "pyglet": 154, "pypi": [154, 155], "pyplot": [170, 173, 175, 177], "pyproject": 165, "pyright": [2, 154, 155], "pystk2": 148, "python": [1, 17, 126, 127, 128, 129, 148, 154, 155, 156, 157, 158, 161, 165, 166, 167, 170, 171, 173, 175, 177], "pytorch": [14, 17, 20, 155, 162, 177], "q": [153, 163, 166, 170, 171, 176], "q_": 166, "q_target": 166, "q_updat": 173, "q_valu": [153, 163, 175], "qfrc_actuat": [140, 141], "qfrc_constraint": 142, "qgallouedec": 155, "qlearn": 173, "qpo": [136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "qtabl": 173, "qtable_best_act": 173, "qtable_direct": 173, "qtable_directions_map": 173, "qtable_val_max": 173, "quad": 141, "quad_ctrl_cost": 141, "quad_impact_cost": 141, "quadcopt": 148, "quadrotor": 148, "quadrup": [136, 137, 155], "quagla": 154, "qualiti": [10, 148, 155], "quardup": 136, "quaternion": [137, 140, 141], "queen": 150, "question": [148, 165], "queue": [11, 161], "quicker": 170, "quickli": [133, 148, 163, 173, 175], "quit": [11, 165], "qvel": [136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "qwop": 148, "r": [1, 7, 14, 17, 19, 131, 135, 148, 150, 151, 153, 167, 173, 177], "r1etn1rtpb": [14, 19], "r_": 170, "r_elbow_flex_joint": 144, "r_forearm_roll_joint": 144, "r_shoulder_lift_joint": 144, "r_shoulder_pan_joint": 144, "r_t": 166, "r_upper_arm_roll_joint": 144, "r_wrist_flex_joint": 144, "r_wrist_roll_joint": 144, "race": [126, 148, 154], "racecar": 148, "racecar_gym": 148, "racing_dream": 148, "rad": [131, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "radian": [129, 131, 132, 135, 143], "rafaelcp": 155, "raghuspacerajan": 154, "rainbow": [172, 177], "rais": [2, 4, 7, 8, 9, 11, 12, 13, 17, 18, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 170], "ram": 162, "rand": 154, "randint": [154, 160], "randn": 154, "random": [2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 17, 18, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 145, 146, 147, 152, 153, 154, 155, 157, 158, 163, 165, 168, 171, 173, 175, 177], "randomize_domain": 170, "randomize_ob": 128, "randomli": [5, 6, 9, 129, 152, 154, 155, 158, 165, 170, 173], "randomnumbergener": 154, "randomst": 154, "rang": [9, 13, 14, 16, 17, 18, 20, 127, 130, 131, 132, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 163, 165, 167, 169, 170, 173, 175, 177], "rangefind": 127, "raphajan": 155, "rapidli": 148, "rare": [7, 154], "rate": [144, 145, 148, 163, 170, 173, 175, 177], "rather": [2, 4, 128, 154, 155, 160, 161, 163, 170, 175], "raw": 155, "rayerdyn": 155, "rcparam": 177, "rdnfn": 154, "re": [1, 154, 155, 165, 173, 177], "reach": [2, 4, 17, 18, 20, 130, 131, 132, 133, 134, 136, 148, 151, 152, 153, 155, 158, 160, 163, 165, 166, 172, 173, 175, 177], "reachabl": 153, "reacher": [136, 155, 167], "reacher_weighted_reward": 165, "reacherrewardwrapp": 167, "read": [2, 5, 13, 19, 131, 137, 140, 141, 154, 155, 158, 160, 163], "read_from_shared_memori": [5, 13], "readabl": [5, 6, 155], "reader": [162, 169], "readi": [169, 175], "readm": [154, 155, 165], "readthedoc": 18, "real": [9, 138, 148], "realist": [136, 148], "realli": [155, 163, 169, 175], "rear": 128, "reason": [2, 17, 129, 148, 154, 155, 160, 169], "rebuild": 1, "receiv": [9, 18, 19, 20, 128, 129, 134, 147, 153, 154, 156, 157, 163, 166, 170, 175, 177], "recent": [13, 14, 17, 18, 19, 155, 159], "recogn": 155, "recommend": [3, 4, 9, 10, 11, 17, 129, 136, 137, 140, 141, 154, 155, 158, 160, 163, 169, 170, 175], "reconstruct": 157, "record": [4, 9, 17, 20, 127, 148, 154, 155, 157, 170, 177], "recordconstructorutil": 155, "recorded_fram": 155, "recordepisodestat": 154, "recordepisodestatist": [14, 17, 20, 154, 155, 160, 161, 163, 170, 175, 177], "recordepisodicstatist": 161, "recordvideo": [17, 20, 154, 155, 160, 161], "recordvideov0": 155, "recreat": 155, "recreated_env": 155, "rect": 165, "red": [153, 154, 165], "redirect": [21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125], "redtachyon": [154, 155], "reduc": [132, 148, 162, 163, 170, 172, 175], "redund": 154, "refactor": [154, 155], "refer": [2, 4, 5, 7, 9, 155, 156, 157, 160, 162, 165, 166, 169], "referenc": 155, "reflect": [18, 132, 154, 155], "refresh": 173, "regard": 130, "regardless": [137, 138, 139, 140, 141, 146, 147, 169], "regex": 4, "reginald": 155, "regist": [13, 154, 155, 159], "register_env": [17, 155], "registr": [4, 154, 155, 165], "registri": [4, 154, 155], "regress": 9, "regularli": 155, "reimplement": 160, "reinforc": [2, 5, 131, 146, 148, 149, 150, 151, 153, 154, 155, 156, 157, 162, 163, 166, 170, 171, 173, 175, 176], "reinforce_invpend_gym_v26": 177, "reiniti": [154, 177], "rel": [131, 140, 141, 155, 165, 167, 170], "relat": [129, 132, 137, 139, 140, 141, 142, 143, 144, 145, 147, 148, 155, 177], "relative_posit": 165, "relativeposit": [165, 167], "releas": [10, 11, 12, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 165], "relev": [154, 159, 165], "relu": [14, 170], "remain": [18, 151, 154, 165, 166], "remak": 170, "rememb": [17, 128, 165, 167], "remind": [155, 169], "remot": [148, 155], "remov": [2, 4, 17, 18, 137, 138, 139, 140, 141, 145, 146, 147, 153, 154, 155], "renam": [14, 17, 18, 138, 140, 154, 155, 177], "render": [2, 3, 4, 10, 14, 15, 17, 18, 20, 126, 127, 137, 138, 139, 140, 146, 147, 148, 154, 155, 157, 158, 161, 163, 167, 173, 175], "render_clos": 3, "render_fp": [2, 9, 17, 155, 165], "render_imag": 3, "render_init": 3, "render_kei": 18, "render_mod": [2, 3, 9, 10, 15, 17, 18, 127, 128, 131, 132, 133, 134, 135, 136, 154, 155, 156, 157, 159, 160, 161, 165, 170, 173], "render_onli": 18, "render_st": 3, "rendercollect": [2, 17, 20, 155], "renderedfram": [17, 20], "renderfram": [2, 9, 10, 14, 15], "renderstatetyp": 3, "renorm": 129, "rep": 173, "repeat": [4, 16, 18, 20, 155, 157, 158, 173], "repeat_action_prob": 16, "rephras": 154, "replac": [17, 18, 137, 146, 150, 154, 155, 160, 161, 163, 165, 175], "replai": 10, "replay_buff": 155, "repo": [1, 154, 155], "report": [127, 137, 140, 141, 154, 155], "repositori": [136, 154], "repres": [2, 6, 7, 9, 18, 129, 131, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 151, 152, 153, 154, 155, 156, 157, 158, 160, 165, 169, 173, 177], "represent": [2, 155, 157, 166], "reproduc": [2, 6, 148, 155, 160, 173], "request": [18, 150, 155], "requir": [1, 2, 5, 6, 9, 10, 11, 14, 18, 136, 147, 154, 155, 157, 158, 162, 163, 165, 169, 170, 175, 177], "rerun": 155, "res_al": 173, "rescal": [14, 16, 18, 20], "rescale_act": 154, "rescaleact": [14, 15, 16, 20, 155, 157], "rescaleobserv": [14, 18, 20], "research": [136, 142, 148, 153, 155, 162], "reseed_sampl": 155, "reseed_valu": 155, "reset": [2, 3, 4, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 146, 147, 151, 152, 153, 154, 155, 156, 157, 161, 163, 170, 173, 175, 177], "reset_async": 155, "reset_clean": 17, "reset_noise_scal": [137, 138, 139, 140, 141, 142, 143, 146, 147, 169], "reset_qt": 173, "reset_wait": 155, "resetneed": 17, "reshap": [14, 18, 20, 173], "reshape_env": 18, "reshapeobserv": [14, 18, 20], "resiz": [14, 17, 18, 20, 155], "resized_env": 18, "resizeobserv": [14, 18, 20, 155], "resolv": 154, "resourc": [10, 11, 12, 165, 175], "respect": [14, 15, 17, 129, 131, 137, 138, 139, 140, 141, 146, 147, 154, 155, 157, 167, 170], "respectivesli": 14, "respond": 175, "rest": [128, 129, 139, 147, 158, 170], "restart": 157, "restor": [137, 138, 139, 140, 141, 146, 147], "restrict": 155, "result": [2, 6, 8, 10, 11, 12, 13, 17, 129, 131, 132, 136, 137, 140, 141, 142, 145, 149, 151, 153, 154, 155, 157, 158, 163, 170, 173, 175], "retriev": 4, "retro": [148, 155], "return": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 127, 137, 138, 139, 140, 141, 145, 146, 147, 150, 151, 152, 153, 154, 155, 157, 158, 160, 163, 165, 166, 167, 170, 173, 175, 177], "return_info": [2, 154], "return_queu": [14, 17, 161, 170, 175, 177], "reveal": 150, "revers": [5, 8, 150, 170, 177], "revert": [154, 155], "revisit": 17, "rew": [9, 14, 17, 19, 166], "reward": [2, 3, 4, 9, 10, 11, 12, 15, 17, 18, 20, 148, 154, 155, 156, 157, 158, 160, 161, 163, 165, 166, 167, 170, 173, 175, 177], "reward_al": 140, "reward_control": [144, 145], "reward_control_weight": [144, 145], "reward_ctrl": [14, 137, 139, 140, 144, 145, 147, 167], "reward_ctrl_weight": 167, "reward_dist": [144, 145, 167], "reward_dist_weight": [144, 145, 167], "reward_forward": [137, 138, 139, 140, 146, 147], "reward_fwd": 146, "reward_linvel": 140, "reward_moving_averag": 175, "reward_near": 144, "reward_near_weight": [144, 145], "reward_over_episod": 177, "reward_quadctrl": 140, "reward_rang": 155, "reward_run": [14, 138], "reward_surv": [137, 139, 140, 142, 143, 147], "reward_threshold": [4, 137, 138, 139, 140, 141, 144, 145, 146, 147, 155, 165], "rewards_df": 173, "rewards_over_se": 177, "rewards_to_plot": 177, "rewardtyp": [3, 155], "rewardwrapp": [14, 15, 19, 169], "rewrit": [154, 155], "rgb": [2, 14, 17, 18, 20, 128, 137, 138, 139, 140, 146, 147, 154], "rgb_arrai": [2, 3, 9, 17, 18, 20, 127, 128, 131, 132, 133, 134, 135, 136, 154, 161, 165, 170, 173], "rgb_array_list": [2, 9, 17, 154], "richard": 173, "right": [2, 7, 127, 128, 129, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 146, 147, 151, 152, 153, 155, 158, 165, 173, 175, 177], "right_back_leg": 137, "right_elbow": [140, 141], "right_foot": [140, 141], "right_hip_i": [140, 141], "right_hip_x": [140, 141], "right_hip_z": [140, 141], "right_kne": [140, 141], "right_lower_arm": [140, 141], "right_shoulder1": [140, 141], "right_shoulder2": [140, 141], "right_sin": [140, 141], "right_thigh": [140, 141], "right_upper_arm": [140, 141], "rigid": [140, 141], "rl": [9, 17, 148, 155, 156, 163, 166, 169, 175, 177], "rlbook2020": [150, 151], "rmsprop": 170, "rnd": [154, 155], "rng": [2, 3, 5, 6, 7, 9, 10, 154, 155, 165, 173], "road": [128, 155], "roadmap": 155, "robertoschiavon": 155, "robo": 148, "robot": [127, 133, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 157, 160, 166], "robust": [148, 170], "rocket": [129, 148], "rodrigo": 155, "rodrigodelazcano": [154, 155], "rogerjl": 155, "rohan138": 154, "roll": [14, 18, 19, 20, 144, 175], "rolling_length": [170, 175], "rollout": 153, "rom": [154, 155], "ronald": 177, "root": [6, 137, 140, 141, 155, 173], "rooti": [138, 139, 147], "rootx": [138, 139, 147], "rootz": [138, 139, 147], "rotat": [129, 131, 137, 140, 141, 144, 154, 175], "rotor": [137, 138, 139, 140, 141, 146, 147], "roughli": 136, "roulett": 154, "roundtrip": 17, "row": [151, 152, 173], "rpg": 152, "rr": 155, "rstride": 175, "rule": [148, 150, 175], "run": [1, 2, 4, 10, 11, 12, 14, 17, 18, 19, 127, 129, 131, 136, 137, 138, 139, 140, 146, 147, 148, 151, 154, 155, 157, 158, 162, 165, 169, 170], "run_": 1, "run_env": 173, "run_gymnasium_env": 165, "runner": 136, "running_g": 177, "runningmeanstd": [14, 18, 19], "runtim": 170, "runtimeerror": [11, 12, 170], "rushivarora": 154, "rusu24edward": [154, 155], "ryanrud": 154, "r\u00e9mi": 146, "s_": 170, "sab": [150, 163, 175], "safe": [129, 148, 155, 157], "safeti": [148, 154, 155], "sai": [17, 154, 167], "said": [3, 140], "same": [4, 6, 8, 9, 10, 11, 12, 13, 14, 16, 18, 20, 127, 128, 131, 136, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 153, 154, 155, 158, 161, 162, 166, 169, 170, 175], "sampl": [2, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 17, 18, 19, 129, 144, 153, 154, 155, 156, 157, 158, 160, 161, 162, 163, 165, 169, 170, 173, 175, 177], "sample_act": 177, "sample_mask": 6, "sample_n": 5, "sample_phas": 170, "samplefactori": 155, "sander": 155, "saniti": 173, "sashashura": [154, 155], "satisfi": 2, "satur": 148, "save": [14, 17, 154, 155, 160, 161, 167, 173], "save_logg": 9, "save_video": [9, 154], "save_videos1": 17, "save_videos2": 17, "save_videos3": 17, "save_weight": 170, "savefig": 173, "savefig_fold": 173, "sb3": 154, "scalabl": 148, "scalar": [7, 14, 16, 154], "scale": [14, 17, 18, 19, 129, 137, 138, 139, 140, 141, 142, 143, 146, 147, 148, 157, 162, 170], "scale_and_shift": 14, "scale_ob": 17, "scenario": [128, 148], "scene": [2, 148, 155, 169], "sched": 148, "schedul": [9, 148, 152], "scheme": 128, "schulhoff": 155, "schulman": [137, 170], "scope": [2, 166], "score": [127, 129], "scoreboard": 154, "scratch": [170, 177], "screen": [17, 137, 138, 139, 140, 146, 147, 154], "screen_siz": 17, "script": [1, 148, 161, 175], "seaborn": [173, 175, 177], "search": 155, "second": [1, 6, 9, 10, 17, 129, 132, 142, 145, 146, 148, 155, 161, 162, 169], "section": [16, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 157, 158, 160, 163, 165, 173, 175], "see": [1, 2, 5, 6, 9, 10, 14, 17, 19, 129, 131, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 151, 152, 154, 155, 157, 158, 160, 161, 163, 165, 169, 170, 173], "seed": [2, 3, 4, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 131, 132, 133, 134, 135, 136, 154, 155, 156, 157, 158, 165, 170, 173, 177], "seeded_valu": 155, "seem": 173, "seen": 131, "segment": 146, "select": [5, 7, 9, 136, 155, 157, 163, 169, 170], "select_act": 170, "self": [5, 6, 7, 9, 14, 18, 19, 134, 148, 155, 158, 163, 165, 167, 170, 173, 175, 177], "semant": 154, "send": 14, "sens": [7, 169, 173], "sensor": [128, 169], "sent": 154, "separ": [5, 17, 155, 158, 160, 165, 170], "sepsi": 148, "sequenc": [2, 4, 5, 6, 7, 8, 11, 12, 13, 14, 18, 148, 154, 155, 157], "sequenti": [170, 177], "seri": [6, 157, 161], "serial": [12, 170], "serv": [3, 5, 137, 138, 139, 140, 141, 146, 147, 177], "server": 155, "set": [2, 4, 5, 6, 7, 9, 10, 11, 12, 15, 16, 17, 18, 20, 126, 129, 130, 131, 133, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 152, 154, 155, 158, 160, 163, 165, 166, 167, 169, 170, 173, 175, 177], "set_attr": [11, 12, 155], "set_color": 173, "set_linewidth": 173, "set_mod": 165, "set_np_random_through_se": [2, 10], "set_rotate_label": 175, "set_stat": 154, "set_them": 173, "set_titl": [170, 173, 175], "set_vis": 173, "set_wrapper_attr": [15, 155], "set_xlabel": [170, 175], "set_xtick": 173, "set_xticklabel": 175, "set_ylabel": 175, "set_yticklabel": 175, "set_zlabel": 175, "setup": 154, "seungjaeryanle": 154, "seven": 147, "sever": [2, 4, 7, 10, 17, 129, 154, 155, 157, 158, 160, 161, 173], "sfff": 152, "sfffffff": 152, "shape": [2, 5, 6, 7, 8, 10, 11, 13, 14, 16, 17, 18, 20, 131, 132, 133, 134, 135, 137, 140, 141, 150, 151, 152, 153, 154, 155, 157, 158, 165, 167, 170, 173, 177], "share": [5, 11, 155, 177], "shared_featur": 177, "shared_memori": [11, 13], "shared_net": 177, "shen": 155, "shift": 7, "shimmi": 155, "shin": [138, 140, 141], "shinrl": 148, "shoot": 11, "short": [136, 154, 158, 163], "shorthand": 154, "should": [1, 2, 5, 7, 9, 10, 11, 12, 13, 14, 17, 136, 145, 154, 155, 157, 158, 160, 161, 162, 163, 165, 167, 169, 170, 175], "shoulder": 144, "shouldn": [155, 160, 165], "show": [3, 136, 150, 154, 155, 160, 161, 167, 170, 173, 175, 177], "showcas": [163, 175], "shown": [128, 131, 160, 166], "shrink_act": 14, "siddarth": [155, 177], "side": [13, 129, 173], "sign": [142, 143, 177], "signal": [2, 17, 137, 139, 140, 147, 153, 157, 165, 166, 173], "signatur": 159, "signific": [148, 155, 159, 169], "significantli": [154, 155, 170], "silenc": 9, "silent": 155, "silver": [170, 173], "sim": [148, 173], "similar": [6, 10, 144, 154, 155, 158, 160, 165], "similarli": [129, 155, 157], "simlat": 136, "simonini": 173, "simoninithoma": 173, "simpl": [127, 148, 149, 156, 157, 166, 177], "simplegrid": 148, "simpler": 157, "simplest": 170, "simpli": [14, 16, 17, 18, 19, 158, 165], "simplic": 148, "simplifi": [135, 154, 155, 157], "simplist": [158, 165], "simul": [2, 136, 140, 141, 142, 143, 148, 163, 173, 175, 177], "simultan": 155, "sin": [129, 135, 142, 145], "sinc": [5, 14, 17, 126, 132, 141, 145, 153, 154, 158, 159, 165, 166, 169, 173], "sine": [131, 142, 145], "singl": [2, 3, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 18, 20, 139, 147, 148, 154, 155, 157, 158, 162, 165, 175, 177], "single_action_spac": [10, 13, 14, 155, 170], "single_func": 14, "single_observation_spac": [10, 11, 14, 155, 170], "single_rgb_arrai": 154, "sinusoid": [133, 134], "site": 154, "situat": 153, "six": 147, "size": [5, 13, 14, 17, 152, 154, 155, 158, 165, 169], "skeleton": 165, "skip": [9, 17, 18, 20, 154, 155, 165], "skip_and_max_ob": 18, "skip_close_check": 9, "skip_render_check": 9, "sleep": 129, "slide": [138, 139, 142, 143, 144, 145, 146, 147], "slider": [142, 143], "slider1": 146, "slider2": 146, "slight": 127, "slightli": [9, 127, 136, 137, 140, 141, 142, 145], "slime": 148, "slimevolleygym": 148, "slipperi": [151, 152], "slow": [155, 163, 175], "slowdown": 9, "slower": [129, 170], "small": [127, 148, 149, 154, 155, 177], "smallest": 7, "smart": 148, "smoother": [154, 175], "sn": [173, 175, 177], "snake_cas": 1, "snippet": [9, 17], "snow": 152, "so": [1, 9, 11, 14, 17, 18, 20, 129, 130, 137, 138, 139, 140, 141, 144, 145, 146, 147, 151, 152, 154, 155, 157, 163, 165, 166, 167, 169, 170, 173, 175], "soft": 148, "softmax": 170, "softrobot": 148, "softwar": [2, 148, 155], "sokoban": 148, "sole": [154, 159], "solut": [129, 155, 163, 175], "solv": [2, 127, 130, 132, 136, 142, 143, 148, 163, 165, 171, 173, 176, 177], "some": [2, 5, 6, 7, 11, 12, 14, 15, 128, 130, 131, 145, 146, 153, 154, 155, 157, 158, 160, 163, 165, 167, 169, 170, 173, 175], "someth": [154, 155, 157, 169], "sometim": [9, 137, 151, 152, 165, 167], "sonelu": 155, "soon": [154, 160, 165], "sort": 7, "sourc": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 136, 148, 154, 155, 165, 166, 167, 170, 171, 173, 175, 177], "south": 153, "space": [2, 9, 10, 11, 12, 14, 15, 16, 17, 18, 20, 136, 148, 149, 154, 155, 158, 163, 165, 167, 170, 173, 177], "spacecraft": 148, "spaces_kwarg": 6, "spaceship": 157, "span": 17, "spark": 148, "spars": [131, 165], "spawn": [11, 145, 170], "spec": [2, 3, 4, 10, 15, 17, 155], "special": [155, 159, 177], "specialis": 155, "specialist": [159, 162], "specif": [2, 4, 7, 10, 14, 17, 136, 154, 155, 166, 170, 177], "specifi": [2, 5, 6, 9, 14, 16, 17, 18, 20, 126, 127, 129, 130, 135, 137, 138, 139, 140, 141, 143, 146, 147, 149, 152, 153, 154, 155, 157, 158, 160, 161, 165, 167], "speed": [10, 127, 128, 155, 157, 161, 170], "speedup": 170, "spell": 154, "spent": [127, 129], "sphere": 144, "spheric": 145, "sphinx": 1, "spine": 173, "spinningup": 177, "split": [147, 155], "squar": [17, 144, 145, 153, 158, 165, 170, 173], "squeez": 170, "st": 173, "st_all": 173, "stabil": [14, 18, 19, 140, 141, 154, 167, 177], "stabl": [18, 148, 154, 155, 165], "stable_baselines3": 18, "stack": [6, 17, 18, 20, 155, 177], "stack_siz": 18, "stacked_env": 18, "stage": [17, 155], "stai": [158, 170], "stand": [127, 136, 137, 139, 140, 141, 142, 143, 147, 155, 163, 175], "standard": [14, 18, 19, 148, 154, 155, 156, 170, 177], "standard_norm": 154, "standup": 136, "start": [1, 2, 5, 7, 9, 11, 13, 17, 155, 156, 170, 173, 175, 177], "start_epsilon": [163, 175], "state": [2, 3, 4, 17, 18, 20, 130, 136, 148, 149, 154, 155, 157, 158, 160, 163, 166, 169, 170, 173, 175, 177], "state_dict": 170, "state_info": 3, "state_s": 173, "state_valu": [170, 175], "state_value_pr": 170, "statefulli": 3, "stateless": [3, 155], "statement": 155, "statetyp": [3, 155], "stationari": 170, "statist": [9, 14, 17, 18, 19, 129, 155, 161, 177], "stats_kei": [14, 17], "std": [14, 19, 177], "stderr": 154, "steer": 128, "stefanbschneid": 155, "step": [2, 3, 4, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 127, 129, 131, 132, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 151, 152, 153, 154, 155, 156, 157, 161, 162, 163, 166, 167, 170, 173, 175, 177], "step_api_compat": [9, 159], "step_async": 155, "step_index": 9, "step_info": 3, "step_return": 9, "step_starting_index": 9, "step_trigg": [9, 17], "step_wait": 155, "stepapicompat": [155, 159], "steppabl": 148, "steps_df": 173, "stick": [150, 175], "stickyact": [16, 20, 155], "still": [9, 154, 155, 160, 161, 167, 170, 175], "stochast": [130, 131, 133, 134, 136, 151, 153, 154, 173], "stock": 148, "stool": 152, "stop": [17, 150, 155, 157, 166, 175], "stopiter": 13, "storag": 148, "store": [4, 9, 10, 14, 17, 155, 177], "str": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 160, 173, 175, 177], "strang": 155, "strateg": [133, 134], "strategi": [162, 163, 175], "strict": [2, 154, 155], "strictli": 17, "striker": 154, "string": [2, 4, 5, 7, 9, 14, 18, 154, 155, 157, 158, 165], "stringio": 2, "stronger": 169, "structur": [3, 5, 6, 8, 155, 158, 165], "stuck": 160, "stump": 127, "style": [2, 4, 10, 148, 154, 155, 161, 177], "sub": [6, 10, 11, 12, 14, 155, 162], "subclass": [14, 154], "subject": 146, "submit": [148, 154], "subplot": [170, 173, 175], "subprocess": 11, "subse": 154, "subsect": 169, "subsequ": [6, 154], "subset": [7, 155, 167], "subspac": [5, 6, 7, 8, 14, 18, 154], "substanti": [155, 162], "subtract": 170, "succeed": 157, "success": [131, 153, 155, 166], "successfulli": [2, 153], "suffici": [163, 175], "suggest": [14, 19, 154, 170], "suit": 154, "suitabl": [148, 149], "sum": [6, 14, 17, 19, 129, 150, 155, 163, 173, 175, 177], "summari": 160, "summaris": 155, "sumo": 148, "super": [2, 14, 15, 148, 154, 158, 160, 165, 167, 170, 177], "superclass": [5, 16, 19], "supersuit": 155, "supertuxkart": 148, "suppli": 166, "support": [2, 5, 6, 8, 9, 17, 18, 20, 136, 137, 138, 139, 140, 141, 143, 146, 147, 148, 154, 155, 157, 165], "supportsfloat": [2, 7, 15, 19, 160, 167, 177], "suppos": 9, "suptitl": [170, 175], "sure": [1, 2, 9, 17, 158, 165, 167], "surfac": [142, 143, 147, 165], "surfarrai": 165, "surround": [17, 20], "survei": 162, "suspend": 146, "sutton": [2, 131, 132, 142, 143, 150, 151, 154, 163, 173, 175], "sutton_barto_reward": [132, 155], "swig": [126, 154, 155, 157], "swim": 136, "swimmer": [136, 147, 154], "swing": [131, 135], "swingup": 135, "switch": [154, 155], "symbol": 148, "sync": [4, 9, 10, 12, 14, 132, 155], "synchron": [10, 11, 12, 155], "syncvectorenv": [4, 10, 132, 155, 158, 162], "syntax": 4, "synthes": 148, "synthesi": [140, 141], "system": [131, 135, 136, 142, 154, 155, 160, 165], "system_check": 6, "t": [1, 2, 4, 6, 8, 9, 14, 15, 17, 19, 128, 129, 133, 134, 153, 154, 155, 158, 160, 161, 163, 165, 166, 170, 173, 175], "t_cov": [5, 13], "tabl": 154, "tabular": [148, 163], "tag": 148, "tai": 155, "tailkind": 9, "take": [1, 2, 3, 4, 7, 8, 9, 10, 11, 16, 17, 130, 132, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 153, 154, 157, 160, 163, 167, 170, 173, 175, 177], "taken": [10, 17, 18, 20, 130, 132, 157, 161, 162, 165, 170, 172, 177], "talk": [162, 177], "tall": 137, "tan": 155, "tanh": [129, 177], "target": [18, 131, 136, 144, 145, 158, 165, 166, 167, 170], "target_dur": 9, "target_i": 145, "target_posit": 167, "target_x": 145, "task": [2, 6, 15, 128, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 153, 157, 165, 166, 167, 173, 177], "tassa": [139, 140, 141], "tau": 135, "taxi": [149, 154], "taxi_col": 153, "taxi_row": 153, "td": [170, 173], "td3": 172, "td_error": 170, "team": [154, 155], "technic": [16, 155], "techniqu": [17, 20], "techreport": [133, 134], "tell": [163, 166, 175], "templat": [3, 165], "temporal_differ": [163, 175], "temporarili": 13, "ten": [150, 175], "tendon": [140, 141], "tendon_length": [140, 141], "tendon_veloc": [140, 141], "tensor": [14, 17, 20, 170, 177], "tensorboard": 161, "term": [2, 14, 18, 130, 131, 136, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 158, 165, 167, 170], "termin": [2, 3, 9, 10, 11, 12, 14, 17, 18, 19, 20, 131, 132, 133, 134, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 163, 165, 167, 170, 173, 175, 177], "terminal_observ": 154, "terminal_on_life_loss": 17, "terminaltyp": [3, 155], "terminate_when_unhealthi": [137, 139, 140, 147, 169], "terminatedtruncatedsteptyp": 9, "terminationwrapp": 169, "terrain": [127, 148], "terri": 155, "terribli": 154, "test": [9, 17, 127, 148, 154, 155, 165, 177], "tetri": 148, "text": [2, 5, 7, 8, 13, 150, 151, 152, 153, 154, 155, 157], "textencod": 154, "th": [17, 18, 20], "than": [1, 4, 129, 131, 132, 133, 134, 143, 144, 150, 154, 155, 160, 161, 163, 169, 170, 173, 175], "thank": 155, "theden": 154, "thei": [5, 11, 17, 20, 136, 149, 150, 151, 152, 154, 155, 158, 160, 162, 165, 169, 170], "theirfor": 170, "them": [3, 5, 14, 15, 17, 129, 137, 138, 145, 148, 153, 154, 155, 159, 167, 169, 170], "themselv": [148, 161], "theo": 155, "theorem": 148, "theoret": [148, 154, 155], "theori": [135, 142, 154, 157, 170, 173], "therefor": [2, 5, 17, 137, 140, 141, 145, 147, 154, 155, 158, 160, 161], "thesi": [133, 134, 146], "theta": [135, 154], "theta1": 131, "theta2": 131, "theta_dt": 135, "thi": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 165, 166, 167, 169, 170, 172, 173, 175, 177], "thigh": [138, 139, 140, 141, 147], "thigh_joint": [139, 147], "thigh_left_joint": 147, "thing": [13, 15, 154, 158, 167], "think": [13, 148], "third": [137, 138, 139, 140, 141, 143, 146, 147, 154, 155, 157, 165], "third_party_environ": 154, "thoma": 173, "those": [13, 14, 131, 154, 167, 170, 173], "though": [6, 11, 154], "thought": [154, 155], "thousand": [155, 161], "thousandth": 17, "thread": 155, "three": [5, 6, 15, 139, 140, 141, 146, 154, 158, 161, 165], "threshold": [4, 131, 132, 153, 165], "throttl": 129, "through": [2, 3, 4, 5, 10, 11, 12, 14, 17, 18, 19, 20, 140, 141, 148, 154, 155, 156, 158, 159, 160, 161, 162, 166], "throw": [155, 167], "thrower": 154, "thruster": 129, "thu": [11, 137, 140, 141, 142, 170], "thytu": 155, "ti": 173, "tick": 165, "tight": 173, "tight_layout": [170, 173, 175], "tile": [128, 173], "tileset": 152, "till": [170, 175], "till2": 155, "tilleri": 152, "tilt": 129, "tim": 173, "time": [1, 2, 6, 9, 10, 14, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 127, 128, 132, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 148, 151, 153, 154, 155, 158, 160, 161, 162, 163, 165, 168, 169, 170, 171, 172, 175, 177], "time_limit": [152, 153], "time_queu": [14, 17, 161], "timeawareobserv": [10, 18, 20, 155, 157], "timelimit": [2, 4, 15, 17, 20, 127, 128, 129, 131, 132, 133, 134, 135, 155, 157, 158, 165, 166], "times10": [137, 140, 141], "timestamp": 2, "timestep": [2, 17, 18, 20, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 157, 158, 165, 169, 170, 177], "timgates42": 154, "timschneider42": 155, "tip": [138, 142, 146, 147], "tips_arm": 144, "titl": [9, 133, 134, 155, 173, 175, 177], "tmrl": 148, "to_json": [5, 155], "tobirohr": 155, "toctre": 1, "todo": 153, "todorov": [139, 140, 141], "togeth": [5, 12, 136, 173], "toi": [126, 150, 151, 152, 153, 154], "tom": 153, "toml": 165, "ton": 154, "too": [137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 169, 173], "tool": [137, 138, 139, 140, 141, 143, 146, 147], "toolbox": 148, "toolkit": 148, "top": [128, 129, 130, 133, 134, 139, 142, 143, 147, 148, 154, 155, 158, 165, 173, 177], "torch": [2, 14, 17, 155, 170, 177], "torqu": [127, 129, 131, 135, 137, 138, 139, 140, 141, 144, 145, 146, 147], "torso": [137, 138, 139, 140, 141, 147, 169], "toss": 7, "total": [2, 127, 128, 129, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 153, 161, 170, 173, 177], "total_episod": 173, "total_num_episod": 177, "total_reward": 173, "total_timestep": 155, "touch": [14, 158, 165], "touretzki": 131, "toward": [146, 154], "tower": 155, "towers_gymnasium_2023": 155, "town": 155, "toy_text": [152, 154, 163, 173, 175], "tpu": 162, "tqdm": [163, 170, 173, 175], "trace": 127, "traceback": [13, 17], "track": [9, 14, 17, 20, 128, 132, 137, 138, 139, 140, 146, 147, 148, 161], "trackmania": 148, "trade": [148, 155], "tradeoff": 170, "traffic": 148, "train": [137, 140, 141, 142, 145, 148, 154, 155, 157, 160, 168, 172], "training_error": [163, 175], "training_error_moving_averag": 175, "training_period": 161, "trajectori": [14, 18, 19, 129, 140, 141], "transfer": [148, 162], "transform": [3, 5, 8, 9, 14, 15, 16, 18, 19, 20, 145, 157, 167], "transformact": [14, 16, 20], "transformobserv": [14, 18, 20], "transformreward": [14, 19, 20], "transit": [3, 9, 140, 141, 151, 152, 153, 155, 156, 157, 166, 170], "transition_info": 3, "translat": [137, 140, 141, 155, 158, 165], "transport": 148, "transpos": [9, 165], "treasur": 173, "treatment": 148, "tremend": 154, "tri": 155, "trigaten": 154, "trigger": [9, 17, 153], "tristan": 155, "tristandeleu": 154, "trivial": 165, "true": [2, 4, 6, 8, 9, 10, 11, 12, 14, 16, 17, 18, 19, 127, 128, 132, 136, 137, 138, 139, 140, 141, 146, 147, 150, 152, 154, 155, 157, 159, 160, 161, 165, 169, 170, 173, 175, 177], "true_": [16, 18], "trunc": [14, 18], "truncat": [2, 4, 9, 10, 11, 12, 14, 17, 18, 19, 20, 128, 131, 132, 133, 134, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 163, 165, 167, 169, 170, 173, 175, 177], "trunk": 169, "try": [141, 148, 154, 155, 163, 175], "tuckerbmorgan": 154, "tune": [148, 177], "tupl": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 20, 137, 139, 140, 147, 150, 154, 155, 157, 158, 160, 163, 165, 170, 175, 177], "turbul": [129, 154], "turbulence_pow": [129, 170], "turn": [9, 11, 17, 128, 129, 142, 148, 154, 155, 160], "tutori": [2, 15, 155, 158, 160, 161, 163, 167, 169, 170, 173, 175, 177], "tutorials_jupyt": 171, "tutorials_python": 171, "tweak": [136, 146], "two": [2, 5, 6, 7, 9, 10, 17, 18, 20, 127, 129, 131, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 145, 146, 147, 148, 150, 154, 155, 158, 160, 161, 162, 163, 165, 170, 175], "txt": [1, 154], "type": [2, 4, 5, 6, 7, 10, 14, 16, 17, 18, 20, 128, 132, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 158, 160, 161, 165, 167, 173, 177], "typeerror": 155, "typic": [2, 9, 153, 169], "typo": [154, 155], "u": [5, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 158, 163, 165, 167, 175, 177], "uav": 148, "uint8": [18, 128], "un": 132, "unattach": [144, 145], "unbound": [7, 18], "unclear": 155, "uncommon": 155, "undefin": 2, "under": [2, 155, 163, 175, 177], "underli": [2, 15, 17, 131, 155, 157], "underneath": [15, 132, 157], "underpow": 130, "underscor": 154, "understand": [2, 8, 10, 144, 155, 177], "undocu": 155, "uneven": 127, "unexpect": [11, 129, 154], "unflatten": [5, 8, 155], "unhealthi": [137, 139, 140, 142, 143, 147], "unicod": 9, "uniform": [5, 7, 8, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 173], "uniformli": [7, 131, 132, 153, 158, 165], "unintend": 155, "union": [6, 7, 19, 155], "uniqu": [6, 10, 126, 130, 136, 142, 154], "unit": [5, 14, 18, 19, 20, 129, 131, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148], "uniti": 148, "unitless": [142, 145], "unitre": 169, "unitree_go1": 169, "univers": [133, 134, 173], "unknown": [2, 9, 10, 148, 157], "unless": [7, 15, 150, 151, 152, 153, 154, 165], "unlik": [6, 177], "unnecessari": [154, 155], "unnescesari": 154, "unord": 5, "unpickl": [9, 13, 154, 155], "unplay": 9, "unsupport": 7, "untermin": 132, "until": [17, 144, 150, 151, 152, 154, 155, 157, 158, 165, 170, 173, 177], "unus": 154, "unwrap": [2, 10, 15, 131, 155, 157, 158], "up": [2, 6, 7, 10, 127, 129, 136, 137, 139, 140, 141, 147, 150, 151, 152, 153, 155, 157, 158, 161, 163, 165, 170, 173, 175], "updat": [1, 2, 3, 9, 14, 16, 17, 18, 19, 137, 139, 140, 147, 153, 154, 155, 157, 158, 159, 160, 163, 165, 166, 167, 170, 173, 175, 177], "update_paramet": 170, "update_running_mean": [18, 19], "upgrad": [154, 155], "uph": 141, "uph_cost": 141, "uph_cost_weight": 141, "upon": [133, 134], "upper": [7, 14, 19, 20, 140, 141, 154, 157], "upright": [132, 135, 142, 143], "upstream": 155, "urandom": 2, "urdf": 169, "url": [154, 155, 170], "urldat": 155, "us": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 126, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 152, 153, 154, 155, 157, 159, 160, 161, 162, 163, 166, 167, 169, 171, 175, 176], "usabl": [150, 154, 163, 175], "usable_ac": 175, "usag": [9, 155, 158, 163, 167], "use_contact_forc": [137, 155], "use_cuda": 170, "useless": 169, "user": [2, 3, 4, 5, 9, 10, 12, 14, 18, 20, 131, 132, 133, 134, 135, 154, 155, 157, 160, 165, 177], "usual": [2, 6, 7, 17, 139, 158, 165, 167, 169, 170, 175], "util": [18, 148, 154, 155, 157, 159, 165, 170], "utilis": [10, 154, 160], "utli": 155, "v": [4, 133, 170], "v0": [2, 4, 9, 10, 14, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 157, 158, 165, 166, 177], "v1": [2, 4, 9, 10, 11, 12, 14, 16, 17, 18, 19, 127, 128, 129, 131, 132, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 154, 157, 159, 161, 163, 173, 175], "v2": [127, 128, 129, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 167], "v21": [154, 155], "v24": 154, "v25": [154, 160], "v26": [2, 155, 177], "v28": 155, "v2g": 148, "v3": [9, 14, 17, 18, 127, 128, 129, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 156, 157, 160, 170], "v4": [14, 15, 16, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 177], "v5": [17, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 169], "vairodp": 155, "val": 173, "valentin": 155, "valid": [2, 4, 5, 7, 10, 13, 14, 157, 160, 170, 175], "vallei": [133, 134], "valu": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 20, 127, 129, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 154, 155, 157, 158, 160, 163, 165, 166, 169, 170, 173, 175, 177], "value_grid": 175, "value_pr": 170, "valueerror": [7, 8, 11, 12, 13, 17], "vampir": 148, "var": 19, "vari": [132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 177], "variabl": [2, 5, 7, 10, 11, 13, 14, 15, 17, 136, 137, 139, 146, 147, 148, 154, 155, 157, 161, 163, 165, 166, 175, 177], "varianc": [14, 18, 19, 20, 170], "variant": [128, 158, 165], "variat": 136, "varieti": [9, 148], "variou": [5, 141, 148, 154, 177], "ve": [154, 155], "vec_env": 9, "vecenvwrapp": 154, "vector": [2, 4, 7, 9, 11, 12, 16, 17, 18, 19, 127, 129, 142, 143, 145, 148, 154, 155, 158, 161, 168, 171, 177], "vector_entry_point": [4, 132, 155], "vector_env": 14, "vector_envs_tutori": 170, "vector_kwarg": 4, "vector_list_info": 154, "vectoractionwrapp": [14, 155], "vectorenv": [4, 5, 14, 154, 155], "vectorenvcr": 4, "vectorenvwrapp": 155, "vectoris": [5, 148], "vectorization_mod": [4, 9, 10, 11, 12, 14, 132, 155], "vectorizemod": 4, "vectorizetransformact": [14, 20], "vectorizetransformobserv": [14, 20], "vectorizetransformreward": [14, 20], "vectorizor": [4, 132], "vectorlistinfo": [14, 154, 155], "vectorobservationwrapp": [14, 155], "vectorrewardwrapp": [14, 155], "vectorwrapp": [14, 155], "vehicl": 148, "veloc": [2, 8, 13, 127, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148], "velocity_penalti": 142, "veri": [6, 129, 144, 154, 155, 157, 158, 162, 165, 170], "verifi": 9, "vermouth1992": [154, 155], "versa": [13, 17, 18], "version": [1, 2, 4, 5, 14, 16, 17, 18, 19, 148, 154, 155, 158, 163, 165, 166, 170, 175], "vertic": [127, 139, 141, 142, 143, 147, 158, 165], "vf_next_stat": 166, "vf_target": 166, "via": [5, 9, 14, 15, 17, 126, 130, 136, 148, 149, 157, 158, 165, 177], "viabl": [163, 175], "vice": [13, 17, 18], "vicent": 155, "video": [1, 17, 20, 127, 154, 155, 161, 173], "video_fold": [9, 17, 161], "video_length": [9, 17], "videorecord": [17, 154, 155], "view": [2, 155], "view_init": 175, "viewer": [10, 11, 12, 155], "viewport": 129, "viewport_h": 129, "viewport_w": 129, "virgilt": 154, "viridi": 175, "viscou": 146, "visibl": 165, "visit": [128, 173], "visual": 157, "visual_opt": [136, 155], "visualis": 2, "vizdoom": 148, "vmoen": [154, 155], "voidflight": 155, "vol": [131, 153], "volleybal": 148, "volunt": 155, "vscode": 155, "vwxyzjn": 154, "vx": [14, 17, 129], "vy": 129, "w": [9, 137, 140, 141, 170], "w_": [137, 138, 139, 140, 141, 144, 145, 146, 147], "wa": [2, 4, 6, 9, 10, 14, 17, 18, 19, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 159, 160, 163, 166, 167, 170, 177], "wai": [2, 13, 15, 17, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 157, 158, 163, 166, 167, 170, 173, 175], "wait": [9, 10, 11, 154, 166], "wait_on_play": 9, "wake": 129, "walk": [136, 140, 147, 149, 152, 155, 158, 165, 169], "walker": [126, 145, 147, 154], "walker2d": [136, 155], "walker2d_v5": 147, "wall": [133, 134, 153], "wandb": 161, "want": [1, 2, 4, 6, 11, 15, 17, 137, 140, 141, 154, 155, 157, 158, 161, 163, 165, 167, 169, 170, 173, 175], "ward": 177, "warehous": 148, "warn": [6, 9, 11, 17, 144, 154, 155], "wasn": 17, "watch": 1, "watkin": 163, "wawrzy\u0144ski": 138, "we": [1, 2, 3, 4, 5, 7, 9, 10, 13, 18, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 165, 166, 167, 169, 170, 175, 177], "web": 148, "websit": [21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 136, 154, 155], "weigh": 137, "weight": [137, 138, 139, 140, 141, 144, 145, 146, 147, 148, 154, 166, 167, 177], "well": [2, 5, 14, 131, 148, 154, 165, 170, 173], "were": [126, 131, 136, 154, 155, 160, 165, 169, 170], "west": 153, "what": [2, 5, 14, 19, 148, 151, 155, 162], "wheel": [126, 128, 154], "when": [2, 4, 5, 6, 8, 9, 10, 11, 12, 13, 14, 17, 18, 20, 128, 129, 137, 139, 140, 141, 142, 143, 144, 147, 151, 152, 153, 154, 155, 157, 158, 159, 161, 165, 166, 167, 169, 170, 173, 177], "whenev": [17, 129, 160, 165], "where": [1, 4, 6, 7, 8, 9, 14, 17, 128, 129, 131, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 160, 162, 163, 165, 166, 167, 169, 170, 173, 175, 177], "wherea": 173, "whether": [2, 3, 4, 5, 7, 9, 17, 129, 137, 138, 139, 140, 141, 146, 147, 148, 150, 160, 163, 165, 166, 175, 177], "which": [1, 2, 4, 5, 6, 8, 9, 10, 14, 17, 18, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 157, 158, 160, 162, 165, 166, 167, 169, 170, 172, 173, 177], "while": [4, 8, 9, 11, 14, 17, 18, 19, 129, 131, 132, 135, 137, 140, 141, 146, 150, 151, 154, 157, 158, 160, 161, 163, 165, 166, 170, 173, 175, 177], "white": 154, "who": 155, "whole": [4, 154, 155, 169], "whose": [137, 150], "why": 129, "wide": [148, 157], "width": [136, 155, 165], "wikipedia": 173, "willdudlei": 155, "william": [133, 134, 177], "win": [150, 163, 175], "wind": [129, 154, 155], "wind_pow": [129, 154, 170], "window": [2, 9, 128, 136, 154, 155, 165], "window_s": 165, "wing": 148, "wire": 148, "wireless": 148, "wise": 177, "wish": [9, 154, 155, 157, 160, 161], "within": [2, 5, 9, 13, 14, 16, 17, 18, 20, 129, 130, 143, 154, 155, 157, 158, 160], "without": [14, 15, 17, 19, 140, 148, 150, 152, 154, 155, 157, 158, 160, 163, 165, 169, 175], "won": [2, 163, 175], "word": [144, 177], "work": [5, 9, 128, 129, 131, 138, 139, 142, 143, 148, 154, 155], "worker": 11, "workflow": 155, "workshop": 148, "world": [129, 148, 151, 152, 153, 155, 173], "worldbodi": [137, 140, 141], "worri": 165, "wors": 154, "would": [6, 8, 16, 18, 19, 137, 138, 139, 140, 146, 147, 154, 155, 156, 158, 165, 167, 170], "wouldn": 155, "wrap": [2, 4, 9, 11, 12, 14, 15, 16, 17, 18, 19, 20, 148, 157, 165, 167, 170, 177], "wrapped_env": [14, 15, 16, 17, 18, 157, 158, 165, 167, 177], "wrapped_env_ob": 16, "wrapped_obs0": 18, "wrapped_obs1": 18, "wrappedenvacttyp": 155, "wrappedenvobstyp": 155, "wrapper": [2, 4, 6, 7, 9, 10, 13, 137, 138, 139, 140, 141, 143, 146, 147, 148, 152, 153, 154, 155, 156, 157, 159, 161, 163, 168, 170, 171, 175, 177], "wrapper_spec": 15, "wrapperacttyp": [15, 16], "wrapperobstyp": [15, 18], "wrapperspec": [4, 15], "wrist": 144, "write": [5, 11, 13, 155, 165, 169], "write_to_shared_memori": [5, 13], "written": [9, 155, 160, 162], "wrong": [154, 155], "www": [128, 150, 151, 152, 154], "x": [2, 5, 8, 14, 129, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 148, 151, 155, 158, 161, 165, 170, 173, 177], "x_": [137, 138, 139, 140, 142, 146, 147], "x_init": 133, "x_posit": [14, 137, 138, 139, 140, 141, 146, 147], "x_veloc": 14, "xml": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 169], "xml_file": [136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 169], "xpo": [137, 140, 144, 145], "xtick": 175, "xticklabel": 173, "xuanhien070594": 155, "xuehaipan": [154, 155], "xx": 173, "xy": 142, "y": [2, 129, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 153, 158, 173, 177], "y_": 142, "y_coordin": 142, "y_init": 133, "y_posit": [137, 138, 139, 140, 141, 146, 147], "yaniv": 155, "year": [133, 134, 154, 155], "yellow": 153, "yet": [148, 155], "yield": 13, "ylabel": 173, "you": [1, 2, 5, 6, 7, 8, 9, 11, 15, 16, 17, 18, 19, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 127, 128, 137, 140, 141, 148, 154, 155, 156, 157, 158, 161, 163, 165, 166, 167, 169, 170, 172, 173], "youjiacheng": 154, "youni": [154, 155], "younik": [154, 155], "your": [1, 2, 5, 9, 11, 15, 16, 17, 18, 128, 137, 140, 141, 148, 154, 155, 156, 157, 158, 161, 162, 163, 167, 168, 170, 171, 175], "your_env": 9, "yourself": [11, 128, 163, 165, 175], "ytick": 175, "yticklabel": 173, "z": [137, 138, 139, 140, 141, 142, 144, 145, 147], "z_": 141, "z_distance_from_origin": [139, 141, 147], "zaxi": 175, "zemann": [170, 175], "zenodo": 155, "zero": [7, 13, 18, 135, 137, 139, 140, 141, 147, 155, 158, 163, 165, 166, 170, 173, 175, 177], "zero_grad": [170, 177], "zhiqingxiao": 154, "zip": 171, "zlig": 154, "zoom": 9, "zuoxingdong": 154, "zzyunzhi": 154}, "titles": ["404 - Page Not Found", "Gymnasium-docs", "Env", "Functional Env", "Make and register", "Spaces", "Composite Spaces", "Fundamental Spaces", "Spaces Utils", "Utility functions", "Vectorize", "AsyncVectorEnv", "SyncVectorEnv", "Utility functions", "Wrappers", "Wrappers", "Action Wrappers", "Misc Wrappers", "Observation Wrappers", "Reward Wrappers", "List of Wrappers", "Atari", "Adventure", "Air Raid", "Alien", "Amidar", "Assault", "Asterix", "Asteroids", "Atlantis", "Atlantis2", "Backgammon", "Bank Heist", "Basic Math", "Battle Zone", "Beam Rider", "Berzerk", "Blackjack", "Bowling", "Boxing", "Breakout", "Carnival", "Casino", "Centipede", "Chopper Command", "Crazy Climber", "Crossbow", "Darkchambers", "Defender", "Demon Attack", "Donkey Kong", "Double Dunk", "Earthworld", "Elevator Action", "Enduro", "Entombed", "Et", "Fishing Derby", "Flag Capture", "Freeway", "Frogger", "Frostbite", "Galaxian", "Gopher", "Gravitar", "Hangman", "Haunted House", "Hero", "Human Cannonball", "Ice Hockey", "Jamesbond", "Journey Escape", "Kaboom", "Kangaroo", "Keystone Kapers", "King Kong", "Klax", "Koolaid", "Krull", "Kung Fu Master", "Laser Gates", "Lost Luggage", "Mario Bros", "Miniature Golf", "Montezuma Revenge", "Mr Do", "Ms Pacman", "Name This Game", "Othello", "Pacman", "Phoenix", "Pitfall", "Pitfall2", "Pong", "Pooyan", "Private Eye", "Qbert", "Riverraid", "Road Runner", "Robotank", "Seaquest", "Sir Lancelot", "Skiing", "Solaris", "Space Invaders", "Space War", "Star Gunner", "Superman", "Surround", "Tennis", "Tetris", "Tic Tac Toe 3D", "Time Pilot", "Trondead", "Turmoil", "Tutankham", "Up N Down", "Venture", "Video Checkers", "Video Chess", "Video Cube", "Video Pinball", "Wizard Of Wor", "Word Zapper", "Yars Revenge", "Zaxxon", "Box2D", "Bipedal Walker", "Car Racing", "Lunar Lander", "Classic Control", "Acrobot", "Cart Pole", "Mountain Car", "Mountain Car Continuous", "Pendulum", "MuJoCo", "Ant", "Half Cheetah", "Hopper", "Humanoid", "Humanoid Standup", "Inverted Double Pendulum", "Inverted Pendulum", "Pusher", "Reacher", "Swimmer", "Walker2D", "External Environments", "Toy Text", "Blackjack", "Cliff Walking", "Frozen Lake", "Taxi", "Gym Release Notes", "Gymnasium Release Notes", "<no title>", "Basic Usage", "Create a Custom Environment", "Compatibility with Gym", "Migration Guide - v0.21 to v1.0.0", "Recording Agents", "Speeding Up Training", "Training an Agent", "Gymnasium Basics", "Make your own custom environment", "Handling Time Limits", "Implementing Custom Wrappers", "Gymnasium Basics", "Load custom quadruped robot environments", "Training A2C with Vector Envs and Domain Randomization", "Tutorials", "Third-Party Tutorials", "Frozenlake benchmark", "Training Agents", "Solving Blackjack with Q-Learning", "Training Agents", "Training using REINFORCE for Mujoco"], "titleterms": {"0": [154, 155, 160, 169], "0a1": 155, "0a2": 155, "1": [154, 155, 169], "11": 173, "12": 154, "19": 154, "2": [154, 155, 169], "20": 154, "21": [154, 159, 160], "22": 154, "23": 154, "24": 154, "25": 154, "26": [154, 155, 160], "27": 155, "28": 155, "29": 155, "3": [154, 155, 169], "3d": 111, "4": [154, 169, 173], "404": 0, "5": 154, "6": 154, "7": [154, 173], "9": [154, 173], "Not": 0, "Of": 122, "The": [0, 173], "__init__": 158, "a2c": 170, "about": 172, "acrobot": 131, "across": 136, "action": [14, 16, 53, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 157, 163, 175], "actionwrapp": 167, "actor": 170, "ad": 1, "addit": [4, 10, 11, 12], "advantag": 170, "adventur": 22, "agent": [161, 163, 169, 170, 171, 174, 175, 176, 177], "agilerl": 172, "air": 23, "algorithm": 162, "alien": 24, "alpha": 155, "alpha1": 155, "altern": 165, "amidar": 25, "an": [1, 163, 175, 177], "analysi": 170, "ant": 137, "api": [9, 159], "argument": [127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153], "assault": 26, "asterix": 27, "asteroid": 28, "asynchron": 170, "asyncvectorenv": 11, "atari": 21, "atlanti": 29, "atlantis2": 30, "attack": 49, "attribut": [2, 5, 10, 15], "autonom": 148, "avail": 16, "backend": 136, "backgammon": 31, "bank": 32, "base": [3, 16], "basic": [33, 157, 164, 168, 171], "battl": 34, "beam": 35, "benchmark": [9, 173], "berzerk": 36, "better": 175, "biolog": 148, "biped": 127, "blackjack": [37, 150, 175], "bowl": 38, "box": 39, "box2d": 126, "breakout": 40, "bro": 82, "build": [1, 163, 175, 177], "can": 175, "cannonbal": 68, "captur": 58, "car": [128, 133, 134], "carniv": 41, "cart": 132, "casino": 42, "centiped": 43, "check": 9, "checker": 118, "cheetah": 138, "chess": 119, "chopper": 44, "class": 16, "classic": 130, "cliff": 151, "climber": 45, "close": 165, "code": [157, 160, 166], "command": 44, "common": [14, 17], "compar": 136, "compat": [9, 159], "composit": [5, 6], "comput": 148, "construct": [158, 165], "continu": 134, "control": 130, "convers": [14, 17], "convert": 3, "core": 4, "could": 0, "crazi": 45, "creat": [2, 158, 165, 173], "credit": [127, 128, 129], "critic": 170, "crossbow": 46, "cube": 120, "curv": 177, "custom": [158, 165, 167, 169], "darkchamb": 47, "data": [14, 17], "declar": 165, "defend": 48, "demon": 49, "depend": 173, "derbi": 57, "descript": [127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153], "do": [85, 175], "doc": 1, "document": 1, "domain": 170, "donkei": 50, "doubl": [51, 142], "down": 116, "download": 169, "drive": 148, "dunk": 51, "dure": 161, "dynam": [133, 134], "earthworld": 52, "econom": 148, "edit": 1, "electr": 148, "elev": 53, "end": [131, 132, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153], "enduro": 54, "energi": 148, "entomb": 55, "env": [2, 3, 165, 170], "environ": [1, 2, 3, 9, 10, 132, 148, 157, 158, 159, 160, 162, 165, 169, 170, 173, 175], "epilogu": 169, "episod": [127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161], "escap": 71, "et": 56, "everi": 161, "exact": 136, "exampl": 160, "execut": [163, 175], "explain": 157, "extern": 148, "ey": 95, "financi": 148, "first": 148, "fish": 57, "flag": 58, "found": 0, "freewai": 59, "frogger": 60, "from": [165, 167], "frostbit": 61, "frozen": 152, "frozenlak": 173, "fu": 79, "function": [3, 4, 5, 9, 13, 158], "fundament": [5, 7], "galaxian": 62, "game": [87, 148], "gate": 80, "gener": [160, 165], "golf": 83, "gopher": 63, "gravitar": 64, "guid": 160, "gunner": 106, "gym": [148, 154, 159], "gymnasium": [1, 10, 148, 155, 164, 165, 167, 168, 171, 172], "half": 138, "handl": 166, "hangman": 65, "haunt": 66, "heist": 32, "hero": 67, "heurist": 162, "histori": [127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153], "hockei": 69, "hopper": 139, "hous": 66, "human": 68, "humanoid": [140, 141], "ic": 69, "implement": [2, 14, 18, 19, 167], "import": [166, 175], "inform": [150, 151, 152, 153, 157, 161], "inherit": 167, "initi": [155, 157, 165], "instanc": 165, "instruct": 1, "interact": 157, "introduct": 170, "invad": 104, "invert": [142, 143], "jamesbond": 70, "jax": 3, "journei": 71, "kaboom": 72, "kangaroo": 73, "kaper": 74, "keyston": 74, "king": 75, "klax": 76, "kong": [50, 75], "koolaid": 77, "krull": 78, "kung": 79, "lake": 152, "lancelot": 101, "lander": 129, "laser": 80, "learn": [166, 175, 177], "librari": 172, "limit": 166, "list": 20, "ll": 173, "load": [159, 169, 170], "lost": 81, "luggag": 81, "lunar": 129, "m": 86, "make": [4, 10, 158, 165], "map": 173, "mario": 82, "master": 79, "math": 33, "mathemat": 148, "medic": 148, "memori": 13, "method": [2, 5, 10, 11, 12, 15], "migrat": 160, "miniatur": 83, "misc": 17, "miscellan": 13, "model": 169, "modifi": [1, 157], "montezuma": 84, "more": [157, 161], "mountain": [133, 134], "mr": 85, "mujoco": [136, 177], "n": 116, "name": 87, "network": 177, "new": [1, 9], "note": [129, 146, 154, 155], "notic": 170, "number": 160, "observ": [14, 18, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 157, 158, 165, 169, 175], "observationwrapp": 167, "old": 9, "onli": [14, 20], "openai": 159, "optim": 162, "othello": 88, "other": [1, 148], "own": 165, "packag": 165, "pacman": [86, 89], "page": [0, 1], "paramet": [169, 173], "parti": [148, 172], "pendulum": [135, 142, 143], "perform": [9, 136, 170], "phoenix": 90, "pickl": 9, "pilot": 112, "pinbal": 121, "pitfal": 91, "pitfall2": 92, "plai": 170, "plot": [170, 177], "pole": 132, "polici": [163, 175, 177], "pong": 93, "pooyan": 94, "privat": 95, "pusher": 144, "q": [173, 175], "qbert": 96, "quadrup": 169, "race": 128, "raid": 23, "random": [160, 170], "reacher": 145, "recommend": 165, "record": 161, "refer": [128, 131, 150, 151, 153, 170, 173, 177], "regist": [4, 158, 165], "reinforc": 177, "releas": [154, 155], "remov": 160, "render": [9, 136, 160, 165], "reproduc": 136, "request": 0, "reset": [128, 158, 160, 165], "reveng": [84, 124], "reward": [14, 19, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 169], "rewardwrapp": 167, "rider": 35, "riverraid": 97, "road": 98, "robot": [148, 169], "robotank": 99, "run": 173, "runner": 98, "runtim": 9, "save": [9, 170], "seaquest": 100, "seed": [9, 160], "setup": [165, 169, 170, 175], "share": 13, "showcas": 170, "simul": 169, "sir": 101, "size": 173, "ski": 102, "solari": 103, "solut": [165, 166], "solv": 175, "space": [5, 6, 7, 8, 13, 104, 105, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 157], "speed": 162, "standard": 3, "standup": 141, "star": 106, "start": [127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153], "state": [127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 165], "step": [1, 9, 158, 159, 160, 165, 169], "subclass": 165, "superman": 107, "surround": 108, "swimmer": 146, "synchron": 170, "syncvectorenv": 12, "system": 148, "tabl": 173, "tac": 111, "taxi": 153, "telecommun": 148, "tenni": 109, "termin": [127, 128, 129, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 166, 169], "tetri": 110, "text": 149, "thi": 87, "think": 175, "third": [148, 172], "tic": 111, "time": [112, 166, 173], "timelimit": 160, "toe": 111, "toi": 149, "train": [136, 161, 162, 163, 169, 170, 171, 174, 175, 176, 177], "transit": [133, 134], "trondead": 113, "truncat": [135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 166], "try": 170, "turmoil": 114, "tutankham": 115, "tutori": [1, 171, 172], "tweak": 169, "uncommon": 17, "up": [116, 162], "us": [148, 158, 165, 170, 172, 173, 177], "usag": 157, "util": [5, 8, 9, 13], "v0": [154, 155, 159, 160], "v1": [155, 160], "variabl": 4, "vector": [5, 10, 13, 14, 20, 132, 162, 170], "vectorenv": 10, "ventur": 117, "version": [127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153], "video": [9, 118, 119, 120, 121, 148], "visual": [9, 173, 175], "visualis": [163, 175], "walk": 151, "walker": 127, "walker2d": 147, "war": 105, "we": 173, "weight": 170, "wizard": 122, "wor": 122, "word": 123, "wrapper": [14, 15, 16, 17, 18, 19, 20, 158, 160, 165, 167], "write": 1, "yar": 124, "you": 175, "your": [165, 169], "yourself": 170, "zapper": 123, "zaxxon": 125, "zone": 34}}) \ No newline at end of file +Search.setIndex({"alltitles": {"0.12.5": [[154, "release-0-12-5"]], "0.19.0": [[154, "release-0-19-0"]], "0.22.0": [[154, "release-0-22-0"]], "0.23.0": [[154, "release-0-23-0"]], "0.23.1": [[154, "release-0-23-1"]], "0.24.0": [[154, "release-0-24-0"]], "0.24.1": [[154, "release-0-24-1"]], "0.25.0": [[154, "release-0-25-0"]], "0.25.1": [[154, "release-0-25-1"]], "0.25.2": [[154, "release-0-25-2"]], "0.26.0": [[154, "release-0-26-0"]], "0.26.1": [[154, "release-0-26-1"]], "0.26.2": [[154, "release-0-26-2"]], "404 - Page Not Found": [[0, null]], "Acrobot": [[131, null]], "Action Space": [[127, "action-space"], [128, "action-space"], [129, "action-space"], [131, "action-space"], [132, "action-space"], [133, "action-space"], [134, "action-space"], [135, "action-space"], [137, "action-space"], [138, "action-space"], [139, "action-space"], [140, "action-space"], [141, "action-space"], [142, "action-space"], [143, "action-space"], [144, "action-space"], [145, "action-space"], [146, "action-space"], [147, "action-space"], [150, "action-space"], [151, "action-space"], [152, "action-space"], [153, "action-space"]], "Action Wrappers": [[16, null]], "Action and observation spaces": [[157, "action-and-observation-spaces"]], "Adding a new environment": [[1, "adding-a-new-environment"]], "Additional Methods": [[10, "additional-methods"], [11, "additional-methods"], [12, "additional-methods"]], "Additional functions": [[4, "additional-functions"]], "Advantage Actor-Critic (A2C)": [[170, "advantage-actor-critic-a2c"]], "Adventure": [[22, null]], "AgileRL": [[172, "agilerl"]], "Air Raid": [[23, null]], "Algorithmic heuristics": [[162, "algorithmic-heuristics"]], "Alien": [[24, null]], "Alternative solutions": [[165, "alternative-solutions"]], "Amidar": [[25, null]], "Ant": [[137, null]], "Arguments": [[127, "arguments"], [128, "arguments"], [129, "arguments"], [131, "arguments"], [132, "arguments"], [133, "arguments"], [134, "arguments"], [135, "arguments"], [137, "arguments"], [138, "arguments"], [139, "arguments"], [140, "arguments"], [141, "arguments"], [142, "arguments"], [143, "arguments"], [144, "arguments"], [145, "arguments"], [146, "arguments"], [147, "arguments"], [150, "arguments"], [151, "arguments"], [152, "arguments"], [153, "arguments"]], "Assault": [[26, null]], "Asterix": [[27, null]], "Asteroids": [[28, null]], "AsyncVectorEnv": [[11, null]], "Atari": [[21, null]], "Atlantis": [[29, null]], "Atlantis2": [[30, null]], "Attributes": [[2, "attributes"], [5, "attributes"], [10, "attributes"], [15, "attributes"]], "Autonomous Driving environments": [[148, "autonomous-driving-environments"], [148, "id2"]], "Available Action Wrappers": [[16, "available-action-wrappers"]], "Backgammon": [[31, null]], "Bank Heist": [[32, null]], "Base Class": [[16, "base-class"]], "Basic Math": [[33, null]], "Basic Usage": [[157, null]], "Battle Zone": [[34, null]], "Beam Rider": [[35, null]], "Berzerk": [[36, null]], "Biological / Medical environments": [[148, "biological-medical-environments"]], "Bipedal Walker": [[127, null]], "Blackjack": [[37, null], [150, null]], "Bowling": [[38, null]], "Box2D": [[126, null]], "Boxing": [[39, null]], "Breakout": [[40, null]], "Build the Documentation": [[1, "build-the-documentation"]], "Building an agent": [[163, "building-an-agent"], [175, "building-an-agent"], [177, "building-an-agent"]], "Car Racing": [[128, null]], "Carnival": [[41, null]], "Cart Pole": [[132, null]], "Casino": [[42, null]], "Centipede": [[43, null]], "Chopper Command": [[44, null]], "Classic Control": [[130, null]], "Cliff Walking": [[151, null]], "Close": [[165, "close"]], "Common Wrappers": [[17, "common-wrappers"]], "Comparing training performance across versions": [[136, "comparing-training-performance-across-versions"]], "Compatibility with Gym": [[159, null]], "Composite Spaces": [[5, "composite-spaces"], [6, null]], "Constructing Observations": [[158, "constructing-observations"]], "Constructing Observations From Environment States": [[165, "constructing-observations-from-environment-states"]], "Converting Jax-based Functional environments to standard Env": [[3, "converting-jax-based-functional-environments-to-standard-env"]], "Core variables": [[4, "core-variables"]], "Crazy Climber": [[45, null]], "Create a Custom Environment": [[158, null]], "Creating Environment Instances": [[165, "creating-environment-instances"]], "Creating a Package": [[165, "creating-a-package"]], "Creating environments": [[2, "creating-environments"]], "Creating the Q-table": [[173, "creating-the-q-table"]], "Credits": [[127, "credits"], [128, "credits"], [129, "credits"]], "Crossbow": [[46, null]], "Darkchambers": [[47, null]], "Data Conversion Wrappers": [[17, "data-conversion-wrappers"]], "Declaration and Initialization": [[165, "declaration-and-initialization"]], "Defender": [[48, null]], "Demon Attack": [[49, null]], "Dependencies": [[173, "dependencies"]], "Description": [[127, "description"], [128, "description"], [129, "description"], [131, "description"], [132, "description"], [133, "description"], [134, "description"], [135, "description"], [137, "description"], [138, "description"], [139, "description"], [140, "description"], [141, "description"], [142, "description"], [143, "description"], [144, "description"], [145, "description"], [146, "description"], [147, "description"], [150, "description"], [151, "description"], [152, "description"], [153, "description"]], "Domain Randomization": [[170, "domain-randomization"]], "Donkey Kong": [[50, null]], "Double Dunk": [[51, null]], "Earthworld": [[52, null]], "Economic / Financial environments": [[148, "economic-financial-environments"]], "Editing an environment page": [[1, "editing-an-environment-page"]], "Electrical / Energy environments": [[148, "electrical-energy-environments"]], "Elevator Action": [[53, null]], "Enduro": [[54, null]], "Entombed": [[55, null]], "Env": [[2, null]], "Environment Checking": [[9, "environment-checking"]], "Environment Render": [[160, "environment-render"]], "Environment Reset": [[160, "environment-reset"]], "Environment Step": [[160, "environment-step"]], "Environment __init__": [[158, "environment-init"]], "Environment pickling": [[9, "environment-pickling"]], "Epilogue": [[169, "epilogue"]], "Episode End": [[131, "episode-end"], [132, "episode-end"], [133, "episode-end"], [134, "episode-end"], [137, "episode-end"], [138, "episode-end"], [139, "episode-end"], [140, "episode-end"], [141, "episode-end"], [142, "episode-end"], [143, "episode-end"], [144, "episode-end"], [145, "episode-end"], [146, "episode-end"], [147, "episode-end"], [150, "episode-end"], [151, "episode-end"], [152, "episode-end"], [153, "episode-end"]], "Episode Termination": [[127, "episode-termination"], [128, "episode-termination"], [129, "episode-termination"]], "Episode Truncation": [[135, "episode-truncation"]], "Et": [[56, null]], "Exact reproducibility": [[136, "exact-reproducibility"]], "Example code for v0.21": [[160, "example-code-for-v0-21"]], "Example code for v0.26 and later, including v1.0.0": [[160, "example-code-for-v0-26-and-later-including-v1-0-0"]], "Executing an action": [[163, "executing-an-action"], [175, "executing-an-action"]], "Explaining the code": [[157, "explaining-the-code"]], "External Environments": [[148, null]], "First-Party Environments": [[148, "first-party-environments"]], "Fishing Derby": [[57, null]], "Flag Capture": [[58, null]], "Freeway": [[59, null]], "Frogger": [[60, null]], "Frostbite": [[61, null]], "Frozen Lake": [[152, null]], "Frozenlake benchmark": [[173, null]], "Functional Env": [[3, null]], "Fundamental Spaces": [[5, "fundamental-spaces"], [7, null]], "Galaxian": [[62, null]], "Game environments": [[148, "game-environments"]], "Generate your environment": [[165, "generate-your-environment"]], "Gopher": [[63, null]], "Gravitar": [[64, null]], "Gym Release Notes": [[154, null]], "Gym v0.21 Environment Compatibility": [[159, "gym-v0-21-environment-compatibility"]], "Gymnasium Basics": [[164, null], [168, null], [171, "gymnasium-basics"]], "Gymnasium Release Notes": [[155, null]], "Gymnasium-docs": [[1, null]], "Gymnasium.vector.VectorEnv": [[10, "gymnasium-vector-vectorenv"]], "Half Cheetah": [[138, null]], "Handling Time Limits": [[166, null]], "Hangman": [[65, null]], "Haunted House": [[66, null]], "Hero": [[67, null]], "Hopper": [[139, null]], "Human Cannonball": [[68, null]], "Humanoid": [[140, null]], "Humanoid Standup": [[141, null]], "Ice Hockey": [[69, null]], "Implemented Action wrappers": [[14, "implemented-action-wrappers"]], "Implemented Data Conversion wrappers": [[14, "implemented-data-conversion-wrappers"]], "Implemented Observation wrappers": [[14, "implemented-observation-wrappers"]], "Implemented Reward wrappers": [[14, "implemented-reward-wrappers"]], "Implemented Wrappers": [[18, "implemented-wrappers"], [19, "implemented-wrappers"]], "Implementing Custom Wrappers": [[167, null]], "Implementing environments": [[2, "implementing-environments"]], "Importance in learning code": [[166, "importance-in-learning-code"]], "Imports and Environment Setup": [[175, "imports-and-environment-setup"]], "Information": [[150, "information"], [151, "information"], [152, "information"], [153, "information"]], "Inheriting from gymnasium.ActionWrapper": [[167, "inheriting-from-gymnasium-actionwrapper"]], "Inheriting from gymnasium.ObservationWrapper": [[167, "inheriting-from-gymnasium-observationwrapper"]], "Inheriting from gymnasium.RewardWrapper": [[167, "inheriting-from-gymnasium-rewardwrapper"]], "Inheriting from gymnasium.Wrapper": [[167, "inheriting-from-gymnasium-wrapper"]], "Initializing Environments": [[157, "initializing-environments"]], "Instructions for modifying environment pages": [[1, "instructions-for-modifying-environment-pages"]], "Interacting with the Environment": [[157, "interacting-with-the-environment"]], "Introduction": [[170, "introduction"]], "Inverted Double Pendulum": [[142, null]], "Inverted Pendulum": [[143, null]], "Jamesbond": [[70, null]], "Journey Escape": [[71, null]], "Kaboom": [[72, null]], "Kangaroo": [[73, null]], "Keystone Kapers": [[74, null]], "King Kong": [[75, null]], "Klax": [[76, null]], "Koolaid": [[77, null]], "Krull": [[78, null]], "Kung Fu Master": [[79, null]], "Laser Gates": [[80, null]], "List of Wrappers": [[20, null]], "Load custom quadruped robot environments": [[169, null]], "Loading OpenAI Gym environments": [[159, "loading-openai-gym-environments"]], "Lost Luggage": [[81, null]], "Lunar Lander": [[129, null]], "Make and register": [[4, null]], "Make your own custom environment": [[165, null]], "Making Vector Environments": [[10, "making-vector-environments"]], "Map size: 11 \\times 11": [[173, "map-size-11-times-11"]], "Map size: 4 \\times 4": [[173, "map-size-4-times-4"]], "Map size: 7 \\times 7": [[173, "map-size-7-times-7"]], "Map size: 9 \\times 9": [[173, "map-size-9-times-9"]], "Mario Bros": [[82, null]], "Mathematics / Computational": [[148, "mathematics-computational"]], "Methods": [[2, "methods"], [5, "methods"], [10, "methods"], [15, "methods"]], "Migration Guide - v0.21 to v1.0.0": [[160, null]], "Miniature Golf": [[83, null]], "Misc Wrappers": [[17, null]], "Miscellaneous": [[13, "miscellaneous"]], "Modifying the environment": [[157, "modifying-the-environment"]], "Montezuma Revenge": [[84, null]], "More information": [[157, "more-information"], [161, "more-information"]], "Mountain Car": [[133, null]], "Mountain Car Continuous": [[134, null]], "Mr Do": [[85, null]], "Ms Pacman": [[86, null]], "MuJoCo": [[136, null]], "Name This Game": [[87, null]], "Notes": [[129, "notes"], [146, "notes"]], "Notice": [[170, "notice"]], "Observation Space": [[127, "observation-space"], [128, "observation-space"], [129, "observation-space"], [131, "observation-space"], [132, "observation-space"], [133, "observation-space"], [134, "observation-space"], [135, "observation-space"], [137, "observation-space"], [138, "observation-space"], [139, "observation-space"], [140, "observation-space"], [141, "observation-space"], [142, "observation-space"], [143, "observation-space"], [144, "observation-space"], [145, "observation-space"], [146, "observation-space"], [147, "observation-space"], [150, "observation-space"], [151, "observation-space"], [152, "observation-space"], [153, "observation-space"]], "Observation Wrappers": [[18, null]], "Observing the environment": [[175, "observing-the-environment"]], "Old to New Step API Compatibility": [[9, "old-to-new-step-api-compatibility"]], "Optimizing training": [[162, "optimizing-training"]], "Othello": [[88, null]], "Other": [[148, "other"]], "Other environments": [[148, "other-environments"]], "Other steps": [[1, "other-steps"]], "Pacman": [[89, null]], "Parameters we\u2019ll use": [[173, "parameters-we-ll-use"]], "Pendulum": [[135, null]], "Performance Analysis of Synchronous and Asynchronous Vectorized Environments": [[170, "performance-analysis-of-synchronous-and-asynchronous-vectorized-environments"]], "Phoenix": [[90, null]], "Pitfall": [[91, null]], "Pitfall2": [[92, null]], "Plot learning curve": [[177, "plot-learning-curve"]], "Plotting": [[170, "plotting"]], "Policy Network": [[177, "policy-network"]], "Pong": [[93, null]], "Pooyan": [[94, null]], "Private Eye": [[95, null]], "Pusher": [[144, null]], "Qbert": [[96, null]], "Reacher": [[145, null]], "Recommended solution": [[165, "recommended-solution"]], "Recording Agents": [[161, null]], "Recording Every Episode": [[161, "recording-every-episode"]], "Recording the Agent during Training": [[161, "recording-the-agent-during-training"]], "References": [[128, "references"], [131, "references"], [150, "references"], [151, "references"], [153, "references"], [170, "references"], [173, "references"], [177, "references"]], "Registering Envs": [[165, "registering-envs"]], "Registering and making the environment": [[158, "registering-and-making-the-environment"]], "Removed code": [[160, "removed-code"]], "Rendering": [[165, "rendering"]], "Rendering Arguments": [[136, "rendering-arguments"]], "Rendering Backend": [[136, "rendering-backend"]], "Reset": [[165, "reset"]], "Reset Arguments": [[128, "reset-arguments"]], "Reset function": [[158, "reset-function"]], "Reward": [[134, "reward"], [151, "reward"]], "Reward Wrappers": [[19, null]], "Reward:": [[133, "reward"]], "Rewards": [[127, "rewards"], [128, "rewards"], [129, "rewards"], [131, "rewards"], [132, "rewards"], [135, "rewards"], [137, "rewards"], [138, "rewards"], [139, "rewards"], [140, "rewards"], [141, "rewards"], [142, "rewards"], [143, "rewards"], [144, "rewards"], [145, "rewards"], [146, "rewards"], [147, "rewards"], [150, "rewards"], [152, "rewards"], [153, "rewards"]], "Riverraid": [[97, null]], "Road Runner": [[98, null]], "Robotank": [[99, null]], "Robotics environments": [[148, "robotics-environments"], [148, "id1"]], "Running the environment": [[173, "running-the-environment"]], "Runtime Performance benchmark": [[9, "runtime-performance-benchmark"]], "Save Rendering Videos": [[9, "save-rendering-videos"]], "Saving/ Loading Weights": [[170, "saving-loading-weights"]], "Seaquest": [[100, null]], "Seed and random number generator": [[160, "seed-and-random-number-generator"]], "Seeding": [[9, "seeding"]], "Setup": [[165, "setup"], [169, "setup"], [170, "setup"]], "Shared Memory for a Space": [[13, "shared-memory-for-a-space"]], "Showcase the Agent": [[170, "showcase-the-agent"]], "Sir Lancelot": [[101, null]], "Skiing": [[102, null]], "Solaris": [[103, null]], "Solution": [[166, "solution"]], "Solving Blackjack with Q-Learning": [[175, null]], "Space Invaders": [[104, null]], "Space War": [[105, null]], "Spaces": [[5, null]], "Spaces Utils": [[8, null]], "Speeding Up Training": [[162, null]], "Star Gunner": [[106, null]], "Starting State": [[127, "starting-state"], [128, "starting-state"], [129, "starting-state"], [131, "starting-state"], [132, "starting-state"], [133, "starting-state"], [134, "starting-state"], [135, "starting-state"], [137, "starting-state"], [138, "starting-state"], [139, "starting-state"], [140, "starting-state"], [141, "starting-state"], [142, "starting-state"], [143, "starting-state"], [144, "starting-state"], [145, "starting-state"], [146, "starting-state"], [147, "starting-state"], [150, "starting-state"], [151, "starting-state"], [152, "starting-state"], [153, "starting-state"]], "Step": [[165, "step"]], "Step 0.1 - Download a Robot Model": [[169, "step-0-1-download-a-robot-model"]], "Step 1 - Load the model": [[169, "step-1-load-the-model"]], "Step 2 - Tweaking the Environment Parameters": [[169, "step-2-tweaking-the-environment-parameters"]], "Step 2.1 - Tweaking the Environment Simulation Parameters": [[169, "step-2-1-tweaking-the-environment-simulation-parameters"]], "Step 2.2 - Tweaking the Environment Termination Parameters": [[169, "step-2-2-tweaking-the-environment-termination-parameters"]], "Step 2.3 - Tweaking the Environment Reward Parameters": [[169, "step-2-3-tweaking-the-environment-reward-parameters"]], "Step 2.4 - Tweaking the Environment Observation Parameters": [[169, "step-2-4-tweaking-the-environment-observation-parameters"]], "Step 3 - Train your Agent": [[169, "step-3-train-your-agent"]], "Step API Compatibility": [[159, "step-api-compatibility"]], "Step function": [[158, "step-function"]], "Subclassing gymnasium.Env": [[165, "subclassing-gymnasium-env"]], "Superman": [[107, null]], "Surround": [[108, null]], "Swimmer": [[146, null]], "SyncVectorEnv": [[12, null]], "Taxi": [[153, null]], "Telecommunication Systems environments": [[148, "telecommunication-systems-environments"]], "Tennis": [[109, null]], "Termination": [[137, "termination"], [138, "termination"], [139, "termination"], [140, "termination"], [141, "termination"], [142, "termination"], [143, "termination"], [144, "termination"], [145, "termination"], [146, "termination"], [147, "termination"], [166, "termination"]], "Tetris": [[110, null]], "The FrozenLake environment": [[173, "the-frozenlake-environment"]], "The requested page could not be found.": [[0, "the-requested-page-could-not-be-found"]], "Think you can do better?": [[175, "think-you-can-do-better"]], "Third-Party Environments using Gym": [[148, "third-party-environments-using-gym"]], "Third-Party Tutorials": [[172, null]], "Third-Party tutorials about gymnasium": [[172, "third-party-tutorials-about-gymnasium"]], "Third-Party tutorials about libraries that use gymnasium": [[172, "third-party-tutorials-about-libraries-that-use-gymnasium"]], "Third-party environments with Gymnasium": [[148, "third-party-environments-with-gymnasium"]], "Tic Tac Toe 3D": [[111, null]], "Time Pilot": [[112, null]], "TimeLimit Wrapper": [[160, "timelimit-wrapper"]], "Toy Text": [[149, null]], "Training A2C with Vector Envs and Domain Randomization": [[170, null]], "Training Agents": [[171, "training-agents"], [174, null], [176, null]], "Training an Agent": [[163, null]], "Training the A2C Agent": [[170, "training-the-a2c-agent"]], "Training the agent": [[163, "training-the-agent"]], "Training using REINFORCE for Mujoco": [[177, null]], "Transition Dynamics:": [[133, "transition-dynamics"], [134, "transition-dynamics"]], "Trondead": [[113, null]], "Truncation": [[137, "truncation"], [138, "truncation"], [139, "truncation"], [140, "truncation"], [141, "truncation"], [142, "truncation"], [143, "truncation"], [144, "truncation"], [145, "truncation"], [146, "truncation"], [147, "truncation"], [166, "truncation"]], "Try playing the environment yourself": [[170, "try-playing-the-environment-yourself"]], "Turmoil": [[114, null]], "Tutankham": [[115, null]], "Tutorials": [[171, null]], "Uncommon Wrappers": [[17, "uncommon-wrappers"]], "Up N Down": [[116, null]], "Using Vectorized Environments": [[170, "using-vectorized-environments"]], "Using Wrappers": [[158, "using-wrappers"], [165, "using-wrappers"]], "Utility functions": [[5, "utility-functions"], [9, null], [13, null]], "Vector Only wrappers": [[14, "vector-only-wrappers"]], "Vector Utility functions": [[5, "vector-utility-functions"]], "Vector only Wrappers": [[20, "vector-only-wrappers"]], "Vectorize": [[10, null]], "Vectorized Common wrappers": [[14, "vectorized-common-wrappers"]], "Vectorized environment": [[132, "vectorized-environment"]], "Vectorized environments": [[162, "vectorized-environments"]], "Vectorizing Spaces": [[13, "vectorizing-spaces"]], "Venture": [[117, null]], "Version History": [[127, "version-history"], [128, "version-history"], [129, "version-history"], [131, "version-history"], [132, "version-history"], [133, "version-history"], [134, "version-history"], [135, "version-history"], [137, "version-history"], [138, "version-history"], [139, "version-history"], [140, "version-history"], [141, "version-history"], [142, "version-history"], [143, "version-history"], [144, "version-history"], [145, "version-history"], [146, "version-history"], [147, "version-history"], [150, "version-history"], [151, "version-history"], [152, "version-history"], [153, "version-history"]], "Versions": [[136, "versions"]], "Video Checkers": [[118, null]], "Video Chess": [[119, null]], "Video Cube": [[120, null]], "Video Game environments": [[148, "video-game-environments"]], "Video Pinball": [[121, null]], "Visualising the policy": [[163, "visualising-the-policy"], [175, "visualising-the-policy"]], "Visualization": [[9, "visualization"], [173, "visualization"]], "Visualizing the training": [[175, "visualizing-the-training"]], "Walker2D": [[147, null]], "Wizard Of Wor": [[122, null]], "Word Zapper": [[123, null]], "Wrappers": [[14, null], [15, null]], "Writing Tutorials": [[1, "writing-tutorials"]], "Yars Revenge": [[124, null]], "Zaxxon": [[125, null]], "v0.20.0": [[154, "release-v0-20-0"]], "v0.21.0": [[154, "release-v0-21-0"]], "v0.26.0: Initial Release": [[155, "release-v0-26-0"]], "v0.26.1": [[155, "release-v0-26-1"]], "v0.26.2 ": [[155, "release-v0-26-2"]], "v0.26.3": [[155, "release-v0-26-3"]], "v0.27.0": [[155, "release-v0-27-0"]], "v0.27.1": [[155, "release-v0-27-1"]], "v0.28.0": [[155, "release-v0-28-0"]], "v0.28.1": [[155, "release-v0-28-1"]], "v0.29.0": [[155, "release-v0-29-0"]], "v0.29.1 ": [[155, "release-v0-29-1"]], "v0.7.3": [[154, "release-v0-7-3"]], "v0.7.4": [[154, "release-v0-7-4"]], "v0.9.5": [[154, "release-v0-9-5"]], "v0.9.6": [[154, "release-v0-9-6"]], "v1.0.0": [[155, "release-v1-0-0"]], "v1.0.0a1: v1.0.0 alpha1 ": [[155, "release-v1-0-0a1"]], "v1.0.0a2: v1.0.0 alpha 2": [[155, "release-v1-0-0a2"]]}, "docnames": ["404", "README", "api/env", "api/functional", "api/registry", "api/spaces", "api/spaces/composite", "api/spaces/fundamental", "api/spaces/utils", "api/utils", "api/vector", "api/vector/async_vector_env", "api/vector/sync_vector_env", "api/vector/utils", "api/vector/wrappers", "api/wrappers", "api/wrappers/action_wrappers", "api/wrappers/misc_wrappers", "api/wrappers/observation_wrappers", "api/wrappers/reward_wrappers", "api/wrappers/table", "environments/atari", "environments/atari/adventure", "environments/atari/air_raid", "environments/atari/alien", "environments/atari/amidar", "environments/atari/assault", "environments/atari/asterix", "environments/atari/asteroids", "environments/atari/atlantis", "environments/atari/atlantis2", "environments/atari/backgammon", "environments/atari/bank_heist", "environments/atari/basic_math", "environments/atari/battle_zone", "environments/atari/beam_rider", "environments/atari/berzerk", "environments/atari/blackjack", "environments/atari/bowling", "environments/atari/boxing", "environments/atari/breakout", "environments/atari/carnival", "environments/atari/casino", "environments/atari/centipede", "environments/atari/chopper_command", "environments/atari/crazy_climber", "environments/atari/crossbow", "environments/atari/darkchambers", "environments/atari/defender", "environments/atari/demon_attack", "environments/atari/donkey_kong", "environments/atari/double_dunk", "environments/atari/earthworld", "environments/atari/elevator_action", "environments/atari/enduro", "environments/atari/entombed", "environments/atari/et", "environments/atari/fishing_derby", "environments/atari/flag_capture", "environments/atari/freeway", "environments/atari/frogger", "environments/atari/frostbite", "environments/atari/galaxian", "environments/atari/gopher", "environments/atari/gravitar", "environments/atari/hangman", "environments/atari/haunted_house", "environments/atari/hero", "environments/atari/human_cannonball", "environments/atari/ice_hockey", "environments/atari/jamesbond", "environments/atari/journey_escape", "environments/atari/kaboom", "environments/atari/kangaroo", "environments/atari/keystone_kapers", "environments/atari/king_kong", "environments/atari/klax", "environments/atari/koolaid", "environments/atari/krull", "environments/atari/kung_fu_master", "environments/atari/laser_gates", "environments/atari/lost_luggage", "environments/atari/mario_bros", "environments/atari/miniature_golf", "environments/atari/montezuma_revenge", "environments/atari/mr_do", "environments/atari/ms_pacman", "environments/atari/name_this_game", "environments/atari/othello", "environments/atari/pacman", "environments/atari/phoenix", "environments/atari/pitfall", "environments/atari/pitfall2", "environments/atari/pong", "environments/atari/pooyan", "environments/atari/private_eye", "environments/atari/qbert", "environments/atari/riverraid", "environments/atari/road_runner", "environments/atari/robotank", "environments/atari/seaquest", "environments/atari/sir_lancelot", "environments/atari/skiing", "environments/atari/solaris", "environments/atari/space_invaders", "environments/atari/space_war", "environments/atari/star_gunner", "environments/atari/superman", "environments/atari/surround", "environments/atari/tennis", "environments/atari/tetris", "environments/atari/tic_tac_toe_3d", "environments/atari/time_pilot", "environments/atari/trondead", "environments/atari/turmoil", "environments/atari/tutankham", "environments/atari/up_n_down", "environments/atari/venture", "environments/atari/video_checkers", "environments/atari/video_chess", "environments/atari/video_cube", "environments/atari/video_pinball", "environments/atari/wizard_of_wor", "environments/atari/word_zapper", "environments/atari/yars_revenge", "environments/atari/zaxxon", "environments/box2d", "environments/box2d/bipedal_walker", "environments/box2d/car_racing", "environments/box2d/lunar_lander", "environments/classic_control", "environments/classic_control/acrobot", "environments/classic_control/cart_pole", "environments/classic_control/mountain_car", "environments/classic_control/mountain_car_continuous", "environments/classic_control/pendulum", "environments/mujoco", "environments/mujoco/ant", "environments/mujoco/half_cheetah", "environments/mujoco/hopper", "environments/mujoco/humanoid", "environments/mujoco/humanoid_standup", "environments/mujoco/inverted_double_pendulum", "environments/mujoco/inverted_pendulum", "environments/mujoco/pusher", "environments/mujoco/reacher", "environments/mujoco/swimmer", "environments/mujoco/walker2d", "environments/third_party_environments", "environments/toy_text", "environments/toy_text/blackjack", "environments/toy_text/cliff_walking", "environments/toy_text/frozen_lake", "environments/toy_text/taxi", "gym_release_notes/index", "gymnasium_release_notes/index", "index", "introduction/basic_usage", "introduction/create_custom_env", "introduction/gym_compatibility", "introduction/migration_guide", "introduction/record_agent", "introduction/speed_up_env", "introduction/train_agent", "tutorials/gymnasium_basics/README", "tutorials/gymnasium_basics/environment_creation", "tutorials/gymnasium_basics/handling_time_limits", "tutorials/gymnasium_basics/implementing_custom_wrappers", "tutorials/gymnasium_basics/index", "tutorials/gymnasium_basics/load_quadruped_model", "tutorials/gymnasium_basics/vector_envs_tutorial", "tutorials/index", "tutorials/third-party-tutorials", "tutorials/training_agents/FrozenLake_tuto", "tutorials/training_agents/README", "tutorials/training_agents/blackjack_tutorial", "tutorials/training_agents/index", "tutorials/training_agents/reinforce_invpend_gym_v26"], "envversion": {"sphinx": 62, "sphinx.domains.c": 3, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 9, "sphinx.domains.index": 1, "sphinx.domains.javascript": 3, "sphinx.domains.math": 2, "sphinx.domains.python": 4, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx.ext.viewcode": 1}, "filenames": ["404.md", "README.md", "api/env.md", "api/functional.md", "api/registry.md", "api/spaces.md", "api/spaces/composite.md", "api/spaces/fundamental.md", "api/spaces/utils.md", "api/utils.md", "api/vector.md", "api/vector/async_vector_env.md", "api/vector/sync_vector_env.md", "api/vector/utils.md", "api/vector/wrappers.md", "api/wrappers.md", "api/wrappers/action_wrappers.md", "api/wrappers/misc_wrappers.md", "api/wrappers/observation_wrappers.md", "api/wrappers/reward_wrappers.md", "api/wrappers/table.md", "environments/atari.md", "environments/atari/adventure.md", "environments/atari/air_raid.md", "environments/atari/alien.md", "environments/atari/amidar.md", "environments/atari/assault.md", "environments/atari/asterix.md", "environments/atari/asteroids.md", "environments/atari/atlantis.md", "environments/atari/atlantis2.md", "environments/atari/backgammon.md", "environments/atari/bank_heist.md", "environments/atari/basic_math.md", "environments/atari/battle_zone.md", "environments/atari/beam_rider.md", "environments/atari/berzerk.md", "environments/atari/blackjack.md", "environments/atari/bowling.md", "environments/atari/boxing.md", "environments/atari/breakout.md", "environments/atari/carnival.md", "environments/atari/casino.md", "environments/atari/centipede.md", "environments/atari/chopper_command.md", "environments/atari/crazy_climber.md", "environments/atari/crossbow.md", "environments/atari/darkchambers.md", "environments/atari/defender.md", "environments/atari/demon_attack.md", "environments/atari/donkey_kong.md", "environments/atari/double_dunk.md", "environments/atari/earthworld.md", "environments/atari/elevator_action.md", "environments/atari/enduro.md", "environments/atari/entombed.md", "environments/atari/et.md", "environments/atari/fishing_derby.md", "environments/atari/flag_capture.md", "environments/atari/freeway.md", "environments/atari/frogger.md", "environments/atari/frostbite.md", "environments/atari/galaxian.md", "environments/atari/gopher.md", "environments/atari/gravitar.md", "environments/atari/hangman.md", "environments/atari/haunted_house.md", "environments/atari/hero.md", "environments/atari/human_cannonball.md", "environments/atari/ice_hockey.md", "environments/atari/jamesbond.md", "environments/atari/journey_escape.md", "environments/atari/kaboom.md", "environments/atari/kangaroo.md", "environments/atari/keystone_kapers.md", "environments/atari/king_kong.md", "environments/atari/klax.md", "environments/atari/koolaid.md", "environments/atari/krull.md", "environments/atari/kung_fu_master.md", "environments/atari/laser_gates.md", "environments/atari/lost_luggage.md", "environments/atari/mario_bros.md", "environments/atari/miniature_golf.md", "environments/atari/montezuma_revenge.md", "environments/atari/mr_do.md", "environments/atari/ms_pacman.md", "environments/atari/name_this_game.md", "environments/atari/othello.md", "environments/atari/pacman.md", "environments/atari/phoenix.md", "environments/atari/pitfall.md", "environments/atari/pitfall2.md", "environments/atari/pong.md", "environments/atari/pooyan.md", "environments/atari/private_eye.md", "environments/atari/qbert.md", "environments/atari/riverraid.md", "environments/atari/road_runner.md", "environments/atari/robotank.md", "environments/atari/seaquest.md", "environments/atari/sir_lancelot.md", "environments/atari/skiing.md", "environments/atari/solaris.md", "environments/atari/space_invaders.md", "environments/atari/space_war.md", "environments/atari/star_gunner.md", "environments/atari/superman.md", "environments/atari/surround.md", "environments/atari/tennis.md", "environments/atari/tetris.md", "environments/atari/tic_tac_toe_3d.md", "environments/atari/time_pilot.md", "environments/atari/trondead.md", "environments/atari/turmoil.md", "environments/atari/tutankham.md", "environments/atari/up_n_down.md", "environments/atari/venture.md", "environments/atari/video_checkers.md", "environments/atari/video_chess.md", "environments/atari/video_cube.md", "environments/atari/video_pinball.md", "environments/atari/wizard_of_wor.md", "environments/atari/word_zapper.md", "environments/atari/yars_revenge.md", "environments/atari/zaxxon.md", "environments/box2d.md", "environments/box2d/bipedal_walker.md", "environments/box2d/car_racing.md", "environments/box2d/lunar_lander.md", "environments/classic_control.md", "environments/classic_control/acrobot.md", "environments/classic_control/cart_pole.md", "environments/classic_control/mountain_car.md", "environments/classic_control/mountain_car_continuous.md", "environments/classic_control/pendulum.md", "environments/mujoco.md", "environments/mujoco/ant.md", "environments/mujoco/half_cheetah.md", "environments/mujoco/hopper.md", "environments/mujoco/humanoid.md", "environments/mujoco/humanoid_standup.md", "environments/mujoco/inverted_double_pendulum.md", "environments/mujoco/inverted_pendulum.md", "environments/mujoco/pusher.md", "environments/mujoco/reacher.md", "environments/mujoco/swimmer.md", "environments/mujoco/walker2d.md", "environments/third_party_environments.md", "environments/toy_text.md", "environments/toy_text/blackjack.md", "environments/toy_text/cliff_walking.md", "environments/toy_text/frozen_lake.md", "environments/toy_text/taxi.md", "gym_release_notes/index.md", "gymnasium_release_notes/index.md", "index.md", "introduction/basic_usage.md", "introduction/create_custom_env.md", "introduction/gym_compatibility.md", "introduction/migration_guide.md", "introduction/record_agent.md", "introduction/speed_up_env.md", "introduction/train_agent.md", "tutorials/gymnasium_basics/README.rst", "tutorials/gymnasium_basics/environment_creation.rst", "tutorials/gymnasium_basics/handling_time_limits.rst", "tutorials/gymnasium_basics/implementing_custom_wrappers.rst", "tutorials/gymnasium_basics/index.rst", "tutorials/gymnasium_basics/load_quadruped_model.md", "tutorials/gymnasium_basics/vector_envs_tutorial.rst", "tutorials/index.rst", "tutorials/third-party-tutorials.md", "tutorials/training_agents/FrozenLake_tuto.rst", "tutorials/training_agents/README.rst", "tutorials/training_agents/blackjack_tutorial.rst", "tutorials/training_agents/index.rst", "tutorials/training_agents/reinforce_invpend_gym_v26.rst"], "indexentries": {"action() (gymnasium.actionwrapper method)": [[16, "gymnasium.ActionWrapper.action", false]], "action_space (gymnasium.env attribute)": [[2, "gymnasium.Env.action_space", false]], "action_space (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.action_space", false]], "action_space (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.action_space", false]], "actions() (gymnasium.vector.vectoractionwrapper method)": [[14, "gymnasium.vector.VectorActionWrapper.actions", false]], "actionwrapper (class in gymnasium)": [[16, "gymnasium.ActionWrapper", false]], "addrenderobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.AddRenderObservation", false]], "asyncvectorenv (class in gymnasium.vector)": [[11, "gymnasium.vector.AsyncVectorEnv", false]], "ataripreprocessing (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.AtariPreprocessing", false]], "autoreset (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.Autoreset", false]], "batch_space() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.batch_space", false]], "benchmark_init() (in module gymnasium.utils.performance)": [[9, "gymnasium.utils.performance.benchmark_init", false]], "benchmark_render() (in module gymnasium.utils.performance)": [[9, "gymnasium.utils.performance.benchmark_render", false]], "benchmark_step() (in module gymnasium.utils.performance)": [[9, "gymnasium.utils.performance.benchmark_step", false]], "box (class in gymnasium.spaces)": [[7, "gymnasium.spaces.Box", false]], "call() (gymnasium.vector.asyncvectorenv method)": [[11, "gymnasium.vector.AsyncVectorEnv.call", false]], "call() (gymnasium.vector.syncvectorenv method)": [[12, "gymnasium.vector.SyncVectorEnv.call", false]], "callback() (gymnasium.utils.play.playplot method)": [[9, "gymnasium.utils.play.PlayPlot.callback", false]], "capped_cubic_video_schedule() (in module gymnasium.utils.save_video)": [[9, "gymnasium.utils.save_video.capped_cubic_video_schedule", false]], "check_env() (in module gymnasium.utils.env_checker)": [[9, "gymnasium.utils.env_checker.check_env", false]], "clear_mpi_env_vars() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.clear_mpi_env_vars", false]], "clipaction (class in gymnasium.wrappers)": [[16, "gymnasium.wrappers.ClipAction", false]], "clipaction (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.ClipAction", false]], "clipreward (class in gymnasium.wrappers)": [[19, "gymnasium.wrappers.ClipReward", false]], "clipreward (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.ClipReward", false]], "close() (gymnasium.env method)": [[2, "gymnasium.Env.close", false]], "close() (gymnasium.vector.asyncvectorenv method)": [[11, "gymnasium.vector.AsyncVectorEnv.close", false]], "close() (gymnasium.vector.syncvectorenv method)": [[12, "gymnasium.vector.SyncVectorEnv.close", false]], "close() (gymnasium.vector.vectorenv method)": [[10, "gymnasium.vector.VectorEnv.close", false]], "close() (gymnasium.vector.vectorwrapper method)": [[14, "gymnasium.vector.VectorWrapper.close", false]], "close() (gymnasium.wrapper method)": [[15, "gymnasium.Wrapper.close", false]], "closed (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.closed", false]], "cloudpicklewrapper() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.CloudpickleWrapper", false]], "concatenate() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.concatenate", false]], "contains() (gymnasium.spaces.space method)": [[5, "gymnasium.spaces.Space.contains", false]], "convert_to_done_step_api() (in module gymnasium.utils.step_api_compatibility)": [[9, "gymnasium.utils.step_api_compatibility.convert_to_done_step_api", false]], "convert_to_terminated_truncated_step_api() (in module gymnasium.utils.step_api_compatibility)": [[9, "gymnasium.utils.step_api_compatibility.convert_to_terminated_truncated_step_api", false]], "create_empty_array() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.create_empty_array", false]], "create_shared_memory() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.create_shared_memory", false]], "current_namespace (gymnasium.envs.registration attribute)": [[4, "gymnasium.envs.registration.current_namespace", false]], "delayobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.DelayObservation", false]], "dict (class in gymnasium.spaces)": [[6, "gymnasium.spaces.Dict", false]], "dictinfotolist (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.DictInfoToList", false]], "discrete (class in gymnasium.spaces)": [[7, "gymnasium.spaces.Discrete", false]], "dtype (gymnasium.spaces.space property)": [[5, "gymnasium.spaces.Space.dtype", false]], "dtypeobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.DtypeObservation", false]], "dtypeobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.DtypeObservation", false]], "env (class in gymnasium)": [[2, "gymnasium.Env", false]], "env (gymnasium.wrapper attribute)": [[15, "gymnasium.Wrapper.env", false]], "envspec (class in gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.EnvSpec", false]], "ezpickle (class in gymnasium.utils.ezpickle)": [[9, "gymnasium.utils.ezpickle.EzPickle", false]], "filterobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.FilterObservation", false]], "filterobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.FilterObservation", false]], "find_highest_version() (in module gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.find_highest_version", false]], "flatdim() (in module gymnasium.spaces.utils)": [[8, "gymnasium.spaces.utils.flatdim", false]], "flatten() (in module gymnasium.spaces.utils)": [[8, "gymnasium.spaces.utils.flatten", false]], "flatten_space() (in module gymnasium.spaces.utils)": [[8, "gymnasium.spaces.utils.flatten_space", false]], "flattenobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.FlattenObservation", false]], "flattenobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.FlattenObservation", false]], "framestackobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.FrameStackObservation", false]], "from_jsonable() (gymnasium.spaces.space method)": [[5, "gymnasium.spaces.Space.from_jsonable", false]], "funcenv (class in gymnasium.experimental.functional)": [[3, "gymnasium.experimental.functional.FuncEnv", false]], "functionaljaxenv (class in gymnasium.envs.functional_jax_env)": [[3, "gymnasium.envs.functional_jax_env.FunctionalJaxEnv", false]], "get_attr() (gymnasium.vector.asyncvectorenv method)": [[11, "gymnasium.vector.AsyncVectorEnv.get_attr", false]], "get_attr() (gymnasium.vector.syncvectorenv method)": [[12, "gymnasium.vector.SyncVectorEnv.get_attr", false]], "get_env_id() (in module gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.get_env_id", false]], "get_wrapper_attr() (gymnasium.wrapper method)": [[15, "gymnasium.Wrapper.get_wrapper_attr", false]], "graph (class in gymnasium.spaces)": [[6, "gymnasium.spaces.Graph", false]], "grayscaleobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.GrayscaleObservation", false]], "grayscaleobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.GrayscaleObservation", false]], "gymnasium.spaces": [[5, "module-gymnasium.spaces", false]], "gymnasium.wrappers": [[15, "module-gymnasium.wrappers", false]], "humanrendering (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.HumanRendering", false]], "initial() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.initial", false]], "is_bounded() (gymnasium.spaces.box method)": [[7, "gymnasium.spaces.Box.is_bounded", false]], "is_np_flattenable (gymnasium.spaces.space property)": [[5, "gymnasium.spaces.Space.is_np_flattenable", false]], "iterate() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.iterate", false]], "jaxtonumpy (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.JaxToNumpy", false]], "jaxtonumpy (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.JaxToNumpy", false]], "jaxtotorch (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.JaxToTorch", false]], "jaxtotorch (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.JaxToTorch", false]], "load_env_creator() (in module gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.load_env_creator", false]], "make() (in module gymnasium)": [[4, "gymnasium.make", false]], "make_vec() (in module gymnasium)": [[4, "gymnasium.make_vec", false]], "maxandskipobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.MaxAndSkipObservation", false]], "metadata (gymnasium.env attribute)": [[2, "gymnasium.Env.metadata", false]], "metadata (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.metadata", false]], "metadata (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.metadata", false]], "module": [[5, "module-gymnasium.spaces", false], [15, "module-gymnasium.wrappers", false]], "multibinary (class in gymnasium.spaces)": [[7, "gymnasium.spaces.MultiBinary", false]], "multidiscrete (class in gymnasium.spaces)": [[7, "gymnasium.spaces.MultiDiscrete", false]], "namespace() (in module gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.namespace", false]], "normalizeobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.NormalizeObservation", false]], "normalizeobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.NormalizeObservation", false]], "normalizereward (class in gymnasium.wrappers)": [[19, "gymnasium.wrappers.NormalizeReward", false]], "normalizereward (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.NormalizeReward", false]], "np_random (gymnasium.env property)": [[2, "gymnasium.Env.np_random", false]], "np_random (gymnasium.spaces.space property)": [[5, "gymnasium.spaces.Space.np_random", false]], "np_random (gymnasium.vector.asyncvectorenv property)": [[11, "gymnasium.vector.AsyncVectorEnv.np_random", false]], "np_random (gymnasium.vector.syncvectorenv property)": [[12, "gymnasium.vector.SyncVectorEnv.np_random", false]], "np_random (gymnasium.vector.vectorenv property)": [[10, "gymnasium.vector.VectorEnv.np_random", false]], "np_random (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.np_random", false]], "np_random() (in module gymnasium.utils.seeding)": [[9, "gymnasium.utils.seeding.np_random", false]], "np_random_seed (gymnasium.env property)": [[2, "gymnasium.Env.np_random_seed", false]], "np_random_seed (gymnasium.vector.asyncvectorenv property)": [[11, "gymnasium.vector.AsyncVectorEnv.np_random_seed", false]], "np_random_seed (gymnasium.vector.syncvectorenv property)": [[12, "gymnasium.vector.SyncVectorEnv.np_random_seed", false]], "np_random_seed (gymnasium.vector.vectorenv property)": [[10, "gymnasium.vector.VectorEnv.np_random_seed", false]], "np_random_seed (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.np_random_seed", false]], "num_envs (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.num_envs", false]], "numpytotorch (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.NumpyToTorch", false]], "numpytotorch (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.NumpyToTorch", false]], "observation() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.observation", false]], "observation() (gymnasium.observationwrapper method)": [[18, "gymnasium.ObservationWrapper.observation", false]], "observation_space (gymnasium.env attribute)": [[2, "gymnasium.Env.observation_space", false]], "observation_space (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.observation_space", false]], "observation_space (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.observation_space", false]], "observations() (gymnasium.vector.vectorobservationwrapper method)": [[14, "gymnasium.vector.VectorObservationWrapper.observations", false]], "observationwrapper (class in gymnasium)": [[18, "gymnasium.ObservationWrapper", false]], "oneof (class in gymnasium.spaces)": [[6, "gymnasium.spaces.OneOf", false]], "orderenforcing (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.OrderEnforcing", false]], "parse_env_id() (in module gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.parse_env_id", false]], "passiveenvchecker (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.PassiveEnvChecker", false]], "play() (in module gymnasium.utils.play)": [[9, "gymnasium.utils.play.play", false]], "playablegame (class in gymnasium.utils.play)": [[9, "gymnasium.utils.play.PlayableGame", false]], "playplot (class in gymnasium.utils.play)": [[9, "gymnasium.utils.play.PlayPlot", false]], "pprint_registry() (in module gymnasium)": [[4, "gymnasium.pprint_registry", false]], "process_event() (gymnasium.utils.play.playablegame method)": [[9, "gymnasium.utils.play.PlayableGame.process_event", false]], "read_from_shared_memory() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.read_from_shared_memory", false]], "recordepisodestatistics (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.RecordEpisodeStatistics", false]], "recordepisodestatistics (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.RecordEpisodeStatistics", false]], "recordvideo (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.RecordVideo", false]], "register() (in module gymnasium)": [[4, "gymnasium.register", false]], "registry (gymnasium.envs.registration attribute)": [[4, "gymnasium.envs.registration.registry", false]], "render() (gymnasium.env method)": [[2, "gymnasium.Env.render", false]], "render() (gymnasium.envs.functional_jax_env.functionaljaxenv method)": [[3, "gymnasium.envs.functional_jax_env.FunctionalJaxEnv.render", false]], "render() (gymnasium.vector.vectorenv method)": [[10, "gymnasium.vector.VectorEnv.render", false]], "render() (gymnasium.vector.vectorwrapper method)": [[14, "gymnasium.vector.VectorWrapper.render", false]], "render() (gymnasium.wrapper method)": [[15, "gymnasium.Wrapper.render", false]], "render_close() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.render_close", false]], "render_image() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.render_image", false]], "render_init() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.render_init", false]], "render_mode (gymnasium.env attribute)": [[2, "gymnasium.Env.render_mode", false]], "render_mode (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.render_mode", false]], "rendercollection (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.RenderCollection", false]], "rescaleaction (class in gymnasium.wrappers)": [[16, "gymnasium.wrappers.RescaleAction", false]], "rescaleaction (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.RescaleAction", false]], "rescaleobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.RescaleObservation", false]], "rescaleobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.RescaleObservation", false]], "reset() (gymnasium.env method)": [[2, "gymnasium.Env.reset", false]], "reset() (gymnasium.envs.functional_jax_env.functionaljaxenv method)": [[3, "gymnasium.envs.functional_jax_env.FunctionalJaxEnv.reset", false]], "reset() (gymnasium.vector.asyncvectorenv method)": [[11, "gymnasium.vector.AsyncVectorEnv.reset", false]], "reset() (gymnasium.vector.syncvectorenv method)": [[12, "gymnasium.vector.SyncVectorEnv.reset", false]], "reset() (gymnasium.vector.vectorenv method)": [[10, "gymnasium.vector.VectorEnv.reset", false]], "reset() (gymnasium.vector.vectorwrapper method)": [[14, "gymnasium.vector.VectorWrapper.reset", false]], "reset() (gymnasium.wrapper method)": [[15, "gymnasium.Wrapper.reset", false]], "reshapeobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.ReshapeObservation", false]], "reshapeobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.ReshapeObservation", false]], "resizeobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.ResizeObservation", false]], "resizeobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.ResizeObservation", false]], "reward() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.reward", false]], "reward() (gymnasium.rewardwrapper method)": [[19, "gymnasium.RewardWrapper.reward", false]], "rewards() (gymnasium.vector.vectorrewardwrapper method)": [[14, "gymnasium.vector.VectorRewardWrapper.rewards", false]], "rewardwrapper (class in gymnasium)": [[19, "gymnasium.RewardWrapper", false]], "sample() (gymnasium.spaces.box method)": [[7, "gymnasium.spaces.Box.sample", false]], "sample() (gymnasium.spaces.dict method)": [[6, "gymnasium.spaces.Dict.sample", false]], "sample() (gymnasium.spaces.discrete method)": [[7, "gymnasium.spaces.Discrete.sample", false]], "sample() (gymnasium.spaces.graph method)": [[6, "gymnasium.spaces.Graph.sample", false]], "sample() (gymnasium.spaces.multibinary method)": [[7, "gymnasium.spaces.MultiBinary.sample", false]], "sample() (gymnasium.spaces.multidiscrete method)": [[7, "gymnasium.spaces.MultiDiscrete.sample", false]], "sample() (gymnasium.spaces.oneof method)": [[6, "gymnasium.spaces.OneOf.sample", false]], "sample() (gymnasium.spaces.sequence method)": [[6, "gymnasium.spaces.Sequence.sample", false]], "sample() (gymnasium.spaces.space method)": [[5, "gymnasium.spaces.Space.sample", false]], "sample() (gymnasium.spaces.text method)": [[7, "gymnasium.spaces.Text.sample", false]], "sample() (gymnasium.spaces.tuple method)": [[6, "gymnasium.spaces.Tuple.sample", false]], "save_video() (in module gymnasium.utils.save_video)": [[9, "gymnasium.utils.save_video.save_video", false]], "seed() (gymnasium.spaces.box method)": [[7, "gymnasium.spaces.Box.seed", false]], "seed() (gymnasium.spaces.dict method)": [[6, "gymnasium.spaces.Dict.seed", false]], "seed() (gymnasium.spaces.discrete method)": [[7, "gymnasium.spaces.Discrete.seed", false]], "seed() (gymnasium.spaces.graph method)": [[6, "gymnasium.spaces.Graph.seed", false]], "seed() (gymnasium.spaces.multibinary method)": [[7, "gymnasium.spaces.MultiBinary.seed", false]], "seed() (gymnasium.spaces.multidiscrete method)": [[7, "gymnasium.spaces.MultiDiscrete.seed", false]], "seed() (gymnasium.spaces.oneof method)": [[6, "gymnasium.spaces.OneOf.seed", false]], "seed() (gymnasium.spaces.sequence method)": [[6, "gymnasium.spaces.Sequence.seed", false]], "seed() (gymnasium.spaces.space method)": [[5, "gymnasium.spaces.Space.seed", false]], "seed() (gymnasium.spaces.text method)": [[7, "gymnasium.spaces.Text.seed", false]], "seed() (gymnasium.spaces.tuple method)": [[6, "gymnasium.spaces.Tuple.seed", false]], "sequence (class in gymnasium.spaces)": [[6, "gymnasium.spaces.Sequence", false]], "set_attr() (gymnasium.vector.asyncvectorenv method)": [[11, "gymnasium.vector.AsyncVectorEnv.set_attr", false]], "set_attr() (gymnasium.vector.syncvectorenv method)": [[12, "gymnasium.vector.SyncVectorEnv.set_attr", false]], "set_wrapper_attr() (gymnasium.wrapper method)": [[15, "gymnasium.Wrapper.set_wrapper_attr", false]], "shape (gymnasium.spaces.space property)": [[5, "gymnasium.spaces.Space.shape", false]], "single_action_space (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.single_action_space", false]], "single_observation_space (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.single_observation_space", false]], "space (class in gymnasium.spaces)": [[5, "gymnasium.spaces.Space", false]], "spec (gymnasium.env attribute)": [[2, "gymnasium.Env.spec", false]], "spec (gymnasium.vector.vectorenv attribute)": [[10, "gymnasium.vector.VectorEnv.spec", false]], "spec (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.spec", false]], "spec() (in module gymnasium)": [[4, "gymnasium.spec", false]], "state_info() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.state_info", false]], "step() (gymnasium.env method)": [[2, "gymnasium.Env.step", false]], "step() (gymnasium.envs.functional_jax_env.functionaljaxenv method)": [[3, "gymnasium.envs.functional_jax_env.FunctionalJaxEnv.step", false]], "step() (gymnasium.vector.asyncvectorenv method)": [[11, "gymnasium.vector.AsyncVectorEnv.step", false]], "step() (gymnasium.vector.syncvectorenv method)": [[12, "gymnasium.vector.SyncVectorEnv.step", false]], "step() (gymnasium.vector.vectorenv method)": [[10, "gymnasium.vector.VectorEnv.step", false]], "step() (gymnasium.vector.vectorwrapper method)": [[14, "gymnasium.vector.VectorWrapper.step", false]], "step() (gymnasium.wrapper method)": [[15, "gymnasium.Wrapper.step", false]], "step_api_compatibility() (in module gymnasium.utils.step_api_compatibility)": [[9, "gymnasium.utils.step_api_compatibility.step_api_compatibility", false]], "stickyaction (class in gymnasium.wrappers)": [[16, "gymnasium.wrappers.StickyAction", false]], "syncvectorenv (class in gymnasium.vector)": [[12, "gymnasium.vector.SyncVectorEnv", false]], "terminal() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.terminal", false]], "text (class in gymnasium.spaces)": [[7, "gymnasium.spaces.Text", false]], "timeawareobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.TimeAwareObservation", false]], "timelimit (class in gymnasium.wrappers)": [[17, "gymnasium.wrappers.TimeLimit", false]], "to_jsonable() (gymnasium.spaces.space method)": [[5, "gymnasium.spaces.Space.to_jsonable", false]], "transform() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.transform", false]], "transformaction (class in gymnasium.wrappers)": [[16, "gymnasium.wrappers.TransformAction", false]], "transformaction (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.TransformAction", false]], "transformobservation (class in gymnasium.wrappers)": [[18, "gymnasium.wrappers.TransformObservation", false]], "transformobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.TransformObservation", false]], "transformreward (class in gymnasium.wrappers)": [[19, "gymnasium.wrappers.TransformReward", false]], "transformreward (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.TransformReward", false]], "transition() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.transition", false]], "transition_info() (gymnasium.experimental.functional.funcenv method)": [[3, "gymnasium.experimental.functional.FuncEnv.transition_info", false]], "tuple (class in gymnasium.spaces)": [[6, "gymnasium.spaces.Tuple", false]], "unflatten() (in module gymnasium.spaces.utils)": [[8, "gymnasium.spaces.utils.unflatten", false]], "unwrapped (gymnasium.env property)": [[2, "gymnasium.Env.unwrapped", false]], "unwrapped (gymnasium.vector.vectorenv property)": [[10, "gymnasium.vector.VectorEnv.unwrapped", false]], "unwrapped (gymnasium.wrapper property)": [[15, "gymnasium.Wrapper.unwrapped", false]], "vectoractionwrapper (class in gymnasium.vector)": [[14, "gymnasium.vector.VectorActionWrapper", false]], "vectorenv (class in gymnasium.vector)": [[10, "gymnasium.vector.VectorEnv", false]], "vectorizetransformaction (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.VectorizeTransformAction", false]], "vectorizetransformobservation (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.VectorizeTransformObservation", false]], "vectorizetransformreward (class in gymnasium.wrappers.vector)": [[14, "gymnasium.wrappers.vector.VectorizeTransformReward", false]], "vectorobservationwrapper (class in gymnasium.vector)": [[14, "gymnasium.vector.VectorObservationWrapper", false]], "vectorrewardwrapper (class in gymnasium.vector)": [[14, "gymnasium.vector.VectorRewardWrapper", false]], "vectorwrapper (class in gymnasium.vector)": [[14, "gymnasium.vector.VectorWrapper", false]], "wrapper (class in gymnasium)": [[15, "gymnasium.Wrapper", false]], "wrapper_spec() (gymnasium.wrapper class method)": [[15, "gymnasium.Wrapper.wrapper_spec", false]], "wrapperspec (class in gymnasium.envs.registration)": [[4, "gymnasium.envs.registration.WrapperSpec", false]], "write_to_shared_memory() (in module gymnasium.vector.utils)": [[13, "gymnasium.vector.utils.write_to_shared_memory", false]]}, "objects": {"gymnasium": [[16, 0, 1, "", "ActionWrapper"], [2, 0, 1, "", "Env"], [18, 0, 1, "", "ObservationWrapper"], [19, 0, 1, "", "RewardWrapper"], [15, 0, 1, "", "Wrapper"], [4, 4, 1, "", "make"], [4, 4, 1, "", "make_vec"], [4, 4, 1, "", "pprint_registry"], [4, 4, 1, "", "register"], [5, 5, 0, "-", "spaces"], [4, 4, 1, "", "spec"], [15, 5, 0, "-", "wrappers"]], "gymnasium.ActionWrapper": [[16, 1, 1, "", "action"]], "gymnasium.Env": [[2, 2, 1, "", "action_space"], [2, 1, 1, "", "close"], [2, 2, 1, "", "metadata"], [2, 3, 1, "", "np_random"], [2, 3, 1, "", "np_random_seed"], [2, 2, 1, "", "observation_space"], [2, 1, 1, "", "render"], [2, 2, 1, "", "render_mode"], [2, 1, 1, "", "reset"], [2, 2, 1, "", "spec"], [2, 1, 1, "", "step"], [2, 3, 1, "", "unwrapped"]], "gymnasium.ObservationWrapper": [[18, 1, 1, "", "observation"]], "gymnasium.RewardWrapper": [[19, 1, 1, "", "reward"]], "gymnasium.Wrapper": [[15, 3, 1, "", "action_space"], [15, 1, 1, "", "close"], [15, 2, 1, "", "env"], [15, 1, 1, "", "get_wrapper_attr"], [15, 3, 1, "", "metadata"], [15, 3, 1, "", "np_random"], [15, 3, 1, "", "np_random_seed"], [15, 3, 1, "", "observation_space"], [15, 1, 1, "", "render"], [15, 1, 1, "", "reset"], [15, 1, 1, "", "set_wrapper_attr"], [15, 3, 1, "", "spec"], [15, 1, 1, "", "step"], [15, 3, 1, "", "unwrapped"], [15, 1, 1, "", "wrapper_spec"]], "gymnasium.envs.functional_jax_env": [[3, 0, 1, "", "FunctionalJaxEnv"]], "gymnasium.envs.functional_jax_env.FunctionalJaxEnv": [[3, 1, 1, "", "render"], [3, 1, 1, "", "reset"], [3, 1, 1, "", "step"]], "gymnasium.envs.registration": [[4, 0, 1, "", "EnvSpec"], [4, 0, 1, "", "WrapperSpec"], [4, 2, 1, "", "current_namespace"], [4, 4, 1, "", "find_highest_version"], [4, 4, 1, "", "get_env_id"], [4, 4, 1, "", "load_env_creator"], [4, 4, 1, "", "namespace"], [4, 4, 1, "", "parse_env_id"], [4, 2, 1, "", "registry"]], "gymnasium.experimental.functional": [[3, 0, 1, "", "FuncEnv"]], "gymnasium.experimental.functional.FuncEnv": [[3, 1, 1, "", "initial"], [3, 1, 1, "", "observation"], [3, 1, 1, "", "render_close"], [3, 1, 1, "", "render_image"], [3, 1, 1, "", "render_init"], [3, 1, 1, "", "reward"], [3, 1, 1, "", "state_info"], [3, 1, 1, "", "terminal"], [3, 1, 1, "", "transform"], [3, 1, 1, "", "transition"], [3, 1, 1, "", "transition_info"]], "gymnasium.spaces": [[7, 0, 1, "", "Box"], [6, 0, 1, "", "Dict"], [7, 0, 1, "", "Discrete"], [6, 0, 1, "", "Graph"], [7, 0, 1, "", "MultiBinary"], [7, 0, 1, "", "MultiDiscrete"], [6, 0, 1, "", "OneOf"], [6, 0, 1, "", "Sequence"], [5, 0, 1, "", "Space"], [7, 0, 1, "", "Text"], [6, 0, 1, "", "Tuple"]], "gymnasium.spaces.Box": [[7, 1, 1, "", "is_bounded"], [7, 1, 1, "", "sample"], [7, 1, 1, "", "seed"]], "gymnasium.spaces.Dict": [[6, 1, 1, "", "sample"], [6, 1, 1, "", "seed"]], "gymnasium.spaces.Discrete": [[7, 1, 1, "", "sample"], [7, 1, 1, "", "seed"]], "gymnasium.spaces.Graph": [[6, 1, 1, "", "sample"], [6, 1, 1, "", "seed"]], "gymnasium.spaces.MultiBinary": [[7, 1, 1, "", "sample"], [7, 1, 1, "", "seed"]], "gymnasium.spaces.MultiDiscrete": [[7, 1, 1, "", "sample"], [7, 1, 1, "", "seed"]], "gymnasium.spaces.OneOf": [[6, 1, 1, "", "sample"], [6, 1, 1, "", "seed"]], "gymnasium.spaces.Sequence": [[6, 1, 1, "", "sample"], [6, 1, 1, "", "seed"]], "gymnasium.spaces.Space": [[5, 1, 1, "", "contains"], [5, 3, 1, "", "dtype"], [5, 1, 1, "", "from_jsonable"], [5, 3, 1, "", "is_np_flattenable"], [5, 3, 1, "", "np_random"], [5, 1, 1, "", "sample"], [5, 1, 1, "", "seed"], [5, 3, 1, "", "shape"], [5, 1, 1, "", "to_jsonable"]], "gymnasium.spaces.Text": [[7, 1, 1, "", "sample"], [7, 1, 1, "", "seed"]], "gymnasium.spaces.Tuple": [[6, 1, 1, "", "sample"], [6, 1, 1, "", "seed"]], "gymnasium.spaces.utils": [[8, 4, 1, "", "flatdim"], [8, 4, 1, "", "flatten"], [8, 4, 1, "", "flatten_space"], [8, 4, 1, "", "unflatten"]], "gymnasium.utils.env_checker": [[9, 4, 1, "", "check_env"]], "gymnasium.utils.ezpickle": [[9, 0, 1, "", "EzPickle"]], "gymnasium.utils.performance": [[9, 4, 1, "", "benchmark_init"], [9, 4, 1, "", "benchmark_render"], [9, 4, 1, "", "benchmark_step"]], "gymnasium.utils.play": [[9, 0, 1, "", "PlayPlot"], [9, 0, 1, "", "PlayableGame"], [9, 4, 1, "", "play"]], "gymnasium.utils.play.PlayPlot": [[9, 1, 1, "", "callback"]], "gymnasium.utils.play.PlayableGame": [[9, 1, 1, "", "process_event"]], "gymnasium.utils.save_video": [[9, 4, 1, "", "capped_cubic_video_schedule"], [9, 4, 1, "", "save_video"]], "gymnasium.utils.seeding": [[9, 4, 1, "", "np_random"]], "gymnasium.utils.step_api_compatibility": [[9, 4, 1, "", "convert_to_done_step_api"], [9, 4, 1, "", "convert_to_terminated_truncated_step_api"], [9, 4, 1, "", "step_api_compatibility"]], "gymnasium.vector": [[11, 0, 1, "", "AsyncVectorEnv"], [12, 0, 1, "", "SyncVectorEnv"], [14, 0, 1, "", "VectorActionWrapper"], [10, 0, 1, "", "VectorEnv"], [14, 0, 1, "", "VectorObservationWrapper"], [14, 0, 1, "", "VectorRewardWrapper"], [14, 0, 1, "", "VectorWrapper"]], "gymnasium.vector.AsyncVectorEnv": [[11, 1, 1, "", "call"], [11, 1, 1, "", "close"], [11, 1, 1, "", "get_attr"], [11, 3, 1, "", "np_random"], [11, 3, 1, "", "np_random_seed"], [11, 1, 1, "", "reset"], [11, 1, 1, "", "set_attr"], [11, 1, 1, "", "step"]], "gymnasium.vector.SyncVectorEnv": [[12, 1, 1, "", "call"], [12, 1, 1, "", "close"], [12, 1, 1, "", "get_attr"], [12, 3, 1, "", "np_random"], [12, 3, 1, "", "np_random_seed"], [12, 1, 1, "", "reset"], [12, 1, 1, "", "set_attr"], [12, 1, 1, "", "step"]], "gymnasium.vector.VectorActionWrapper": [[14, 1, 1, "", "actions"]], "gymnasium.vector.VectorEnv": [[10, 2, 1, "", "action_space"], [10, 1, 1, "", "close"], [10, 2, 1, "", "closed"], [10, 2, 1, "", "metadata"], [10, 3, 1, "", "np_random"], [10, 3, 1, "", "np_random_seed"], [10, 2, 1, "", "num_envs"], [10, 2, 1, "", "observation_space"], [10, 1, 1, "", "render"], [10, 2, 1, "", "render_mode"], [10, 1, 1, "", "reset"], [10, 2, 1, "", "single_action_space"], [10, 2, 1, "", "single_observation_space"], [10, 2, 1, "", "spec"], [10, 1, 1, "", "step"], [10, 3, 1, "", "unwrapped"]], "gymnasium.vector.VectorObservationWrapper": [[14, 1, 1, "", "observations"]], "gymnasium.vector.VectorRewardWrapper": [[14, 1, 1, "", "rewards"]], "gymnasium.vector.VectorWrapper": [[14, 1, 1, "", "close"], [14, 1, 1, "", "render"], [14, 1, 1, "", "reset"], [14, 1, 1, "", "step"]], "gymnasium.vector.utils": [[13, 4, 1, "", "CloudpickleWrapper"], [13, 4, 1, "", "batch_space"], [13, 4, 1, "", "clear_mpi_env_vars"], [13, 4, 1, "", "concatenate"], [13, 4, 1, "", "create_empty_array"], [13, 4, 1, "", "create_shared_memory"], [13, 4, 1, "", "iterate"], [13, 4, 1, "", "read_from_shared_memory"], [13, 4, 1, "", "write_to_shared_memory"]], "gymnasium.wrappers": [[18, 0, 1, "", "AddRenderObservation"], [17, 0, 1, "", "AtariPreprocessing"], [17, 0, 1, "", "Autoreset"], [16, 0, 1, "", "ClipAction"], [19, 0, 1, "", "ClipReward"], [18, 0, 1, "", "DelayObservation"], [18, 0, 1, "", "DtypeObservation"], [18, 0, 1, "", "FilterObservation"], [18, 0, 1, "", "FlattenObservation"], [18, 0, 1, "", "FrameStackObservation"], [18, 0, 1, "", "GrayscaleObservation"], [17, 0, 1, "", "HumanRendering"], [17, 0, 1, "", "JaxToNumpy"], [17, 0, 1, "", "JaxToTorch"], [18, 0, 1, "", "MaxAndSkipObservation"], [18, 0, 1, "", "NormalizeObservation"], [19, 0, 1, "", "NormalizeReward"], [17, 0, 1, "", "NumpyToTorch"], [17, 0, 1, "", "OrderEnforcing"], [17, 0, 1, "", "PassiveEnvChecker"], [17, 0, 1, "", "RecordEpisodeStatistics"], [17, 0, 1, "", "RecordVideo"], [17, 0, 1, "", "RenderCollection"], [16, 0, 1, "", "RescaleAction"], [18, 0, 1, "", "RescaleObservation"], [18, 0, 1, "", "ReshapeObservation"], [18, 0, 1, "", "ResizeObservation"], [16, 0, 1, "", "StickyAction"], [18, 0, 1, "", "TimeAwareObservation"], [17, 0, 1, "", "TimeLimit"], [16, 0, 1, "", "TransformAction"], [18, 0, 1, "", "TransformObservation"], [19, 0, 1, "", "TransformReward"]], "gymnasium.wrappers.vector": [[14, 0, 1, "", "ClipAction"], [14, 0, 1, "", "ClipReward"], [14, 0, 1, "", "DictInfoToList"], [14, 0, 1, "", "DtypeObservation"], [14, 0, 1, "", "FilterObservation"], [14, 0, 1, "", "FlattenObservation"], [14, 0, 1, "", "GrayscaleObservation"], [14, 0, 1, "", "JaxToNumpy"], [14, 0, 1, "", "JaxToTorch"], [14, 0, 1, "", "NormalizeObservation"], [14, 0, 1, "", "NormalizeReward"], [14, 0, 1, "", "NumpyToTorch"], [14, 0, 1, "", "RecordEpisodeStatistics"], [14, 0, 1, "", "RescaleAction"], [14, 0, 1, "", "RescaleObservation"], [14, 0, 1, "", "ReshapeObservation"], [14, 0, 1, "", "ResizeObservation"], [14, 0, 1, "", "TransformAction"], [14, 0, 1, "", "TransformObservation"], [14, 0, 1, "", "TransformReward"], [14, 0, 1, "", "VectorizeTransformAction"], [14, 0, 1, "", "VectorizeTransformObservation"], [14, 0, 1, "", "VectorizeTransformReward"]]}, "objnames": {"0": ["py", "class", "Python class"], "1": ["py", "method", "Python method"], "2": ["py", "attribute", "Python attribute"], "3": ["py", "property", "Python property"], "4": ["py", "function", "Python function"], "5": ["py", "module", "Python module"]}, "objtypes": {"0": "py:class", "1": "py:method", "2": "py:attribute", "3": "py:property", "4": "py:function", "5": "py:module"}, "terms": {"": [1, 2, 3, 5, 6, 7, 9, 10, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 161, 163, 165, 166, 167, 170, 173, 175, 177], "0": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 157, 158, 161, 163, 165, 166, 167, 170, 173, 175, 177], "00": [2, 10, 18], "00000000e": [10, 18], "00048403": 14, "000635": [10, 14], "00080468": 14, "0008876301247721108": 19, "001": [133, 135, 140, 163, 170, 175], "00105192": 14, "00127": 14, "00133522": 14, "00142543": 14, "0015": 134, "002": [18, 139, 147, 169], "00209899": 14, "00214607": 11, "0025": [133, 134], "00266526": 14, "003": 140, "00372536": 14, "00395268": 14, "005": [144, 145, 170], "00529102": 14, "00545912": 14, "0059888": 18, "00611216": [10, 18], "00614867": 14, "00615723": 14, "00624371": 14, "00708": 6, "00711833": 6, "00715587": 14, "007812": 14, "008": [139, 147], "00848456": 10, "00942293": 10, "01": [2, 10, 14, 16, 18, 129, 137, 138, 141, 142, 143, 144, 145, 146, 154, 155, 163, 169, 170, 175], "010162116476634746": 19, "01098382": [10, 14], "01162461": 10, "0123456789": 7, "0123456789abcdefghijklmnopqrstuvwxyzabcdefghijklmnopqrstuvwxyz": 7, "01431748": 10, "015": 140, "01522993": 10, "01734283": [10, 16], "01823519": [10, 14, 16, 18], "0197368": [10, 18], "02": [10, 18, 145, 154, 155], "0203607": 16, "0219676": 14, "02418869": 10, "024251968": 14, "02438": 170, "02480598": [10, 14], "02727336": [10, 18], "02728892": 16, "0273956": [10, 18], "02796401": [10, 14, 16, 18], "02852531": [10, 14], "02858594": [10, 14], "02859527": [10, 16], "029028230434438706": 14, "02909703": 10, "03": [14, 129, 154, 155], "03156282": [10, 14, 16, 18], "03203924": [10, 14], "0333221090036294": 14, "03332211": 14, "03359492141887935": 14, "03392126": 10, "03516225": 10, "03517495": [10, 14], "03525399": 16, "03585979": [10, 18], "03625453": [10, 18], "03647037": 132, "03774345": 10, "03812904": 16, "03822722": 10, "04": [14, 16, 146, 154], "0446179": [10, 14, 16, 18], "04562247": 10, "0469136": [10, 14], "0469184": 10, "04731862": 10, "04740972": 10, "04794393": 16, "04799704": 10, "05": [14, 132, 137, 138, 141, 144, 154, 155, 169], "055928": 14, "05592803": 132, "06": 154, "06296527": 14, "06296527291998574": 14, "0631256": 14, "06312564": 132, "0640786": 14, "06540678": 18, "06670535": 16, "07": [133, 134, 154, 155], "07287608": 131, "07422512": 18, "08": [154, 155], "0808398": 11, "08227695": 18, "0892358": [14, 132], "09": [154, 155], "09417735": 13, "09613613": 18, "0_": [137, 138, 142, 144], "0_2": [144, 145], "0_3": 144, "0a2": 132, "0th": 17, "0x7f04efcb8850": 157, "0x7fbb5efd0490": 15, "1": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 157, 158, 160, 163, 165, 166, 167, 170, 173, 175, 177], "10": [6, 7, 14, 17, 18, 127, 129, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 150, 151, 153, 154, 155, 158, 165, 170, 175, 177], "100": [6, 8, 14, 17, 127, 128, 129, 131, 134, 139, 144, 148, 151, 152, 158], "1000": [9, 17, 128, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 156, 169, 170, 177], "1006": 155, "1009": 155, "100_000": [163, 175], "1013": 155, "10172354684460168": 14, "10172355": 14, "1018": 155, "1019": 155, "1027": 155, "1033": 155, "105": [137, 155], "107": 137, "1075": 155, "1086": 155, "1087": 155, "1094": 155, "1095": 155, "10_000": 161, "10x": [163, 175], "11": [14, 137, 138, 139, 140, 141, 144, 147, 150, 151, 154, 170, 175], "110": 155, "1105": 155, "1107": 155, "1109": 155, "11185605": 131, "1119": 155, "112": 155, "1120": 155, "11448676": 14, "117": 155, "12": [8, 16, 17, 18, 129, 131, 132, 137, 138, 139, 140, 141, 144, 147, 151, 155, 170, 175], "120": 170, "12224312": [11, 12], "123": [2, 6, 10, 11, 14, 16, 17, 18, 19, 131, 132, 133, 134, 135, 155, 160, 173], "124": 155, "125": 155, "125637": 6, "12625128": 131, "128": 170, "1280": 136, "13": [137, 138, 140, 141, 144, 147, 153, 154, 155], "130": [140, 141, 155], "132": [151, 155], "136": 155, "138": 155, "139": 155, "14": [137, 138, 140, 141, 144, 147, 154, 155, 175], "141": 155, "1415927": 127, "143": 155, "145": 137, "1455": 17, "146": 155, "14995256": [11, 12], "15": [8, 16, 17, 18, 19, 129, 137, 138, 140, 141, 144, 147, 152, 170, 173], "150": 9, "1500": 154, "1506": 170, "15089367": [10, 16], "1511158": 18, "155": 155, "1598639586606745": 14, "16": [135, 137, 138, 140, 141, 144, 147, 152, 154, 155, 175, 177], "160": 155, "1600": 127, "1613": 153, "164": 155, "16717631": 10, "17": [137, 138, 140, 141, 144, 147, 150, 154], "17032": 155, "171": 155, "1710671": 10, "172": 155, "173": 155, "174": 155, "17752565": 131, "178": 155, "179": 155, "18": [18, 137, 138, 140, 141, 144, 147, 154], "1800": 173, "183": 155, "184": 155, "1851753": 11, "186": 155, "1878752": 12, "18847767": 10, "18879032e": [10, 18], "1887903e": 2, "18933342": 6, "19": [137, 140, 141, 144, 170], "190": 155, "19049619": 6, "1938739": 14, "195": [155, 169], "19559774": 10, "1989": 163, "199": 9, "1990": [133, 134], "1995": 131, "1996": 131, "1_": 145, "1b15": 154, "1d": 177, "1e": [14, 18, 19, 139, 140, 141, 146, 147, 177], "2": [2, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 158, 163, 165, 167, 170, 173, 175, 177], "20": [129, 137, 140, 141, 144, 153, 155, 170, 173, 175], "200": [9, 17, 129, 131, 132, 133, 135, 152, 153, 155], "2000": [9, 127, 153, 173], "2004": 177, "200th": 17, "2014": 128, "2016": 170, "2017": 154, "20172954": 18, "2018": [16, 17, 131, 154], "2019": 154, "2020": [148, 150, 151], "2021": [136, 148, 154], "2022": [136, 154, 155], "2023": 155, "2024": 155, "205": 155, "206": 155, "208": 155, "20946532": 14, "2095": 132, "21": [18, 19, 137, 140, 141, 144, 150, 155, 157, 163, 170, 175], "210x180": 17, "215": 155, "216": 155, "21649833": 6, "218": 155, "219": 155, "21944423": 14, "22": [17, 137, 140, 141, 144, 155, 175], "220": 175, "221": 155, "223": 155, "224": 155, "225": 155, "227": 153, "229": 177, "23": [137, 140, 141, 144, 155], "230": 155, "234": 155, "235": 155, "235794": 6, "2359734": 14, "238": 155, "24": [14, 17, 18, 127, 128, 129, 132, 137, 140, 141, 155, 175], "24002443": 10, "2407": 155, "241": 155, "244": 155, "24503504": 14, "2456": 154, "246": 155, "247": 155, "2487226": 10, "249": 155, "25": [2, 8, 10, 17, 19, 137, 139, 140, 141, 147, 153, 155, 169], "250": [155, 161], "252": 155, "2524": 154, "254": 155, "255": [17, 18, 128, 155, 165], "25533703": 18, "256": 177, "257": 155, "25715804": 10, "258": 155, "259": 155, "25g": 137, "26": [2, 17, 137, 140, 141, 157, 159, 166, 177], "26141977": 10, "264": 155, "267": 155, "2671": 154, "2678": 154, "268": 155, "2684": 154, "27": [9, 19, 140, 141], "270": 155, "272": 155, "2736044": 135, "274": [131, 155], "274334": 131, "27648": [14, 18, 157], "277": 155, "278": 155, "27800309628058434": 14, "279": 155, "28": [14, 131, 140, 141], "281": 155, "282": 155, "2831855": 129, "284": 155, "29": [17, 140, 141, 154], "2906": 154, "292": 155, "2d": [136, 145, 148, 158, 167], "2gb": 154, "2nd": 163, "3": [2, 4, 5, 6, 7, 8, 9, 10, 12, 13, 14, 15, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 157, 158, 159, 163, 165, 170, 173, 175, 177], "30": [2, 9, 17, 140, 141, 155], "300": [127, 153, 165, 173], "3000": 9, "3016": 154, "303": 153, "3034542e": 14, "3036": 154, "3037": 154, "304": 155, "3040": 154, "3041": 154, "3044": 154, "3072": 154, "3076": 154, "3080": 154, "3083": 154, "31": [140, 141, 155], "3110827": 10, "3118435e": 14, "3152": 19, "3154": 19, "32": [18, 140, 141, 150, 170, 177], "321": 155, "323": [144, 155], "32351476": 18, "327": [137, 155], "329": 155, "33": [140, 141], "330": 155, "331": 155, "3319138": 10, "332": 155, "33293587": [10, 16], "333": 155, "337": 155, "338": 155, "339": 155, "34": [140, 141], "340": 155, "341": 155, "342": 155, "34641072": 16, "34756234": 16, "348": [140, 141], "35": [140, 141, 155], "350": [140, 141], "355": 155, "36": [6, 140, 141, 151], "37": [140, 141, 155], "373": 155, "38": [140, 141], "39": [140, 141, 155], "3991573": 6, "3d": [136, 137, 140, 141, 148, 158, 175], "3rd": 136, "3v3": 148, "4": [2, 6, 7, 8, 9, 10, 14, 16, 17, 18, 127, 128, 129, 131, 132, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 155, 158, 161, 165, 167, 175, 177], "40": [140, 141], "400": [18, 153], "403": 155, "404": [153, 155], "405": 155, "407": 155, "41": [140, 141, 155], "4123625": 135, "418": [132, 155], "41887903": [10, 18, 132], "4189": 6, "42": [6, 7, 10, 11, 12, 13, 16, 18, 140, 141, 155, 156, 170], "424": 155, "42683297": 14, "42884544": 14, "429": 155, "43": [140, 141], "433": 155, "435": 155, "4351738": 14, "4354835e": 14, "43887845": 13, "44": [6, 140, 141], "440": 155, "444": 155, "44504836": 14, "44799727": 14, "4488689e": 14, "45": [134, 144], "450": 155, "451": 155, "459": 155, "4624777": 14, "46352962": [14, 133], "46515748": 14, "46543318": 14, "46546045": 14, "46553135": 14, "47": 151, "47037": 6, "47377947": 14, "474": 155, "47599354": 14, "48": 151, "480": 136, "4808415e": 16, "482": 155, "48468155": 14, "48657528": 14, "487": 155, "491": 155, "495": 155, "497": 155, "498371": 14, "4x12": 151, "4x4": 152, "4\u03c0": 131, "5": [6, 7, 8, 9, 10, 13, 14, 16, 17, 18, 19, 127, 128, 129, 131, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 153, 158, 163, 165, 170, 173, 175, 177], "50": [17, 129, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 177], "500": [17, 18, 131, 132, 153, 154, 170, 175], "503": 155, "51": [6, 155], "510": 155, "512": [155, 165], "515": 155, "516": 155, "518": 155, "520": 155, "522": 155, "524": 155, "525": 155, "5281": 155, "529": 155, "533": 155, "5342437e": 14, "535": 155, "536": 155, "538": 155, "54": 6, "540": 155, "5420062": 16, "548": 155, "55": [6, 155], "553": 155, "554": 155, "557": 155, "558": 155, "561": 155, "5635296": 134, "564": 155, "566371": 131, "567": 131, "5676788": 18, "569": 155, "57": 170, "572": 155, "573": 155, "576": 155, "5760367": [11, 12], "58": 155, "580": 155, "581": 155, "583": 155, "5836242": 6, "588": 155, "589": 155, "590": 155, "594091": 6, "5e": [137, 139, 140, 141, 147, 169], "5e3": 177, "5gb": 154, "5x": 155, "5x5": 153, "6": [8, 14, 16, 17, 18, 129, 131, 133, 134, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 151, 153, 155, 177], "60": [8, 155], "600": [18, 170], "604": 155, "608": 155, "61": 155, "6102389": 12, "612": 155, "6120394": 18, "616": 155, "6193494": 11, "62": [6, 11, 12], "62259156": 14, "63": [6, 7], "63214064": 6, "634": 155, "6343064e": 14, "6344974": 16, "635": 155, "6382770e": 16, "639": 153, "6397772e": 16, "64": [6, 155], "6608303": 16, "666": 129, "669": 6, "6822636": 6, "683": 155, "68512": 6, "695": 155, "697368": 13, "7": [9, 129, 134, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 148, 155, 170], "70": 155, "708": 155, "709": 155, "714599": 11, "7183299": 18, "72": 155, "720": 136, "7257502": 6, "729": 9, "73": 155, "731": 155, "732": 128, "74": 155, "748": 155, "75": [16, 137, 169], "75cm": 137, "7619033": 14, "762": 155, "7695615": 12, "77": 6, "77395606": 13, "774": 155, "78": [137, 140, 141, 155], "7851154": 11, "788": 155, "789": 155, "791": 155, "79221743": 12, "8": [6, 9, 10, 14, 18, 19, 128, 129, 131, 132, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 152, 154, 170, 173, 177], "80": 152, "800": [155, 173], "8000": 1, "80000019": [10, 18], "80000019e": [10, 18], "8000002e": 2, "801": 155, "81": [11, 12, 135, 155], "810": 155, "811": 155, "812": 155, "8127025": 155, "8127026": 155, "817": 155, "8174238": [11, 12], "81882": 6, "827": 155, "83": 155, "830": 155, "83506": 6, "838": 155, "84": [6, 17], "8498053": 12, "84x84": 17, "85859793": 13, "867": 155, "886": 155, "889": 155, "88g": 137, "89": [6, 155], "8914354": 18, "89235795": 135, "893": 155, "89345848": 14, "8934584807363618": 14, "8971417e": 14, "898": 155, "8f1d43620bc6bb580df6e80b0dc05c48": 131, "8gb": 154, "8x8": 152, "9": [11, 12, 14, 16, 131, 135, 137, 138, 139, 140, 141, 142, 144, 145, 147, 150, 155, 170, 175], "90": 175, "9053838": 6, "9061728": 14, "90849805": 14, "910": 137, "91101986": 135, "91244936": [11, 12], "915": 155, "91g": 137, "9216": 14, "924": 155, "926": [128, 155], "928024": 6, "9296501": 14, "932": 155, "934": 155, "9375233e": 14, "9380709": 16, "941": 155, "9416149": 14, "9428282": 14, "9429494": 14, "95": [128, 163, 170, 173, 175], "9503881": 14, "950912": 6, "952": 155, "9532328e": 14, "956": 155, "957": 155, "958": 155, "959": 155, "96": [14, 18, 128, 157], "9635296": 14, "96495728": 11, "965": 155, "96562607": 12, "96x96": 128, "97562236": 13, "977": 155, "978": 155, "97854": 6, "979": 155, "9808417e": 16, "982": 155, "98219293": 12, "98270553": 11, "9841162": 131, "9886932": [11, 12], "9898634e": 14, "99": [14, 19, 170, 177], "9944268": 18, "997341": 131, "999": [134, 170], "99902063": 12, "9999": 129, "9\u03c0": 131, "A": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 14, 16, 17, 18, 19, 20, 127, 128, 131, 132, 134, 136, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 148, 150, 151, 152, 154, 155, 158, 159, 165, 166, 167, 170, 175, 177], "And": [165, 169], "As": [2, 5, 10, 15, 17, 131, 132, 136, 149, 151, 153, 154, 155, 157, 158, 162, 167, 170, 173, 177], "At": [14, 17, 157], "Be": 155, "But": 137, "By": [2, 4, 17, 18, 131, 137, 138, 139, 140, 141, 146, 147, 148], "For": [2, 3, 5, 7, 8, 9, 10, 17, 18, 19, 128, 129, 136, 151, 152, 153, 154, 155, 157, 158, 159, 160, 161, 162, 163, 165, 167, 169, 170, 175], "If": [1, 2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 127, 128, 129, 132, 134, 137, 139, 140, 147, 148, 150, 151, 152, 154, 155, 156, 157, 159, 163, 165, 167, 169, 170, 172, 173, 175], "In": [2, 7, 9, 13, 14, 15, 16, 19, 128, 129, 130, 131, 132, 137, 138, 139, 140, 141, 145, 146, 147, 153, 154, 155, 157, 158, 160, 161, 162, 165, 166, 167, 169, 170, 173, 175, 177], "It": [2, 4, 6, 7, 9, 10, 11, 12, 17, 127, 136, 137, 140, 141, 144, 145, 146, 148, 154, 155, 158, 162, 163, 165, 175, 177], "Its": 148, "No": [16, 17, 18, 150, 154, 177], "Not": [17, 154], "On": [131, 132, 133, 134, 135, 154, 155, 158, 173], "One": [7, 137, 140, 141, 155, 157, 158], "Or": 153, "Such": [15, 167], "That": [3, 132], "The": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 163, 165, 166, 169, 170, 175, 177], "Then": [1, 15, 157, 165, 167], "There": [7, 15, 127, 129, 130, 133, 134, 136, 140, 142, 148, 151, 153, 154, 155, 157, 158, 165], "These": [9, 126, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 160, 163, 175, 177], "To": [1, 2, 4, 9, 10, 13, 15, 17, 18, 127, 128, 129, 131, 132, 144, 153, 154, 155, 157, 158, 159, 160, 163, 165, 167, 169, 170, 173, 175, 177], "Will": [9, 17, 20, 155], "With": [6, 14, 17, 148, 151, 155, 170, 173, 175], "_": [9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 128, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 156, 163, 167, 170, 173], "_2": [137, 138, 139, 140, 141, 144, 145, 146, 147], "__all__": 155, "__class__": 170, "__del__": 155, "__eq__": 154, "__future__": [170, 175, 177], "__get_attr__": 155, "__getattr__": 155, "__getitem__": 154, "__init__": [2, 9, 14, 15, 16, 18, 155, 163, 165, 167, 170, 173, 175, 177], "__len__": 154, "__main__": [167, 170], "__name__": [167, 170], "__setitem__": 154, "_action_to_direct": [158, 165], "_agent_loc": [158, 165], "_async_work": 11, "_bigint_from_byt": 154, "_build": 1, "_control": 141, "_cost": [140, 141], "_episod": [14, 17], "_final_info": 14, "_final_observ": 14, "_get_info": [158, 165], "_get_ob": [158, 165], "_int_list_from_bigint": 154, "_k": 14, "_modul": 18, "_nois": [137, 138, 139, 140, 141, 142, 143, 146, 147], "_np_random": [2, 5, 10, 154], "_np_random_se": [2, 10], "_rang": [140, 141], "_render_fram": 165, "_replac": 173, "_reward_ctrl": 14, "_reward_run": 14, "_scale": [137, 138, 139, 140, 141, 142, 143, 146, 147], "_script": 1, "_skip": 169, "_static": [1, 173], "_step_mujoco_simul": 155, "_target_loc": [158, 165], "_timelimit": 154, "_update_running_mean": [14, 19], "_worker": 11, "_x_posit": 14, "_x_veloc": 14, "a2c": [155, 168, 171], "a3c": 170, "a_": [166, 170], "a_0": 6, "a_i": 6, "a_max": 170, "a_min": 170, "a_n": 6, "a_t": 166, "aaronwalsman": 155, "ab": [128, 155, 170], "abbeel": [137, 170], "abdomen": [140, 141], "abdomen_i": [140, 141], "abdomen_x": [140, 141], "abdomen_z": [140, 141], "abil": [155, 157, 162], "abl": [155, 166], "about": [2, 3, 9, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 155, 157, 158, 162, 163, 165, 166, 175, 177], "abov": [2, 6, 7, 9, 131, 132, 135, 136, 155, 157, 159, 160, 161, 170, 177], "absolut": [141, 143, 147, 169], "abstract": [155, 158, 165], "ac": [150, 175], "academ": 162, "academia": 155, "acceler": [128, 133, 134, 135, 148, 154, 155, 170], "accent_r": 175, "accept": [2, 5, 7, 9, 17, 154, 155, 158, 165, 169], "access": [14, 15, 17, 148, 154, 155, 160, 167], "accid": 154, "accident": 154, "accord": [4, 6, 7, 129, 170], "accordingli": 165, "account": [148, 154, 155, 173], "accumul": 161, "accur": [136, 155], "accuraci": 136, "achiev": [2, 131, 150, 154, 161, 162, 170], "acquir": 136, "acrobot": [130, 154], "acrobotenv": 131, "acronym": 177, "across": 13, "act": [14, 155, 161, 162, 163, 167, 175], "action": [2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 15, 17, 20, 130, 136, 148, 149, 154, 155, 156, 158, 160, 161, 165, 167, 169, 170, 173, 177], "action_log_prob": 170, "action_logit": 170, "action_logits_vec": 170, "action_mask": 153, "action_mean": 177, "action_pd": 170, "action_s": 173, "action_shap": 170, "action_spac": [2, 5, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 153, 155, 156, 157, 158, 160, 161, 163, 165, 167, 173, 175, 177], "action_space_dim": 177, "action_stddev": 177, "action_valu": 175, "actionwrapp": [14, 15, 16], "actor": 177, "actor_lay": 170, "actor_loss": 170, "actor_losses_moving_averag": 170, "actor_lr": 170, "actor_optim": 170, "actor_weight": 170, "actor_weights_path": 170, "acttyp": [2, 3, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 155], "actual": [137, 153, 155, 158, 161, 165, 170, 173], "actuat": [131, 132, 140, 141, 155, 169], "acycl": 148, "ad": [10, 14, 16, 17, 18, 19, 129, 132, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 157, 158, 160, 165], "adamw": 177, "adapt": [132, 142, 143, 148, 151], "add": [1, 4, 5, 9, 11, 16, 17, 18, 19, 20, 128, 136, 137, 140, 141, 151, 153, 154, 155, 157, 158, 165, 167, 170, 175], "add_subplot": 175, "addit": [2, 5, 129, 130, 150, 153, 154, 155, 157, 160, 163, 165, 169, 170, 175, 177], "addition": [148, 154, 155, 158, 159], "additional_wrapp": [4, 155], "addrenderobserv": [18, 20, 155], "address": [155, 160], "adilzouitin": 154, "adjac": 6, "adjust": 169, "advanc": [11, 131, 155], "advantag": [137, 155, 165], "advis": 14, "affect": 155, "affin": [14, 16, 18, 20, 129, 157], "after": [2, 3, 5, 9, 10, 17, 129, 137, 138, 139, 140, 141, 144, 145, 146, 147, 150, 153, 154, 155, 157, 158, 163, 165, 166, 170, 175, 177], "again": [2, 129, 158, 165], "against": [148, 154], "agen": 158, "agent": [2, 4, 5, 14, 17, 20, 127, 128, 129, 133, 137, 138, 139, 140, 141, 142, 143, 145, 146, 147, 148, 154, 155, 157, 158, 160, 162, 165, 166, 167, 173], "agent_posit": 167, "agnost": [137, 138, 139, 140, 141, 146, 147], "ago": 155, "ahead": 154, "ahmedo42": 154, "ai": 148, "aim": [136, 139, 146, 147, 154, 155, 177], "air": [144, 155], "al": [4, 14, 16, 17, 19, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 154, 155, 163, 175], "ale_pi": [17, 155], "alexdluken": 155, "algorithm": [2, 5, 148, 149, 153, 154, 155, 163, 166, 169, 170, 173, 175, 177], "alias": 148, "align": 155, "aliv": [140, 147], "alive_bonu": 142, "all": [2, 4, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 126, 128, 129, 130, 131, 132, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 154, 155, 157, 158, 167, 169, 170, 171, 173, 175, 177], "all_act": 173, "all_stat": 173, "allow": [3, 4, 5, 6, 9, 11, 12, 14, 15, 17, 18, 19, 20, 131, 132, 133, 134, 135, 140, 141, 142, 143, 147, 148, 154, 155, 157, 158, 159, 160, 165, 167, 170, 177], "almost": 154, "along": [2, 9, 10, 15, 18, 128, 132, 133, 134, 142, 143, 146, 151, 154, 155, 157, 159], "alongsid": 157, "alphabet": 7, "alphanumer": 7, "alreadi": [2, 10, 15, 155, 157], "alreadypendingcallerror": 11, "also": [2, 7, 9, 10, 11, 12, 17, 18, 127, 128, 129, 130, 136, 137, 143, 148, 154, 155, 157, 158, 163, 165, 166, 167, 169, 170, 173, 175, 177], "alter": [2, 15, 157, 165], "altern": [1, 148, 155, 158], "although": [7, 137, 169], "altogeth": 169, "alwai": [7, 8, 17, 129, 133, 134, 137, 140, 141, 142, 145, 146, 150, 151, 152, 153, 155, 165], "among": [16, 130, 136], "amount": [9, 127, 132, 142, 143, 177], "amp": 155, "amtamasi": 154, "an": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 165, 166, 167, 169, 170, 172, 173], "analog": 2, "analogi": 144, "analysi": 148, "analyt": 142, "anderson": [132, 142, 143], "andrea": [155, 169, 173], "andrew": [133, 134, 155, 173], "andrewtanj": 154, "angl": [127, 129, 131, 132, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154], "angle_1": 137, "angle_2": 137, "angle_3": 137, "angle_4": 137, "angular": [127, 129, 131, 132, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "ani": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 129, 132, 137, 139, 140, 142, 143, 146, 147, 148, 152, 154, 155, 157, 158, 160, 163, 165, 166, 169, 173, 175, 177], "anim": [9, 136], "ankle_1": 137, "ankle_2": 137, "ankle_3": 137, "ankle_4": 137, "annot": [154, 155, 170, 173, 175, 177], "annot_kw": 173, "annoy": 155, "anoth": [14, 15, 137, 140, 141, 143, 147, 153, 154, 163, 166, 175, 177], "ansi": [2, 154, 155, 165], "ansi_list": 2, "answer": [148, 165], "ant": [136, 154, 155, 169], "anymor": [14, 18, 19, 154, 155], "anyth": [148, 169, 173], "anytrad": 148, "anywher": 155, "apach": 148, "apart": 165, "api": [2, 3, 17, 20, 148, 154, 155, 156, 157, 160, 165, 166, 169, 170], "apicompat": 155, "appear": [133, 134], "append": [14, 19, 155, 163, 170, 173, 175, 177], "appli": [2, 4, 6, 8, 9, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 127, 129, 130, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 154, 155, 157, 158, 161, 163, 165, 167, 175, 177], "applic": [5, 7, 146, 148, 157], "apply_along_axi": 175, "apply_api_compat": [4, 154, 155, 159], "apply_env_compat": 155, "appreci": 154, "approach": [19, 142, 154, 155, 158, 165, 175, 177], "appropri": [155, 158, 165, 169, 177], "approxim": [14, 19, 148], "ar": [2, 4, 5, 6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 157, 158, 159, 160, 161, 162, 163, 165, 166, 167, 169, 170, 172, 173, 175, 177], "araffin": 154, "arang": [170, 173, 175], "arbitrari": [2, 4, 7, 9, 148, 170], "arbitrarili": 155, "arcad": [17, 148], "archiv": 154, "archiveprefix": 155, "area": 136, "aren": 154, "arg": [9, 11, 12, 163, 170, 175, 177], "argmax": [153, 163, 173, 175], "argument": [4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 126, 130, 149, 154, 155, 157, 158, 159, 165, 169, 175], "ariel": 155, "arjun": [154, 155], "arm": [136, 140, 141, 144, 145, 148], "around": [3, 14, 126, 148, 154], "arr": 175, "arrai": [2, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 20, 129, 131, 132, 133, 134, 135, 154, 155, 158, 165, 167, 170, 175, 177], "arrang": 4, "array_equ": [158, 165], "arraytyp": [10, 11, 12, 14], "arrow": [7, 173], "articl": 173, "articul": [146, 148], "artifici": 153, "arxiv": [155, 170], "as_cmap": 173, "ascii": 154, "aspect": [154, 157, 166], "assert": [155, 165], "assess": 2, "asset": 152, "assign": [2, 132, 133, 134, 144, 145, 148, 155, 173, 175], "assist": 148, "associ": [154, 155, 160], "assum": [7, 14, 18, 136, 155, 165], "async": [4, 11, 155], "asynchron": [5, 10, 11, 12, 155], "asyncvectorenv": [10, 155, 162, 170], "atari": [9, 17, 20, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 154, 155, 157, 163, 175], "atari_wrapp": 18, "ataripreprocess": [17, 20, 155], "attach": [129, 132, 135, 137, 142, 143, 147], "attempt": [8, 129, 136, 153, 170], "attitud": 148, "attr": 6, "attribut": [4, 6, 11, 12, 17, 18, 19, 155, 157, 160, 165, 167], "attributeerror": 155, "augment": [18, 20, 157], "author": [133, 134, 154, 155, 169, 170, 173, 175, 177], "auto": [21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 155], "autobuild": 1, "autom": [148, 154], "automat": [1, 2, 4, 10, 11, 12, 17, 18, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 154, 155, 159, 160, 165, 170], "automata": [148, 155], "autoreset": [4, 10, 17, 20, 155], "autoresetwrapp": [17, 154, 155], "autorom": [154, 155], "aux_1": 137, "aux_2": 137, "aux_3": 137, "aux_4": 137, "auxiliari": [2, 158, 165], "avail": [4, 129, 148, 150, 151, 154, 155, 158, 161, 165], "averag": [9, 14, 19, 20, 166, 173, 175, 177], "avg_reward": 177, "avoid": [10, 13, 15, 151, 153, 157, 169], "awai": [137, 138, 139, 140, 144, 145, 146, 147, 167, 173], "awak": 129, "awar": [155, 160], "award": 132, "ax": [5, 7, 165, 170, 173, 175], "ax1": 175, "ax2": 175, "axb2035": 155, "axi": [17, 18, 133, 134, 137, 140, 141, 146, 173, 175], "b": [1, 7, 144, 145, 153, 154], "b2dtut": 128, "b5": 7, "back": [11, 15, 18, 20, 126, 136, 137, 138, 155, 169], "back_leg": 137, "backend": [154, 155, 165], "background": 128, "backlog": 155, "backport": 154, "backward": [138, 154, 155, 159, 170, 177], "bad": 13, "badia": 170, "balanc": [132, 142, 143, 170, 177], "bali": 155, "balisujohn": [154, 155], "bandit": 148, "bar": [9, 170], "bare": 15, "barto": [2, 131, 132, 142, 143, 150, 151, 154, 163, 173, 175], "base": [2, 4, 5, 6, 9, 10, 13, 14, 15, 17, 18, 19, 20, 126, 131, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 153, 154, 155, 157, 163, 165, 167, 169, 170, 175, 177], "base_env": 15, "baselin": 148, "baselines3": 18, "basemujocoenv": 155, "bash": 154, "basic": [155, 158, 163, 169], "basilisk": 148, "batch": [5, 10, 11, 12, 13, 14, 154, 170], "batch_spac": [5, 13, 155], "bbox_inch": 173, "bbox_to_anchor": 175, "beat": [148, 150], "beatabl": [163, 175], "becam": 154, "becaus": [136, 158, 160, 163, 165, 167, 169, 170, 175], "becom": [126, 154, 155, 166, 170], "been": [2, 9, 10, 129, 137, 140, 141, 148, 154, 155, 157, 158, 160, 165, 166, 170, 173], "befor": [2, 4, 9, 10, 14, 15, 16, 17, 18, 19, 20, 128, 137, 138, 139, 140, 144, 145, 146, 147, 154, 155, 158, 163, 165, 167, 169, 170, 177], "began": 155, "begin": [14, 17, 154, 158, 160, 165, 170, 175], "behav": 154, "behavior": [14, 15, 135, 136, 137, 138, 139, 140, 141, 146, 147, 154, 155, 158, 165, 167, 169], "behaviour": [129, 136, 155, 157, 161], "behind": [2, 137, 140, 155], "being": [3, 4, 8, 17, 129, 133, 134, 135, 137, 140, 141, 144, 145, 154, 155, 158, 163, 169, 173, 175], "believ": 155, "belong": 6, "below": [7, 10, 20, 135, 147, 155, 157, 161, 166, 170, 175], "benchmark": [126, 148, 171, 176], "benchmark_init": 9, "benchmark_rend": 9, "benchmark_step": 9, "benefit": [17, 155], "bennet": 148, "besid": 136, "best": [163, 170, 173, 175], "better": [127, 154, 155, 170], "between": [3, 6, 7, 14, 17, 18, 19, 20, 129, 131, 132, 135, 136, 137, 138, 139, 140, 141, 142, 143, 145, 146, 147, 148, 154, 155, 158, 163, 165, 166, 170, 173, 175, 177], "beyond": 155, "bfoot": 138, "bia": [137, 138, 139, 140, 141, 146, 147, 170], "bias": 170, "big": [165, 173], "bigger": 173, "biggest": 173, "binari": [5, 7, 148, 157, 165], "bind": [126, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154], "biomechan": 136, "bipe": 136, "biped": [126, 140, 141, 147, 154, 169], "bipedal_walk": 127, "bipedalwalk": [127, 154], "biplan": 148, "bird": 148, "bit": 154, "bitcraz": 148, "bkrl": 154, "black": [131, 154, 155, 173, 175], "blackjack": [149, 154, 155, 163, 171, 176], "blackjack_tutori": 175, "blackjackag": [163, 175], "blit": 165, "blob": [151, 154], "block": 155, "blog": [154, 155, 160], "blue": [131, 153, 165, 173], "board": 148, "board_gam": 154, "bodi": [129, 136, 137, 138, 139, 140, 141, 142, 144, 146, 147, 169], "boilerpl": [15, 157, 158, 165, 167], "bonu": [147, 170], "book": [131, 150, 151, 154, 163, 175], "book_or_nip": 131, "bool": [2, 4, 5, 6, 7, 9, 10, 11, 12, 14, 15, 17, 18, 132, 136, 137, 138, 139, 140, 141, 146, 147, 160, 163, 165, 173, 175, 177], "bool8": 155, "bool_": 155, "boolean": [2, 5, 9, 14, 17, 129, 155, 160, 163, 175, 177], "booster": 129, "bootstrap": [2, 160, 166, 170], "both": [7, 10, 11, 12, 18, 127, 130, 131, 147, 148, 150, 151, 152, 153, 154, 155, 157, 158, 159, 162, 165, 166, 173, 177], "bother": 165, "bottom": [128, 133, 134, 139, 151, 155, 173], "bound": [2, 7, 14, 16, 17, 18, 19, 20, 129, 131, 132, 133, 134, 135, 154, 155, 157, 158, 160, 166], "boundari": 8, "bounded": 5, "box": [2, 5, 6, 7, 8, 10, 11, 13, 14, 15, 16, 17, 18, 20, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 157, 158, 165, 167], "box2d": [127, 128, 129, 154, 155, 157], "brake": 128, "brax": [148, 155], "break": [128, 154, 155, 160, 173], "bridg": 148, "briefli": 160, "bring": 155, "britojr": 155, "brittl": 177, "broken": [17, 154], "brown": 155, "browser": 1, "bshin": 138, "bsk": 148, "bsuit": 148, "bthigh": 138, "btjanaka": 155, "buffalo": 148, "buffer": [10, 14, 17, 128, 161], "buffer_length": [14, 17, 161], "bug": [17, 129, 136, 137, 139, 140, 142, 143, 144, 145, 147, 152, 154, 155], "build": [5, 126, 137, 138, 139, 140, 141, 142, 143, 146, 147, 155, 165, 167], "built": [13, 148, 154, 155, 160, 162], "bullet": 148, "bump": 154, "bunch": 173, "burden": 177, "bushi": 9, "bust": [150, 175], "button": [5, 7, 9, 157], "c": [9, 129, 131, 155, 165, 177], "calcul": [14, 18, 19, 135, 137, 142, 151, 152, 153, 169, 170, 177], "call": [2, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 137, 144, 145, 150, 152, 154, 155, 157, 158, 159, 160, 163, 165, 169, 170, 175], "call_async": 155, "call_wait": 155, "callabl": [3, 4, 9, 11, 12, 13, 14, 16, 17, 18, 19, 155], "callback": [9, 154], "cambridg": [133, 134], "camera": [136, 137, 138, 139, 140, 146, 147], "camera_id": 136, "camera_nam": 136, "campbel": 128, "can": [1, 2, 4, 5, 6, 7, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 126, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 165, 167, 169, 170, 173, 177], "cannot": [4, 8, 13, 17, 148, 151, 154, 155, 157, 158, 160, 165, 166, 172], "canva": 165, "capabl": [154, 156], "capped_cubic_video_schedul": [9, 17], "captur": [9, 17], "car": [126, 130, 148, 154, 157], "car_rac": [128, 154, 157], "card": [150, 163, 175], "care": [5, 148], "carl": 148, "carlo": [163, 170, 175, 177], "carloslui": 154, "carrac": [9, 14, 18, 128, 154, 155, 157], "cart": [130, 142, 143, 177], "cartesian": [6, 7, 135], "cartpol": [2, 9, 10, 14, 16, 17, 18, 19, 130, 132, 136, 142, 143, 148, 154, 155, 157, 159, 161, 177], "cartpoleenv": [17, 132, 155], "cartpolejax": 17, "cartpolevectorenv": [132, 155], "casadi": 148, "case": [2, 3, 5, 6, 7, 13, 14, 16, 128, 137, 138, 139, 140, 141, 146, 147, 153, 155, 157, 158, 160, 163, 165, 166, 167, 169, 170, 177], "casino": [163, 175], "cast": 18, "cat": 138, "catch": 155, "categor": [7, 170], "categori": 1, "caught": 154, "caus": [140, 147, 153, 154, 155, 157, 162], "cbar": 175, "cc": 131, "cclauss": 154, "cd": 1, "cell": [148, 151, 158, 165], "cellular": [148, 155], "center": [14, 18, 20, 128, 129, 132, 135, 137, 140, 141], "centr": [137, 140, 141], "central": 155, "centric": 177, "certain": [2, 143, 145, 154, 155, 163, 170, 175], "cfrc_ext": [137, 140, 141], "chain": [15, 131, 146, 157], "challeng": [148, 163, 175], "chanc": 11, "chandrasekar": 177, "chang": [1, 2, 3, 9, 11, 12, 13, 14, 15, 16, 17, 18, 19, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 143, 146, 147, 148, 153, 154, 155, 157, 158, 159, 160, 161, 162, 163, 165, 167, 169, 175, 177], "changelog": [4, 154, 155], "channel": [14, 17, 18], "charact": [7, 9], "character": 148, "charg": [6, 148], "charlist": 7, "charraut": 155, "charset": 7, "check": [1, 2, 5, 7, 15, 17, 20, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 158, 165, 167, 169, 173, 175], "check_env": [9, 155], "check_environments_match": 155, "checker": [4, 20, 154, 155], "cheetah": 136, "child": 13, "children": 11, "choic": [170, 173, 177], "choos": [158, 165, 169, 173], "choose_act": 173, "chosen": [2, 7, 151, 158, 170, 173], "chri": 128, "christofkaufmann": 155, "chrisyeh96": 155, "chunk": 17, "ci": [9, 154, 155], "cinert": [140, 141], "circl": 165, "citat": 155, "cite": 155, "clamp": [140, 141], "clarifi": [154, 155], "class": [2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 14, 15, 17, 18, 19, 136, 148, 154, 155, 157, 158, 162, 163, 165, 167, 169, 170, 173, 175, 177], "classic": [129, 131, 132, 133, 134, 135, 139, 143, 146, 147, 148, 154, 155, 157, 160, 166], "classic_control": [17, 154, 160], "classif": 148, "classmethod": 15, "clean": [2, 148], "cleaner": 153, "cleanup": 154, "clear": [13, 17, 155], "clear_mpi_env_var": 13, "clearer": [2, 154], "clearli": 5, "clementperroud": 155, "cliff": [149, 155], "cliff_walk": 151, "cliffwalk": [151, 154, 155], "climb": 130, "clip": [14, 16, 19, 20, 133, 134, 137, 155, 157, 158, 165, 167, 170], "clip_reward": 165, "clipact": [14, 16, 20, 157], "clipreward": [10, 14, 19, 20, 155, 167], "clobber": 154, "clock": 165, "clockwis": 135, "clone": [148, 169], "close": [2, 3, 7, 9, 10, 11, 12, 14, 15, 17, 19, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 136, 137, 139, 140, 145, 147, 154, 155, 156, 157, 160, 161, 167, 170, 173, 175], "close_extra": [10, 11, 12], "closechoic": 155, "closer": [129, 150, 155], "closest": 127, "cloudpickl": 13, "cloudpicklewrapp": 13, "cluster": 148, "cmap": [173, 175], "cnt": 155, "co": [131, 133, 134, 135, 142, 145, 148], "coars": 131, "code": [2, 3, 5, 6, 9, 11, 14, 15, 17, 18, 19, 131, 148, 154, 155, 158, 162, 163, 165, 167, 170, 171, 173, 175, 177], "codebas": [155, 165], "coeffici": [146, 170], "coin": 7, "coincid": [158, 165], "col": [151, 152], "cola": 155, "colder": 154, "collect": [2, 10, 11, 12, 17, 20, 148, 155, 156, 161, 163, 169, 170, 175], "collid": 129, "collis": [133, 134, 154], "color": [2, 6, 153], "color_palett": 173, "colour": [128, 154], "column": [4, 173, 177], "com": [19, 140, 141, 151, 152, 154, 165, 169, 173, 177], "combin": [2, 5, 9, 15, 145, 157, 170], "come": [129, 137, 138, 139, 140, 146, 147, 155], "comet": 155, "command": [136, 165], "comment": [136, 154, 155, 170], "commit": [155, 165], "common": [7, 15, 18, 20, 155, 157, 158, 166, 167, 177], "commonli": [20, 154, 157], "commonroad": 148, "commun": [11, 154, 160], "compar": [139, 140, 146, 147, 148, 163, 170, 173, 175], "compartment": 155, "compat": [18, 144, 148, 154, 155, 156, 157], "compil": [148, 154, 162], "compilergym": 148, "complement": 2, "complet": [1, 4, 11, 17, 128, 129, 155, 157, 158, 160, 163, 166, 167, 175], "complex": [5, 6, 136, 140, 141, 142, 143, 155, 162, 170, 177], "compli": 154, "complianc": 154, "compliant": [154, 159], "complic": [7, 15, 167], "compon": [18, 157, 158, 165], "compos": [9, 137, 138, 139, 140, 141, 143, 146, 147], "composit": 7, "compound": 8, "comprehens": [148, 154], "compris": 7, "comput": [2, 3, 9, 18, 19, 20, 136, 137, 154, 155, 158, 163, 165, 170, 175, 177], "computation": [148, 162], "compute_metr": 9, "concat": 173, "concaten": [5, 8, 11, 12, 13, 136, 155], "conceptu": 7, "concern": 177, "conda": [155, 165], "condit": [2, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 163, 166, 169, 175, 177], "conduct": 155, "config": 4, "configur": [126, 130, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 165, 170, 175], "confirm": [131, 154], "conflict": 9, "conjunct": [9, 13], "connect": [2, 6, 11, 131, 137, 138, 139, 140, 141, 145, 146, 147, 148], "connectionist": 177, "consecut": 129, "consequ": 154, "consid": [4, 128, 129, 130, 136, 137, 139, 140, 143, 147, 165, 167], "consist": [3, 7, 127, 129, 131, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 154, 155, 157, 158, 162, 165, 167, 169, 175, 177], "consol": 4, "consolid": 154, "constant": [3, 129, 134, 137, 140, 141, 142], "constantli": 145, "constitu": 6, "constraint": [140, 141, 142, 148], "construct": [7, 9, 131, 137, 138, 139, 140, 141, 146, 147, 148], "constructor": [4, 9, 15, 154, 155, 157, 165], "consult": 169, "consumpt": 2, "contact": [127, 129, 136, 137, 140, 141, 142, 154], "contact_cost": [137, 140], "contact_cost_rang": 140, "contact_cost_weight": [137, 140, 169], "contact_force_rang": 137, "contain": [1, 2, 5, 8, 10, 13, 17, 18, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 157, 158, 160, 163, 165, 169, 170, 172, 175, 177], "context": [4, 11, 13, 148, 177], "contextu": 148, "continu": [2, 5, 9, 14, 18, 19, 128, 129, 130, 133, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 157, 166, 170, 175, 177], "continuous_mountaincarenv": 134, "contrari": [14, 19], "contrast": 160, "contribut": [126, 148, 155, 172], "contributor": 155, "control": [2, 5, 7, 126, 128, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 157, 160, 167, 169, 170, 173], "conveni": [6, 15, 157, 158, 165], "convent": [1, 2], "converg": [163, 173, 175], "convers": [3, 148, 155, 159], "convert": [1, 5, 6, 14, 18, 20, 128, 129, 148, 154, 155, 158, 159, 173, 175], "convert_to_done_step_api": 9, "convert_to_terminated_truncated_step_api": [9, 159], "convolv": [155, 170, 175], "cool": 155, "coordin": [7, 14, 127, 129, 135, 137, 138, 139, 140, 141, 144, 145, 146, 147, 148], "coordinatesof": 142, "copi": [10, 11, 12, 13, 165], "copier": 165, "core": [154, 155, 157], "correct": [4, 9, 16, 129, 148, 153, 154, 155, 165, 166, 177], "correctli": [2, 9, 14, 18, 19, 154, 155, 165], "correspond": [1, 2, 4, 11, 12, 15, 18, 129, 131, 132, 133, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 153, 158, 160, 165, 170, 173, 177], "correspondingli": 128, "cosin": [131, 142, 145], "cost": [127, 155], "could": [2, 9, 14, 137, 138, 139, 140, 141, 143, 146, 147, 155, 157, 165, 166, 167, 169, 170, 173, 175, 177], "coulom": 146, "count": [7, 127, 129, 150, 163, 175], "counter": [135, 155], "coupl": [157, 158, 170], "cours": [148, 173], "cover": [5, 12, 163, 175], "coverag": 148, "cpu": [129, 136, 162, 170], "craft": 129, "craftium": 148, "crash": [129, 157], "crazyfli": 148, "creat": [4, 5, 7, 9, 10, 11, 12, 13, 14, 17, 127, 128, 129, 145, 148, 154, 155, 157, 162, 167, 169, 170, 175, 177], "create_empty_arrai": [5, 13, 155], "create_grid": 175, "create_plot": 175, "create_se": 154, "create_shared_memori": [5, 13], "creation": [2, 4, 155, 158, 165], "creatur": 148, "critic": [2, 17, 155], "critic_lay": 170, "critic_loss": 170, "critic_losses_moving_averag": 170, "critic_lr": 170, "critic_optim": 170, "critic_weight": 170, "critic_weights_path": 170, "cross": [151, 152], "crucial": 5, "crypto": 148, "cstride": 175, "ctr_cost": 147, "ctrl_cost": [137, 138, 139, 140, 146, 147], "ctrl_cost_weight": [137, 138, 139, 140, 141, 146, 147, 169], "ctx": 13, "cube": 5, "cuda": [148, 170], "cum_reward": 173, "cumsum": 173, "cumul": [9, 14, 17, 20, 173], "cumulative_reward": 9, "current": [2, 4, 7, 9, 10, 17, 18, 128, 148, 150, 151, 152, 153, 154, 155, 158, 162, 163, 165, 170, 173, 175, 177], "current_col": [151, 152], "current_namespac": 4, "current_row": [151, 152], "curriculum": 154, "curv": 154, "custom": [2, 5, 10, 11, 12, 14, 15, 17, 18, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 152, 154, 155, 157, 160, 168, 171], "customis": 148, "customiz": 148, "customspaceerror": 13, "cvel": [140, 141], "cyaneu": 152, "cycl": 157, "cylind": 144, "cython": 155, "d": [7, 9, 15, 131, 148, 170, 172], "dac": 148, "dacbench": [148, 155], "daemon": 11, "dag": 148, "dai": 126, "darkgrid": 177, "data": [5, 8, 9, 10, 15, 148, 154, 155, 158, 160, 161, 162, 165, 167, 170, 173, 175, 177], "data_equival": 155, "data_equivil": 155, "databas": 2, "datafram": [173, 177], "datapoint": 8, "dataset": 155, "datatyp": 154, "david": 173, "davidslayback": 155, "de": 155, "dead": [154, 155], "deal": [6, 170], "dealer": [150, 154, 163, 175], "dealer_count": 175, "debug": [2, 5, 9, 148, 149, 154, 155], "decai": [163, 175], "decay_epsilon": [163, 175], "decent": [163, 175, 177], "decid": [150, 155, 173], "decis": [154, 155, 157, 160, 166], "deck": [150, 163, 175], "declar": 158, "decomposit": 153, "decreas": [129, 162, 173], "deep": [148, 154, 162, 166, 170, 173, 177], "deepmind": [136, 154, 169], "def": [9, 14, 155, 158, 163, 165, 167, 170, 173, 175, 177], "default": [2, 4, 5, 6, 7, 9, 11, 12, 13, 14, 15, 17, 18, 19, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 151, 154, 155, 157, 160, 165, 166, 169], "default_camera_config": [136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "default_rng": 173, "defaultdict": [163, 175], "defin": [2, 3, 5, 8, 11, 12, 14, 15, 16, 17, 18, 135, 154, 155, 157, 158, 165, 166, 167, 170, 173], "definit": [137, 139, 140, 142, 154, 155, 160, 166], "degre": [11, 140, 141, 142, 167, 170], "delai": [18, 20, 155, 165], "delayobserv": [18, 20, 155], "deleu": 155, "deliv": 153, "delta": [173, 177], "demand": 128, "demo": 1, "demonstr": [14, 127, 148], "dennybritz": 151, "denot": [2, 132], "densiti": [144, 155], "depend": [2, 6, 9, 10, 14, 18, 19, 126, 129, 130, 132, 136, 137, 138, 139, 140, 141, 146, 147, 152, 154, 155, 165, 170], "dependencynotinstal": [9, 17], "deprec": [2, 136, 154, 155], "depreci": 154, "deque_s": [14, 17, 163, 170, 175], "deriv": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 177], "derk": 148, "desc": [152, 173], "describ": [2, 5, 6, 131, 132, 146, 150, 154, 155, 157, 160, 167], "descript": [20, 136, 155, 165], "design": [131, 140, 141, 148, 149, 153, 155, 165], "desir": [9, 14, 153, 169, 170], "despit": 155, "destin": 153, "destroi": [129, 155, 157], "detach": 170, "detail": [5, 6, 7, 9, 19, 131, 154, 155, 157, 158, 160, 165, 169, 170], "detect": [155, 165], "determin": [2, 17, 129, 131, 132, 133, 134, 135, 152, 154, 155, 157, 165, 166, 169], "determinist": [129, 131, 133, 134, 158, 165, 170], "dev": [2, 154, 177], "develop": [136, 148, 154, 155, 161, 177], "deviat": [14, 19, 146, 170, 177], "devic": [14, 17, 155, 170], "devicearrai": 17, "dexter": 148, "dexteroushand": 148, "df1": 177, "diagnost": 2, "diagram": 135, "dict": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 18, 20, 136, 151, 152, 153, 154, 155, 157, 158, 160, 165, 177], "dict_info": 14, "dict_kei": 18, "dict_time_kei": 18, "dictat": [128, 129], "dictinfotolist": [10, 14, 20, 155], "dictionari": [2, 5, 6, 7, 9, 10, 14, 18, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 157, 158, 160, 163, 165, 167, 175, 177], "did": [155, 166], "didn": [155, 173], "die": 128, "dietterich": 153, "differ": [2, 5, 6, 9, 11, 12, 16, 17, 18, 128, 135, 136, 147, 148, 154, 155, 157, 158, 160, 161, 165, 166, 167, 169, 170, 173], "differenti": 166, "difficult": [132, 136, 142, 143, 155, 162, 173], "difficulti": 154, "digit": 7, "dim": [140, 141], "dimens": [5, 7, 8, 18, 148, 155, 177], "dimension": [5, 17, 129, 137, 138, 139, 146, 147, 155, 157, 158, 165, 170], "direct": [6, 131, 132, 133, 134, 137, 138, 139, 140, 142, 143, 146, 147, 148, 151, 152, 153, 158, 165, 173, 177], "directli": [2, 6, 10, 15, 17, 131, 136, 157, 158, 163, 165, 175, 177], "directori": [1, 154, 165], "dirhtml": 1, "disabl": [4, 17, 18, 19, 151, 152, 154, 155, 169], "disable_env_check": [4, 16, 17, 154], "disable_logg": 17, "disable_print": 4, "disable_render_order_enforc": [17, 154], "disallow": 153, "disc_to_cont": 167, "discard": 18, "discord": [148, 154, 155, 172], "discount": [14, 19, 163, 166, 170, 173, 175, 177], "discount_factor": [163, 175], "discourag": 154, "discov": [155, 169], "discret": [2, 5, 6, 7, 8, 11, 13, 18, 128, 129, 131, 132, 133, 134, 148, 149, 150, 151, 152, 153, 154, 155, 157, 158, 163, 165, 167, 175], "discrete_act": 165, "discreteact": 167, "discreteenv": 154, "discuss": [154, 155], "displac": [137, 138, 139, 140, 146, 147], "displai": [2, 4, 132, 154, 165], "dissect": 173, "dist": 144, "distanc": [138, 139, 141, 144, 147, 158, 165], "distance_from_origin": [137, 140], "distance_penalti": 142, "distinct": 166, "distinguish": [2, 165, 166], "distrib": 177, "distribut": [5, 6, 7, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 152, 154, 169, 170, 173, 177], "divers": 156, "divid": [14, 19], "do": [4, 13, 15, 17, 128, 129, 131, 154, 155, 158, 161, 165, 167, 169, 170, 173, 177], "doc": [9, 129, 154, 155, 167], "dockerfil": 154, "docstr": [1, 154, 155], "doctest": 155, "document": [5, 7, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 130, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 154, 155, 157, 158, 163, 165, 169, 170, 175], "doe": [2, 6, 8, 9, 10, 11, 12, 13, 14, 17, 18, 19, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 154, 155, 158, 165, 166, 167, 170], "doesn": [1, 2, 4, 15, 17, 129, 154, 155, 158, 166, 173], "dof": [137, 140, 141], "dog": 9, "doi": [153, 155], "domain": [16, 128, 131, 133, 134, 153, 167, 168, 171], "domain_random": 128, "don": [6, 14, 15, 128, 133, 154, 155, 158, 160, 165, 166, 170, 173], "done": [2, 9, 14, 154, 155, 157, 159, 160, 163, 165, 166, 169, 170, 173, 175, 177], "donesteptyp": 9, "doom": 148, "dot": [6, 7, 165], "doubl": 136, "doublependulum": 142, "down": [5, 7, 128, 141, 148, 150, 151, 152, 153, 155, 158, 163, 165, 166, 173, 175], "download": [157, 165, 166, 167, 170, 171, 173, 175, 177], "downward": 131, "dramat": [154, 155], "draw": [6, 150, 163, 165, 175], "drawn": [6, 7, 144, 146, 150], "drive": 128, "drone": 148, "drop": [153, 155], "drryanhuang": 155, "dsctt": 154, "dstack": 175, "dt": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 169], "dtype": [2, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 20, 129, 131, 132, 133, 134, 135, 154, 155, 158, 165, 173], "dtypelik": 5, "dtypeobserv": [14, 18, 20, 155], "dual": 148, "duburcqa": 154, "duckietown": 148, "due": [2, 129, 132, 136, 152, 154, 155, 160, 163, 166, 170], "dump": 155, "dunbar": 155, "dunder": 155, "duplic": [14, 155], "durat": [9, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 177], "dure": [2, 4, 6, 7, 10, 14, 17, 18, 19, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 158, 165, 167, 169], "dw": 9, "dx": [137, 138, 139, 140, 146, 147], "dylwil3": 155, "dynam": [2, 131, 135, 136, 142, 148, 154, 155], "e": [1, 2, 4, 5, 7, 8, 9, 11, 13, 14, 15, 17, 129, 137, 138, 139, 140, 141, 143, 144, 145, 146, 147, 148, 150, 152, 154, 155, 158, 163, 165, 167, 170, 173, 175, 177], "each": [2, 5, 6, 7, 9, 10, 11, 12, 13, 14, 16, 17, 18, 20, 126, 127, 129, 130, 131, 132, 133, 134, 136, 137, 140, 141, 142, 143, 144, 145, 148, 149, 151, 153, 154, 155, 157, 158, 161, 163, 165, 170, 173, 175], "earli": [126, 163, 175, 177], "earlier": [137, 140, 141, 158, 165], "earliest": 177, "easi": [148, 149, 154, 157, 173], "easier": [130, 154, 155, 170], "easiest": [128, 154, 155], "easili": [6, 10, 15, 148, 154, 155, 162], "east": 153, "ed": 131, "edg": [5, 6, 8, 132, 154, 155, 157], "edge_link": [6, 8], "edge_mask": 6, "edge_spac": [6, 8], "edgecolor": 175, "edit": [148, 154, 165], "effect": [2, 8, 13, 15, 17, 129, 137, 141, 142, 143, 144, 145, 146, 153, 157, 163, 175, 177], "effector": [144, 145], "effici": [11, 133, 134, 136, 148, 155, 162, 170], "effort": [130, 155], "eg": 165, "egl": 136, "eight": 137, "einforc": 177, "either": [7, 8, 9, 12, 17, 133, 134, 148, 150, 152, 157, 159, 170, 173, 177], "ekalosak": [154, 155], "elaps": [14, 17], "elbow": 144, "electron": 148, "element": [2, 5, 6, 7, 10, 11, 13, 18, 132, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 157, 158, 165, 169, 170, 177], "elev": 169, "eleven": 136, "elf": 152, "elliottow": 155, "elo": 148, "els": [14, 18, 139, 147, 152, 155, 158, 163, 165, 170, 173, 175], "emerg": 155, "emit": 2, "emploi": 17, "empti": [5, 7, 13, 17, 137, 138, 139, 140, 141, 146, 147, 155, 163, 173, 175, 177], "emul": [154, 160], "en": [18, 177], "enabl": [4, 9, 17, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 128, 148, 154, 155], "enable_wind": [129, 170], "encapsul": 2, "encod": [8, 148, 153, 154, 158, 165], "encompass": 148, "encount": 170, "encourag": [154, 169, 170], "end": [2, 3, 10, 14, 17, 127, 135, 136, 154, 155, 156, 157, 158, 161, 163, 166, 169, 170, 173, 175, 177], "endeavor": 148, "enemi": 157, "energi": [127, 129, 132, 167], "enforc": [4, 17, 154], "engin": [129, 136, 148, 155], "english": 7, "engstrom": [14, 19], "enough": [169, 170], "ensur": [1, 2, 4, 6, 8, 9, 129, 148, 155, 157, 163, 165, 175], "ent_coef": 170, "enter": [2, 9, 129, 151], "entir": [9, 17, 139, 147, 154, 155, 170], "entiti": 5, "entrain": 148, "entri": [4, 154, 155, 158, 160], "entropi": [2, 170], "entropy_moving_averag": 170, "entry_point": [4, 154, 158, 165], "entrypoint": 165, "enum": [4, 165], "enumer": 173, "env": [4, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 163, 166, 167, 168, 169, 171, 173, 175, 177], "env_check": [9, 154, 155], "env_fn": [11, 12], "env_id": [4, 155, 159], "env_lambda": 9, "env_nam": [4, 159], "env_new": 9, "env_old": 9, "env_spec": 155, "env_typ": 1, "envapicompat": 154, "envcompat": [154, 159], "envcreat": 4, "environ": [4, 5, 7, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 126, 127, 128, 129, 130, 131, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 149, 150, 151, 152, 153, 154, 155, 156, 161, 163, 166, 167, 168, 171, 177], "environment_cr": 165, "environmentnam": 155, "envpool": [154, 155], "envs_wrapp": 170, "envspec": [2, 3, 4, 10, 15, 17, 155], "ep": [173, 177], "ep_action_log_prob": 170, "ep_reward": 170, "ep_value_pr": 170, "episod": [2, 4, 9, 10, 14, 17, 18, 19, 20, 154, 155, 156, 157, 158, 160, 163, 165, 166, 169, 170, 173, 175, 177], "episode_id": 9, "episode_index": 9, "episode_num": 161, "episode_ov": [157, 161], "episode_returns_moving_averag": 170, "episode_reward": [14, 19], "episode_trigg": [9, 17, 161], "episodestatisticsrecord": 155, "epoch": 170, "eprint": 155, "epsilon": [14, 18, 19, 163, 173, 175], "epsilon_decai": [163, 175], "epsilongreedi": 173, "equal": [11, 12, 129, 133, 134, 139, 144, 147, 150, 152, 158, 173], "equat": [131, 135], "equival": [4, 5, 8, 10, 14, 137, 140, 154, 155, 160, 170], "erez": [139, 140, 141], "ericonaldo": 154, "error": [2, 4, 7, 8, 9, 11, 17, 20, 154, 155, 166, 170, 175], "errorr": 154, "escap": 2, "especi": [5, 165, 167, 170], "essenti": [7, 169], "estim": [19, 137, 166, 170, 177], "et": [14, 16, 17, 19], "etc": [2, 5, 10, 137, 138, 139, 140, 146, 147, 154, 155, 157, 158, 165, 166, 167], "euclidean": [144, 145], "ev": 148, "ev2gym": 148, "eval": [161, 170], "evalu": [14, 17, 18, 19, 148, 155, 161, 163, 175], "evanmath": 154, "even": [2, 4, 137, 139, 140, 142, 143, 147, 154, 165, 175], "event": [9, 154, 165], "eventu": 13, "ever": [126, 154], "everi": [1, 5, 6, 9, 14, 17, 18, 19, 128, 129, 132, 137, 139, 140, 142, 143, 147, 155, 157, 165, 170, 177], "everyon": 136, "everyth": [17, 148], "evolv": 9, "exact": [150, 155, 175], "exactli": [8, 146], "exampl": [1, 2, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 128, 131, 136, 148, 150, 151, 152, 154, 155, 157, 158, 161, 163, 165, 166, 167, 170, 171, 175], "examplefuncenv": 155, "exce": [127, 150], "exceed": [2, 17, 20, 157, 163, 175], "excel": 169, "except": [2, 154, 155, 165], "exchang": 157, "excit": 155, "exclud": [4, 17, 20, 137, 138, 139, 140, 141, 142, 145, 146, 147, 153], "exclude_current_positions_from_observ": [137, 138, 139, 140, 141, 146, 147, 169], "exclude_namespac": 4, "exclus": [6, 136, 140, 141, 155], "execut": [1, 9, 16, 153, 157, 165, 170, 177], "exhibit": 136, "exist": [2, 4, 5, 10, 11, 12, 15, 16, 17, 18, 19, 137, 138, 139, 140, 141, 143, 146, 147, 154, 155, 157, 162, 170, 173], "exist_ok": 173, "exit": [9, 10, 11, 12], "exp": 177, "expand": [5, 7, 160], "expect": [2, 4, 5, 6, 7, 154, 155, 157, 169, 170, 173, 177], "expens": [5, 154], "experi": [17, 136, 137, 140, 141, 142, 143, 148, 163, 170, 173, 175, 177], "experiment": [3, 131, 155], "explain": [154, 158], "explan": [157, 160, 163, 175, 177], "explanatori": [163, 175], "explicit": 155, "explicitli": [3, 165, 166, 169], "exploit": [163, 173, 175], "explor": [2, 5, 148, 155, 162, 163, 170, 173, 175], "explor_exploit_tradeoff": 173, "exponenti": [7, 14, 19, 20], "export": 155, "expos": [3, 155], "express": [5, 9, 137, 140, 141], "ext_control": 6, "extend": [2, 17, 137], "extens": [148, 155, 177], "extent": 152, "extern": [2, 137, 140, 141, 154, 155, 160, 166], "extra": [154, 155, 160, 163, 165, 175], "extract": [9, 155, 173], "extrem": [148, 149], "ez": 154, "ezpickl": [4, 9, 15], "f": [161, 170, 173, 175, 177], "f_": [137, 140, 141], "face": [137, 140, 141, 150, 163, 175], "facecolor": 175, "facedown": 150, "facilit": 136, "fact": 177, "factor": [14, 19, 129, 163, 170, 175, 177], "factori": 4, "fail": 155, "failur": 166, "fair": 7, "fairli": [154, 155, 177], "fall": [127, 140, 151, 152, 166, 169, 173], "fallen": [139, 169], "fals": [4, 6, 9, 10, 11, 12, 14, 16, 17, 18, 19, 128, 129, 132, 137, 138, 139, 140, 141, 146, 147, 150, 154, 155, 157, 158, 160, 161, 163, 165, 169, 170, 173, 175, 177], "familiar": 169, "famou": 163, "far": [127, 128, 141, 142, 144, 145, 152], "farama": [21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 148, 155, 163, 165, 170, 172, 175, 177], "fast": [128, 136, 138, 140, 142, 146, 148], "faster": [127, 129, 137, 140, 141, 142, 145, 155, 170], "favor": [2, 154, 155], "favorit": [163, 175], "favour": [2, 154, 155, 159, 160], "featur": [6, 7, 18, 136, 154, 155, 170], "feature_spac": 6, "fed": 170, "feed": 170, "feedback": 155, "feel": [169, 177], "feet": [138, 147], "few": [1, 131, 137, 138, 139, 140, 141, 143, 146, 147, 154, 155, 173, 175], "ffffffff": 152, "fffffhff": 152, "fffh": 152, "fffhffff": 152, "fffhfffg": 152, "ffoot": 138, "ffset": 177, "fhffhfhf": 152, "fhfh": 152, "fhhfffhf": 152, "fibonacci": 177, "fickl": 153, "fidel": 148, "field": [155, 162], "fieteo": 154, "fig": [170, 173, 175], "fig1": 175, "fig2": 175, "figaspect": 175, "fight": 148, "figsiz": [170, 173, 175, 177], "figur": [139, 155, 173, 175, 177], "file": [1, 131, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 165, 169], "filenam": [9, 17], "fill": 165, "filter": [14, 18, 20], "filter_kei": [14, 18], "filterobserv": [14, 18, 20], "filterobservationwrapp": 18, "final": [3, 9, 18, 154, 155, 161, 163, 165, 166, 169, 175], "final_epsilon": [163, 175], "final_info": [14, 17, 154, 155], "final_observ": [14, 17, 154, 155], "find": [2, 4, 155, 162, 165, 169, 173], "find_highest_vers": 4, "fine": [165, 170], "finetun": 148, "finfo": 173, "fingertip": [144, 145], "finish": [2, 128, 129, 154, 155, 165, 170], "finit": [6, 7, 137, 143, 147, 166, 167, 177], "fire": 129, "firefoxmetzg": 154, "firmwar": 148, "first": [2, 4, 6, 9, 10, 11, 12, 15, 129, 131, 133, 134, 137, 138, 139, 140, 141, 142, 145, 146, 147, 151, 153, 154, 155, 156, 157, 158, 160, 161, 163, 165, 173, 175, 177], "five": [130, 154, 177], "fix": [5, 6, 7, 17, 20, 129, 131, 132, 135, 136, 137, 138, 139, 140, 142, 143, 144, 145, 147, 148, 150, 152, 154, 155, 157, 158, 160, 165, 177], "fixtur": 145, "flag": [11, 133, 136, 148], "flake8": [154, 155], "flappi": 148, "flat": [5, 6, 7, 8], "flatdim": [5, 8, 154], "flatt": 170, "flatten": [5, 8, 14, 18, 20, 136, 154, 155, 158, 165, 170, 173, 175], "flatten_spac": [5, 8], "flattenobserv": [6, 14, 18, 20, 157, 158, 165], "flesh": 3, "flex": 144, "flexibl": [11, 148, 155], "flight": 148, "float": [4, 7, 8, 9, 14, 16, 17, 18, 19, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 163, 165, 170, 173, 175, 177], "float32": [2, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 18, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155], "float64": [8, 10, 14, 18, 19, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "flow": 148, "fluid": 146, "fly": [129, 160], "flycraft": 148, "fmt": 173, "fn": 13, "focu": [148, 170], "focus": 172, "foddi": 148, "folder": [1, 9, 17, 155, 161, 173], "follow": [1, 3, 4, 5, 9, 10, 14, 16, 17, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 129, 131, 132, 133, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 158, 161, 165, 166, 167, 169, 170, 173, 175, 177], "fontsiz": [173, 175], "foot": [11, 138, 139, 140, 141, 147], "foot_joint": [139, 147], "foot_left_joint": 147, "forc": [129, 132, 133, 134, 136, 137, 140, 141, 142, 143, 147, 154, 169, 177], "forcibli": 166, "forearm": [140, 141, 144], "forev": 166, "forex": 148, "forg": 165, "forget": [14, 15, 165], "fork": [1, 155, 156, 160], "form": [5, 6, 7, 17, 131, 146, 155, 165], "formal": 166, "format": [1, 4, 5, 9, 14, 155, 157, 169], "formerli": 155, "forum": [14, 19], "forward": [13, 19, 127, 137, 138, 139, 140, 141, 146, 147, 154, 155, 170, 177], "forward_reward": [137, 138, 139, 140, 146, 147, 169], "forward_reward_weight": [137, 138, 139, 140, 146, 147, 169], "found": [20, 136, 142, 154, 155, 163, 175, 177], "foundat": [148, 155, 165, 172], "four": [128, 129, 137, 139, 148, 153, 155, 157, 158, 163, 175], "foxik": 154, "fp": [2, 9, 10, 17, 129, 155, 165], "frac": [137, 138, 139, 140, 146, 147], "frame": [2, 9, 10, 17, 18, 20, 128, 129, 144, 145, 154, 155, 169, 173], "frame_skip": [17, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 169], "framer": 165, "frameskip": 17, "frameskipobserv": 155, "framestack": [17, 18, 154, 155], "framestackobserv": [17, 18, 20, 155], "frametim": [137, 138, 139, 140, 141, 146, 147], "framework": [136, 148, 155, 169], "franuka": 152, "free": [131, 135, 136, 137, 140, 141, 142, 143, 148, 163, 169, 175, 177], "free_body_rot": 146, "freedom": [140, 141, 142, 167], "freejoint": [140, 141], "freez": [14, 18, 19], "frequenc": 17, "frequent": [154, 166], "friction": [127, 146, 147, 148], "frictionless": 132, "from": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 127, 128, 129, 131, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 157, 158, 160, 161, 162, 163, 166, 169, 170, 173, 175, 177], "from_json": [5, 155], "front": [137, 138, 146, 147], "front_left_leg": 137, "front_right_leg": 137, "frontpag": 155, "frozen": [149, 154, 155, 173], "frozen_lak": [152, 173], "frozenlak": [9, 152, 154, 155, 171, 176], "frozenlake4x4": 155, "frozenlake8x8": [152, 154, 155], "frozenlake_q_values_": 173, "frozenlake_states_actions_distrib_": 173, "frozenlake_steps_and_reward": 173, "frozenlake_tuto": 173, "frozenset": 7, "fshin": 138, "fthigh": 138, "fuel": 129, "full": [3, 4, 128, 129, 154, 155, 157, 158, 163, 175], "fulli": [2, 129, 137, 138, 139, 140, 141, 143, 146, 147, 148, 154], "fun": [175, 177], "func": [3, 14, 16, 18, 19, 155], "func_env": 3, "funcenv": [3, 155], "funcjaxenv": 155, "function": [2, 8, 10, 11, 12, 14, 16, 17, 18, 19, 20, 129, 132, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 152, 153, 154, 155, 157, 159, 160, 163, 166, 167, 169, 170, 172, 173, 175, 177], "functional_jax_env": 3, "functionaljaxenv": 3, "furcolor": 9, "further": [2, 5, 7, 129, 144, 145, 154, 155, 160, 173, 177], "furthermor": [2, 9, 155, 157], "furthest": 127, "futur": [3, 154, 155], "future_q_valu": [163, 175], "g": [1, 2, 4, 5, 7, 8, 9, 11, 12, 13, 131, 135, 137, 138, 139, 140, 141, 143, 146, 147, 148, 152, 153, 154, 155, 158, 163, 165, 167, 170, 173, 175, 177], "ga": 128, "gae": 170, "gain": [155, 167], "galleri": [1, 155], "game": [7, 9, 17, 126, 150, 151, 152, 154, 155, 157, 158, 163, 165, 169, 175], "gamma": [14, 19, 166, 170, 173, 177], "gap": 148, "garbag": [10, 11, 12], "gather": 165, "gaussian": 136, "gcc": 148, "gen_gif": 1, "gen_md": 1, "gener": [1, 2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 13, 15, 17, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 161, 162, 170], "generalis": [2, 10], "generate_random_map": [152, 173], "geometr": [6, 136], "geometri": 155, "get": [2, 4, 11, 12, 14, 15, 18, 19, 127, 129, 136, 137, 139, 140, 142, 150, 154, 155, 157, 163, 165, 166, 169, 170, 173, 175], "get_act": [163, 175], "get_attr": [11, 12, 155], "get_env_id": 4, "get_loss": 170, "get_rect": 165, "get_stat": 154, "get_wrapper_attr": [15, 155], "gianluca": 155, "gianlucadecola": [154, 155], "gif": [1, 131, 155], "git": [165, 169], "github": [19, 129, 132, 136, 137, 139, 140, 141, 142, 143, 144, 145, 147, 151, 154, 155, 165, 169], "give": [2, 10, 129, 137, 150, 153, 154, 163, 170, 175, 177], "given": [3, 4, 7, 16, 127, 129, 130, 131, 132, 133, 134, 137, 139, 140, 142, 143, 147, 154, 155, 157, 161, 166, 175], "glfw": 136, "global": 4, "gnn": 148, "go": [2, 9, 128, 148, 150, 154, 158, 162, 163, 166, 169, 170, 175, 177], "go1": 169, "goal": [2, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 158, 165, 177], "goal_slidei": 144, "goal_slidex": 144, "goal_veloc": [133, 134], "goalenv": 160, "goe": [150, 154], "gone": 154, "good": [163, 170, 173, 175], "googl": [154, 169], "goul\u00e3o": 155, "gpu": [136, 148, 155, 162, 170], "gracefulli": 154, "gradient": [170, 177], "grai": 17, "grammat": 155, "grant": 129, "graph": [5, 6, 8, 13, 148, 154, 155, 157, 170, 175], "graphic": [136, 154], "graphinst": [6, 8, 13, 154, 155], "grave": 170, "gravit": 129, "graviti": [129, 132, 133, 135, 142, 143, 148, 170, 177], "grayscal": [14, 17, 18, 20], "grayscale_env": 18, "grayscale_newaxi": 17, "grayscale_ob": 17, "grayscaleobserv": [14, 18, 20], "grd": 158, "great": [148, 167, 175], "greater": [129, 131, 132, 133, 134, 143, 150, 163, 175], "greedi": [163, 173, 175], "greedili": [163, 175], "green": [131, 153], "grei": 175, "greyscal": 17, "grid": [148, 151, 152, 153, 158, 165, 175], "grid_world": 165, "gridlin": 165, "gridworld": [2, 148, 151, 158, 165], "gridworldenv": [158, 165], "grip": [163, 175], "ground": [127, 129, 141, 148], "group": [129, 154], "gt": [154, 155], "guarante": [5, 6, 13, 17, 148, 172], "guess": 154, "guid": [148, 155, 157, 158, 169], "guidelin": 17, "guyazran": 155, "gym": [2, 4, 5, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 126, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 155, 156, 157, 158, 160, 161, 163, 165, 166, 167, 170, 173, 175, 177], "gymenv": 159, "gymfc": 148, "gymlibrari": 154, "gymnasium": [2, 3, 4, 5, 6, 7, 8, 9, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 156, 157, 158, 159, 160, 161, 162, 163, 166, 169, 170, 173, 175, 177], "gymnasium_env": [158, 165], "gymnax": 148, "gymv21compat": 155, "gymv21environ": 159, "gymv22compat": 155, "gymv22environ": 155, "gymv26environ": 159, "gyroscop": 128, "h5": 170, "ha": [1, 2, 4, 5, 7, 8, 9, 10, 13, 14, 17, 18, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 139, 140, 141, 144, 145, 146, 148, 150, 154, 155, 156, 157, 158, 160, 163, 165, 166, 169, 170, 173, 175, 177], "hacki": 13, "had": [2, 154, 155], "half": 136, "half_cheetah": 138, "halfcheetah": [14, 136, 138], "halt": 166, "hand": [148, 150, 177], "handl": [5, 10, 11, 12, 142, 150, 154, 155, 168, 171, 175], "handling_time_limit": 166, "hang": [13, 131, 154], "hannah": 155, "happen": [133, 134, 137, 139, 140, 142, 143, 147, 150, 152, 153, 154, 155, 158, 160, 166], "haracterist": 177, "hard": [155, 173], "hardcor": 127, "harden": [154, 155], "harder": [129, 169], "hardwar": [148, 154, 155], "harlei": 170, "harm": 175, "has_autoreset": 10, "hasattr": 155, "hash_se": 154, "hasselmo": 131, "hatchl": 165, "have": [2, 4, 5, 7, 8, 10, 11, 13, 14, 15, 16, 17, 18, 19, 126, 127, 128, 131, 137, 140, 141, 142, 143, 144, 145, 147, 150, 152, 153, 154, 155, 157, 158, 159, 160, 163, 165, 166, 167, 169, 170, 173, 175, 177], "haven": 17, "head": [11, 138], "headless": [136, 154], "healthi": [137, 139, 140, 142, 143, 147], "healthy_angle_rang": [139, 147], "healthy_reward": [137, 139, 140, 142, 143, 147, 169], "healthy_state_rang": 139, "healthy_z_rang": [137, 139, 140, 147, 169], "heatmap": [173, 175], "height": [131, 136, 137, 139, 140, 142, 147, 154, 169], "held": 150, "hello": 7, "help": [2, 4, 5, 9, 154, 155, 157, 161, 163, 170, 175], "helper": 154, "helpingstar": 155, "henc": [137, 149, 177], "here": [3, 9, 131, 137, 140, 154, 155, 158, 163, 165, 169, 170], "heurist": [127, 129, 148, 155], "hffg": 152, "hidden": [2, 177], "hidden_space1": 177, "hidden_space2": 177, "hierarch": [5, 153], "high": [2, 6, 7, 8, 11, 12, 13, 14, 18, 131, 132, 134, 135, 137, 148, 154, 155, 157, 167, 169, 170], "higher": [19, 127, 144, 169, 170], "highest": 4, "highli": [5, 9, 126, 130, 148, 154, 160, 165], "highwayenv": [148, 155], "hill": [133, 134], "hing": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "hinge2": 142, "hint": [154, 155], "hinter": 154, "hip": [127, 137, 140, 141], "hip_1": 137, "hip_2": 137, "hip_3": 137, "hip_4": 137, "histor": 148, "histori": 136, "histplot": 173, "hit": [150, 163, 175], "hold": [5, 150, 163, 175], "hole": [152, 173], "homepag": 155, "hood": 155, "hook": [154, 155], "hooper": 155, "hop": [136, 139, 147], "hope": 155, "hopefulli": [154, 163, 175], "hopper": [15, 16, 136, 147, 155], "hopper_v4": 15, "hopperenv": 15, "horizon": [9, 139, 148, 166], "horizon_timestep": 9, "horizont": [127, 129, 131, 158, 165], "host": 155, "hot": 8, "hotter": 154, "how": [2, 5, 9, 11, 12, 14, 141, 142, 144, 145, 148, 155, 157, 158, 161, 163, 165, 166, 167, 169, 170, 175, 177], "howardh": 155, "howev": [2, 5, 6, 7, 8, 9, 10, 11, 13, 16, 17, 131, 137, 138, 139, 140, 141, 145, 146, 147, 154, 155, 157, 158, 160, 165, 166, 167, 170, 175], "hpo": 172, "html": [18, 163, 177], "http": [1, 2, 14, 18, 19, 128, 131, 150, 151, 152, 154, 155, 163, 165, 169, 170, 173, 175, 177], "hue": 173, "huge": [148, 154], "hull": 127, "human": [2, 6, 9, 17, 20, 128, 140, 141, 144, 148, 154, 155, 156, 157, 159, 160, 165, 170], "humanoid": [136, 155, 169], "humanoidstandup": [136, 141], "humanoidstandup_v4": 154, "humanrend": [17, 20, 155], "humnaoid": 169, "hundr": 161, "hundredth": 17, "hyperparam": 170, "hyperparamet": [163, 170, 172, 173, 175, 177], "i": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 126, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 165, 166, 167, 169, 170, 173, 175, 177], "i_": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "ianyfan": 155, "ic": 152, "ice1187": 154, "iclr": 170, "icml": 170, "icu": 148, "id": [4, 14, 17, 19, 136, 137, 140, 141, 152, 155, 158, 165], "idea": [163, 166, 175], "ident": [7, 11, 12, 136, 155], "identifi": 154, "idx": 173, "if__name__": 170, "iff": [9, 17, 165], "iforce2d": 128, "igibson": 148, "ignor": [1, 9, 150, 154, 163, 175], "ikamensh": [154, 155], "illeg": 153, "illustr": [14, 165, 166], "im": 143, "imag": [2, 5, 10, 11, 12, 14, 17, 18, 20, 128, 136], "imagesequenceclip": 9, "imagin": [148, 157, 158, 167], "img": 173, "img_titl": 173, "immedi": [9, 20, 137, 139, 140, 141, 147, 150, 158, 165], "immut": 5, "impact": [141, 173], "impact_cost": 141, "impact_cost_rang": 141, "impact_cost_weight": 141, "implement": [5, 6, 9, 10, 15, 16, 17, 20, 129, 132, 135, 142, 148, 149, 153, 154, 155, 157, 158, 159, 160, 161, 163, 165, 168, 170, 171, 177], "implementing_custom_wrapp": 167, "implicitli": 170, "import": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 163, 165, 167, 169, 170, 173, 177], "importantli": [15, 155, 157], "importerror": 155, "improv": [11, 154, 155, 162], "imshow": 173, "includ": [2, 4, 14, 17, 18, 20, 132, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 153, 154, 155, 157, 158, 159, 165, 166, 169], "include_cfrc_ext_in_observ": [137, 140, 141, 169], "include_cinert_in_observ": [140, 141], "include_cvel_in_observ": [140, 141], "include_qfrc_actuator_in_observ": [140, 141], "inclus": [137, 154], "incompleteidea": [150, 151, 163], "incorrect": [153, 154, 155, 166], "incorrectli": [19, 155, 166], "increas": [129, 131, 132, 136, 139, 144, 146, 147, 155, 162, 163, 170, 173, 175], "incur": [131, 151, 153], "independ": [6, 7, 10, 129, 139, 146, 147, 155], "index": [1, 6, 9, 13, 14, 17, 18, 20, 132, 157], "indic": [2, 9, 14, 17, 128, 131, 132, 150, 151, 152, 153, 154, 160, 163, 166, 173, 175, 177], "individu": [2, 11, 12, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 154, 158, 161, 165, 167, 177], "indoril007": 154, "induc": [137, 138, 139, 140, 141, 146, 147], "induct": [137, 138, 139, 140, 141, 146, 147], "ineffici": [155, 162], "inelast": [133, 134], "inertia": [140, 141], "inf": [2, 10, 16, 18, 132, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 167, 169], "infam": [163, 175], "infeas": 7, "infer": [7, 154], "infinit": [16, 17, 18, 129, 139, 150, 154, 155, 163, 166, 175], "info": [2, 3, 9, 10, 11, 12, 14, 17, 18, 19, 20, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 156, 157, 158, 160, 161, 163, 165, 166, 167, 170, 173, 175, 177], "inform": [1, 2, 5, 6, 7, 9, 10, 12, 14, 18, 19, 20, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 158, 160, 163, 165, 166, 169, 175, 177], "infti": [7, 139], "inher": 166, "inherit": [4, 5, 15, 16, 18, 19, 154, 155, 158, 165], "init": [128, 165, 170], "initi": [2, 3, 4, 6, 7, 9, 10, 14, 15, 16, 17, 18, 19, 127, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 158, 163, 167, 169, 170, 173, 175, 177], "initial_epsilon": [163, 175], "initial_sampl": 155, "initialis": [2, 4, 10, 17, 150, 155, 156, 160], "inlin": 170, "inner": 11, "inner_st": 6, "inplac": 177, "input": [9, 10, 155, 157, 158, 163, 170, 175], "inregular": 17, "insert": [156, 177], "insid": [1, 6, 7, 17, 153, 158, 165], "inspect": 9, "inspir": [151, 155, 173], "instabl": 162, "instal": [1, 9, 17, 126, 130, 136, 154, 155, 157, 165, 169], "instanc": [2, 3, 4, 5, 6, 8, 9, 10, 11, 13, 17, 18, 155, 157, 158, 162, 167, 170], "instanti": [9, 14, 17, 18, 19, 158, 165, 173], "instead": [2, 10, 11, 14, 19, 127, 131, 137, 140, 147, 154, 157, 160, 165, 170], "institut": [133, 134], "instruct": [136, 155, 158, 165], "int": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 136, 137, 150, 151, 152, 153, 155, 158, 163, 165, 170, 173, 175, 177], "int23": 17, "int32": [6, 8, 10, 14, 18, 158], "int64": [6, 7, 8, 155, 158], "int8": [5, 7], "int_32": 17, "integ": [2, 6, 7, 8, 9, 14, 16, 17, 18, 154, 155, 158, 160, 165], "integr": [9, 155, 169], "intellig": [153, 154], "intend": [3, 14, 17, 148, 151, 152, 172, 173], "intens": 148, "interact": [5, 9, 14, 17, 20, 148, 163, 165, 170, 175], "interest": [18, 155, 161, 169], "interestingli": 173, "interfac": [9, 155, 156, 157], "interlink": 157, "intermedi": [140, 141], "intermitt": 17, "intern": [2, 3, 10, 154, 155, 158, 161, 163], "internet": 148, "interoper": 148, "interv": [7, 137, 139, 140, 147], "intrins": 167, "introduc": [137, 140, 141, 148, 154, 155, 159, 160], "introduct": [131, 150, 151, 159, 173], "introductori": 155, "invalid": [2, 7, 155, 169, 175], "invers": 4, "invert": [135, 136, 177], "inverted_double_pendulum": 142, "inverted_pendulum": [143, 177], "inverteddoublependulum": [136, 142], "invertedpendulum": [136, 143, 177], "investig": 155, "involv": [6, 126, 142, 148, 151, 152, 153, 154, 155, 165, 177], "io": [18, 152], "ipynb": [165, 166, 167, 170, 173, 175, 177], "irrespect": 9, "is_avail": 170, "is_bound": 7, "is_np_flatten": [5, 155], "is_slipperi": [151, 152, 155, 173], "is_slippi": 155, "is_vector_env": 9, "isaac": 148, "isinst": 155, "isn": [6, 9, 155, 158], "isort": 155, "issu": [17, 18, 129, 132, 136, 137, 139, 140, 141, 142, 143, 144, 145, 147, 154, 155, 157, 160, 165], "itch": 152, "item": [9, 13, 18, 170, 173, 175], "iter": [5, 6, 12, 13, 155], "its": [2, 9, 16, 17, 129, 135, 137, 139, 141, 144, 147, 148, 155, 157, 170, 173, 177], "itself": [3, 8, 10, 11, 12, 15, 17, 154, 155, 157, 159, 170], "j": [155, 170, 177], "jack": 150, "jair": 153, "jamartinh": 155, "jax": [2, 14, 17, 20, 148, 155, 162], "jaxenv": [14, 17], "jaxtonumpi": [14, 17, 20, 155], "jaxtotorch": [14, 17, 20, 155], "jessefarebro": 154, "jet": 155, "jfpettit": 154, "jimini": 148, "jin": 155, "jit": 162, "jjshoot": [154, 155], "jkterry1": 154, "jnp": 17, "job": [148, 165], "job_statu": 6, "john": 155, "johnminelli": 154, "join": [5, 155], "joint": [127, 129, 131, 132, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "joint0": 145, "joint1": 145, "jordan": [137, 155, 170], "journal": 153, "journei": [154, 163, 175], "json": 155, "json_env_spec": 155, "jsonabl": 5, "jump": 169, "jumpi": 155, "jun": 155, "junk": 14, "jupyt": [1, 154, 165, 166, 167, 170, 171, 173, 175, 177], "just": [3, 13, 15, 143, 145, 154, 155, 157, 162, 165, 169, 170], "k": [8, 9, 14, 129, 146, 155, 170], "kad99kev": 155, "kallinteri": [155, 169], "karp": 170, "kavukcuoglu": 170, "kde": 173, "keep": [9, 14, 17, 18, 20, 132, 141, 143, 165], "keep_dim": [14, 18], "kei": [2, 4, 5, 6, 7, 9, 10, 14, 17, 18, 20, 137, 138, 139, 140, 141, 146, 147, 151, 152, 153, 154, 155, 157, 160, 163, 165, 166, 167, 173, 175], "kellycoinflip": 154, "kept": [10, 136, 169], "kexianshen": 154, "key_to_act": 9, "keyb0ardninja": 155, "keyboard": [7, 9, 154], "keys_to_act": [9, 154, 170], "keyword": [4, 6, 10, 11, 12, 14, 131, 132, 150, 154, 155, 157, 158, 165, 175], "kg": [154, 155], "khoda81": 155, "kind": 7, "king": 150, "kir0ul": [154, 155], "klimov": [126, 127, 128, 129], "knee": [127, 140, 141], "know": [2, 10, 155, 157, 161, 162, 165], "knowledg": 4, "known": 2, "krimmel": 155, "kvrban": 155, "kwarg": [3, 4, 9, 10, 11, 12, 14, 15, 137, 138, 139, 140, 146, 147, 155, 159, 165], "kwiatkowski": 155, "l": [14, 17, 146, 155], "l2r": 148, "lab_black": 173, "label": [173, 175], "laboratori": 148, "lack": 155, "ladder": 127, "lai": 141, "lake": [149, 154, 155], "lam": 170, "lambaactionv0": 155, "lambda": [11, 12, 14, 16, 17, 18, 19, 155, 161, 163, 170, 175], "lambdaact": 14, "land": [129, 157], "lander": [126, 154, 155], "lane": 148, "lap": [128, 154], "lap_complete_perc": 128, "larg": [11, 129, 134, 137, 138, 139, 140, 141, 144, 145, 146, 147, 148, 154, 155, 160, 162, 167, 169, 173], "larger": 170, "largest": 154, "last": [9, 13, 14, 17, 18, 20, 141, 144, 145, 154, 155, 165, 166, 173], "latent": 148, "later": [129, 170, 175, 177], "latest": [2, 162, 177], "latin": 7, "latter": 151, "lava": [2, 157], "layer": [3, 15, 157, 158, 177], "lazili": 5, "lazyfram": 18, "lead": 170, "learn": [2, 5, 6, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 138, 142, 143, 146, 148, 149, 150, 151, 153, 154, 155, 156, 157, 158, 162, 163, 165, 169, 170, 171, 173, 176], "learner": 173, "learning_r": [163, 173, 175, 177], "learnt": 4, "least": [7, 129, 136], "leav": [132, 142, 158, 165, 173], "left": [7, 127, 128, 129, 132, 133, 137, 140, 141, 142, 143, 147, 151, 152, 153, 158, 163, 165, 169, 173, 175, 177], "left_elbow": [140, 141], "left_elfbow": [140, 141], "left_foot": [140, 141], "left_hip_i": [140, 141], "left_hip_x": [140, 141], "left_hip_z": [140, 141], "left_hiz_z": [140, 141], "left_kne": [140, 141], "left_lower_arm": [140, 141], "left_shoulder1": [140, 141], "left_shoulder2": [140, 141], "left_sin": [140, 141], "left_thigh": [140, 141], "left_upper_arm": [140, 141], "leg": [127, 129, 137, 139, 140, 141, 147], "leg_joint": [139, 147], "leg_left_joint": 147, "legaci": 154, "legend": [173, 175], "legend_el": 175, "len": [6, 17, 167, 170, 175], "length": [4, 6, 7, 9, 11, 12, 14, 17, 20, 127, 131, 132, 133, 134, 146, 148, 152, 153, 154, 155, 157, 161, 170, 175], "length_mask": 6, "length_moving_averag": 175, "length_queu": [14, 17, 161, 175], "leq": 142, "less": [127, 129, 146, 147, 157, 173], "lesson": 173, "let": [148, 155, 163, 165, 167, 173, 175, 177], "letter": [153, 154], "level": [9, 15, 157, 169], "levin": [137, 170], "lg": 155, "li": [141, 157], "lib": 151, "librari": [148, 155, 156, 160, 165, 169], "licens": [154, 155, 165, 170, 173, 175, 177], "lidar": 127, "life": [17, 154, 155, 177], "lift": 144, "light": 148, "lighter": 155, "lightgreen": 175, "lightweight": 148, "ligibl": 177, "like": [3, 5, 7, 9, 13, 14, 15, 16, 17, 18, 19, 20, 138, 143, 147, 148, 153, 154, 155, 157, 158, 165, 167, 169, 170, 172], "lillicrap": 170, "limit": [17, 20, 132, 136, 143, 154, 155, 157, 158, 160, 162, 163, 168, 170, 171, 175, 177], "lin826": 154, "linalg": [9, 158, 165], "line": [2, 5, 131, 153, 154, 155, 157, 158, 160, 165], "linear": [10, 129, 131, 142, 143, 146, 170, 177], "linearli": [14, 16, 18, 20, 131, 142, 143, 157, 177], "linecolor": 173, "lineplot": [173, 177], "linewidth": [173, 175], "link": [21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 131, 137, 145, 146, 155], "list": [2, 4, 5, 6, 7, 9, 10, 11, 12, 14, 15, 17, 18, 154, 155, 157, 158, 170, 173, 175], "list_info": 14, "listdir": 17, "liter": 155, "littl": [129, 155], "live": [1, 9, 163, 175], "ll": [154, 163, 170, 175], "llvm": 148, "load": [4, 154, 155], "load_env_cr": 4, "load_ext": 173, "load_state_dict": 170, "load_weight": 170, "loc": [153, 170], "local": 165, "localhost": 1, "locat": [4, 136, 148, 151, 152, 153, 158, 165], "log": [2, 9, 14, 16, 17, 18, 19, 161, 170, 173, 177], "log_prob": [170, 177], "log_prob_mean": 177, "logan": 155, "logger": [17, 154, 161], "logic": [11, 128, 154, 158, 165, 169], "logical_or": [10, 155], "logit": 170, "long": [9, 17, 132, 143, 148, 154, 155], "longer": [2, 136, 137, 139, 143, 147, 154, 155, 177], "look": [5, 9, 17, 157, 158, 165, 166, 167, 170, 173], "loop": [148, 157, 160, 163, 170, 175], "lose": [129, 150, 175], "loss": [17, 150, 170, 177], "lost": 17, "lot": [15, 157, 163, 167, 175, 177], "low": [2, 6, 7, 8, 12, 13, 14, 18, 131, 132, 134, 135, 154, 155, 157, 167, 170], "lower": [7, 14, 15, 19, 20, 140, 141, 154, 157, 163, 170, 175], "lower_waist": [140, 141], "lpizzinidev": 155, "lr": [163, 170, 173, 175, 177], "lt": [154, 155], "lucasalegr": 155, "luck": 163, "lunar": [126, 154, 155], "lunar_land": [129, 155], "lunarland": [17, 129, 154, 155, 156, 157, 160, 170], "lunarlandercontinu": 167, "lwaist": [140, 141], "lz4": 18, "m": [8, 131, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 170], "mab": 148, "machado": [16, 17], "machin": [154, 173, 177], "made": [1, 129, 137, 138, 139, 140, 141, 143, 146, 147, 154, 155, 169, 170], "magnitud": [9, 129, 134, 142, 143, 177], "mahiuchun": 154, "mai": [2, 5, 7, 8, 9, 11, 14, 15, 16, 18, 19, 151, 152, 157, 158, 161, 165, 169, 170], "main": [2, 129, 139, 147, 155, 169, 173], "main_bodi": [137, 169], "mainli": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "maintain": [136, 148, 154, 155, 156, 172], "mainten": 154, "major": [154, 155], "make": [1, 2, 6, 9, 11, 12, 15, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 163, 167, 168, 169, 170, 171, 173, 175, 177], "make_dict": 14, "make_vec": [2, 4, 9, 10, 11, 12, 14, 132, 155, 158, 162], "malici": 155, "manag": [4, 13, 148], "mandatori": [158, 165], "manhattan": [158, 165], "mani": [7, 14, 17, 136, 148, 154, 155, 157, 163, 165, 173, 175], "manipul": 148, "manner": [3, 7, 18, 20, 154], "mantasu": 155, "manual": [9, 14, 157, 160, 170], "manual_se": 177, "manuel": 155, "map": [9, 152, 153, 155, 157, 158, 165, 175, 177], "map_nam": 152, "map_siz": 173, "mar": [148, 155], "marionj": 155, "mark": 155, "markdown": 1, "market": 148, "markov": [155, 157, 166], "markovian": 148, "marku": 155, "markus28": [154, 155], "marsexplor": 148, "mask": [5, 6, 7, 153, 154, 155, 160, 170], "maskndarrai": 7, "mass": [129, 137, 140, 141, 146], "massiv": 154, "master": [18, 151, 154, 177], "match": [4, 8, 11, 12, 16, 132, 155], "math": [135, 148, 154], "mathbb": 7, "mathcal": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "mathemat": [5, 157, 177], "matplotlib": [9, 170, 173, 175, 177], "matpoliquin": 155, "matric": [5, 148], "matrix": [6, 140, 141, 148], "matter": [7, 154], "matyasch": 155, "max": [4, 7, 14, 16, 17, 18, 20, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 152, 155, 161, 163, 170, 173, 175, 177], "max_a": 166, "max_act": [14, 15, 16, 20], "max_episode_step": [4, 17, 155, 158, 165, 169, 170], "max_geom": 136, "max_id": 173, "max_length": 7, "max_ob": [14, 18, 20], "max_reward": [10, 14, 19, 155, 167], "max_time_step": [132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "maxandskipenv": 18, "maxandskipobserv": [18, 20, 155], "maxim": [170, 177], "maximis": 162, "maximum": [4, 7, 9, 14, 17, 18, 20, 129, 131, 135, 142, 155, 157, 165, 170, 173], "maxq": 153, "mayb": [2, 173], "mclean": 155, "md": [1, 154, 155, 165], "mdp": [2, 17, 133, 134, 148, 157], "mean": [2, 14, 18, 19, 20, 131, 139, 144, 154, 155, 157, 160, 161, 165, 170, 173, 177], "meant": 3, "meantim": 155, "measur": [9, 127, 135, 141, 142, 144, 145, 154], "mechan": 155, "medium": 148, "mel": 152, "melbourn": 173, "melt": 177, "member": 5, "memori": [5, 17, 133, 134, 154, 155, 162], "menageri": 169, "mention": [155, 177], "mere": 165, "merg": 155, "meshcat": 148, "meshgrid": 175, "messag": [5, 154, 155], "meta": [148, 154], "metaclass": 154, "metadata": [2, 3, 9, 10, 15, 17, 154, 155, 165, 167], "metatrad": 148, "metaworld": 148, "meter": [135, 148], "meth": [7, 17, 158], "method": [3, 4, 6, 9, 14, 16, 17, 18, 19, 148, 154, 155, 158, 159, 165, 167, 170, 177], "methodologi": 148, "metric": [2, 9, 155, 160, 170], "mgoulao": 155, "micim": 154, "microgrid": 148, "middl": [139, 147], "might": [2, 5, 7, 8, 15, 16, 157, 160, 161, 162, 163, 165, 167, 170, 175, 177], "migrat": [155, 157], "mihaic": 155, "miller": 173, "min": [7, 14, 16, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "min_act": [14, 15, 16, 20], "min_length": 7, "min_ob": [14, 18, 20], "min_reward": [10, 14, 19, 155, 167], "minari": 148, "minatari": 148, "mind": 155, "minecraft": 148, "minetest": 148, "miniatur": 148, "minibatch": 170, "minigrid": [148, 155], "minim": [2, 148, 155, 165, 166, 170, 173], "minimalist": 148, "minimum": [7, 14, 18, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 157, 173], "miniwob": 148, "miniworld": 148, "minor": [154, 155], "mirza": 170, "misc": [154, 155], "mischapanch": 155, "miss": [131, 153, 154, 155, 157], "mission": 5, "mistak": 155, "mit": [131, 170, 173, 175, 177], "mitig": 170, "mixtur": 155, "mj_camera": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "mj_step": [137, 140], "mjcf": 169, "mjdata": 136, "mjtvisflag": 136, "mjvcamera": 136, "mjvis_contactforc": 136, "mjvis_contactpoint": 136, "mjvisual": 136, "mkdir": [170, 173], "ml": [148, 154], "mmcaulif": 155, "mnih": 170, "mo": 148, "moba": 148, "mobil": 148, "mode": [2, 4, 10, 11, 12, 14, 154, 155, 157, 160, 165, 170, 175], "model": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 155, 163, 172, 175], "modern": 148, "modif": 167, "modifi": [2, 4, 5, 10, 14, 15, 16, 18, 19, 20, 131, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 158, 160, 165, 167], "modul": [4, 5, 13, 154, 155, 161, 165, 167, 170, 177], "modular": [14, 15, 148, 157, 165, 167], "module_nam": 155, "moment": 3, "monitor": [17, 154, 155, 160], "monop": 136, "mont": [163, 170, 175, 177], "month": 155, "moon": 129, "moor": [133, 134], "moore90efficientmemori": [133, 134], "more": [1, 2, 5, 6, 7, 14, 15, 19, 20, 127, 129, 136, 137, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 158, 160, 162, 163, 165, 166, 167, 169, 170, 173, 175, 177], "moreov": [5, 14, 15, 17, 165, 167], "moritz": [137, 170], "most": [2, 5, 7, 13, 14, 15, 17, 18, 136, 145, 148, 154, 155, 157, 158, 160, 162, 163, 165, 167, 169, 175], "mostli": 154, "motion": 148, "motor": [127, 146, 148, 154, 169], "motor1_rot": 146, "motor2_rot": 146, "mountain": [130, 154, 157], "mountaincar": [14, 133], "mountaincarcontinu": [14, 16, 19, 134], "mountaincarenv": 133, "move": [2, 14, 17, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 127, 129, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 151, 152, 153, 154, 155, 157, 158, 160, 165, 166, 169, 173, 177], "movement": [158, 165], "moviepi": [9, 17, 154, 155], "mozer": 131, "mp": 13, "mp4": 161, "mpi": 13, "mpi4pi": 13, "mpi_init": 13, "mt19937": 154, "mtsim": 148, "much": [154, 155, 165, 166, 169], "mujoco": [1, 9, 15, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 157, 167, 169, 171, 176], "mujoco_env": 154, "mujoco_gl": 136, "mujoco_menageri": 169, "mujoco_rend": 154, "mujocoenv": 155, "mujocorend": 155, "mujocorender": 155, "multi": [2, 7, 136, 140, 141, 144, 148], "multiag": 148, "multibinari": [5, 6, 7, 8, 13, 18, 155, 157], "multidiscret": [5, 6, 7, 8, 10, 13, 18, 154, 155, 157, 165], "multimod": 148, "multipl": [5, 6, 7, 10, 11, 12, 13, 154, 155, 158, 162, 170], "multipli": [6, 129, 134, 144, 145, 170], "multiprocess": [5, 11, 13, 170], "multivari": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "must": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 128, 139, 147, 154, 155, 157, 160, 165, 166], "mutual": 136, "my": 173, "mypi": [2, 155], "n": [4, 5, 6, 7, 8, 11, 12, 13, 17, 18, 20, 128, 131, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 157, 161, 163, 170, 173, 175, 177], "n_action": 170, "n_env": 170, "n_episod": [163, 175], "n_featur": 170, "n_run": 173, "n_showcase_episod": 170, "n_steps_per_upd": 170, "n_updat": 170, "na": 145, "name": [1, 3, 4, 11, 12, 14, 15, 19, 20, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 158, 165], "name_prefix": [9, 17, 161], "namedtupl": [155, 173], "namespac": [4, 158, 165], "nanoquadrotor": 148, "narrow": 173, "nativ": 17, "natur": [148, 150, 152, 175], "navig": [148, 153, 158, 165, 167], "nbodi": [137, 140, 141], "nchain": 154, "ncol": [151, 152, 170, 173, 175], "ncrement": 177, "ndarrai": [2, 3, 6, 7, 8, 13, 14, 16, 17, 18, 19, 131, 132, 133, 134, 135, 153, 170, 177], "neccari": 9, "necessari": [2, 5, 126, 154, 155], "necessarili": 8, "need": [2, 5, 6, 9, 14, 15, 16, 18, 127, 129, 132, 136, 154, 155, 157, 158, 160, 161, 162, 163, 165, 166, 167, 169, 170, 173, 175], "neg": [2, 7, 9, 134, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 153, 157, 177], "neither": [7, 17, 150, 153], "nerdyespresso": 155, "nervous": 127, "nest": [6, 13, 148], "net": [14, 19, 128, 150, 151, 163, 170, 177], "network": [5, 8, 10, 13, 146, 148, 157, 170], "neural": [5, 8, 10, 13, 131, 146, 148, 170, 177], "neurip": 131, "neuro": 148, "neuroflight": 148, "neuronlik": [132, 142, 143], "never": [2, 138, 141, 144, 145, 146, 155], "nevertheless": 167, "new": [2, 5, 14, 15, 16, 17, 18, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 146, 147, 153, 154, 155, 156, 157, 158, 159, 160, 162, 165, 166, 167, 169, 170, 173, 175], "new_action_spac": 14, "new_obs_spac": 14, "new_spac": 14, "new_stat": 173, "new_throughput": 9, "newer": [136, 154], "newli": [14, 18, 19], "newlin": 2, "next": [2, 3, 10, 13, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 155, 156, 157, 158, 160, 163, 169, 177], "next_ob": [155, 163, 175, 177], "next_stat": [3, 155, 175], "nice": 154, "nich": 157, "nidhish": 155, "nine": 137, "nintendo": 7, "nip": 131, "nlp": 148, "nlpgym": 148, "nn": [170, 177], "no_grad": 170, "node": [5, 6, 8, 154, 157], "node_spac": [6, 8], "nois": [130, 136], "non": [2, 5, 8, 9, 17, 129, 132, 137, 138, 139, 140, 141, 146, 147, 148, 152, 154, 155, 165, 166, 177], "non_random_ob": 128, "nonbreak": 154, "nondeterminist": [4, 165], "none": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 136, 152, 154, 155, 158, 160, 165, 170, 173, 175], "nonlinear": 139, "noop": [7, 9, 17, 153, 170], "noop_max": 17, "nor": [7, 17, 150, 151, 153], "norm": [9, 144, 145, 158, 165], "normal": [2, 4, 7, 10, 13, 14, 17, 18, 19, 20, 127, 128, 135, 137, 138, 142, 154, 155, 158, 161, 162, 170, 177], "normalis": 155, "normalize_tim": 18, "normalizeobserv": [14, 18, 20, 154, 155], "normalizereward": [14, 19, 20, 154], "normalizex": 155, "north": 153, "notabl": [154, 155, 166], "note": [1, 5, 9, 13, 14, 17, 18, 19, 131, 132, 136, 137, 139, 140, 141, 142, 144, 147, 153, 157, 162, 163, 166, 169, 170, 175, 177], "notebook": [1, 154, 165, 166, 167, 170, 171, 173, 175, 177], "noth": [3, 128, 129, 155, 177], "notic": 155, "notif": 155, "notimplementederror": 8, "nov": 153, "now": [2, 127, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 158, 160, 165, 166, 169, 173, 175, 177], "np": [2, 3, 5, 6, 7, 9, 10, 12, 13, 14, 16, 18, 19, 129, 133, 140, 141, 153, 154, 155, 158, 163, 165, 167, 169, 170, 173, 175, 177], "np_random": [2, 5, 9, 10, 11, 12, 15, 154, 155, 158, 160, 165], "np_random_se": [2, 10, 11, 12, 15, 155], "npt": 5, "nrow": [152, 170, 173], "nstep": 155, "num": [14, 17, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "num_col": 4, "num_edg": 6, "num_env": [4, 10, 11, 12, 13, 14, 132, 155, 158, 170], "num_eval_episod": 161, "num_nod": 6, "num_training_episod": 161, "number": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 17, 18, 20, 128, 129, 131, 136, 139, 140, 141, 146, 147, 148, 150, 152, 154, 155, 157, 158, 159, 161, 162, 165, 169, 170, 173, 177], "numer": [142, 143, 150, 154, 155, 167, 177], "numpi": [2, 5, 7, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 154, 155, 158, 160, 163, 165, 167, 169, 170, 173, 175, 177], "numpytotorch": [14, 17, 20, 155], "nutshel": [14, 19], "nv": [140, 141], "nvec": 7, "nvidia": 148, "o": [17, 170, 177], "o_": 166, "o_t": 166, "ob": [9, 12, 14, 16, 17, 18, 128, 153, 154, 155, 161, 163, 167, 175, 177], "obj_slidei": 144, "obj_slidex": 144, "object": [2, 5, 9, 11, 12, 13, 14, 15, 136, 144, 145, 148, 155, 157, 175, 177], "obs0": 18, "obs1": 18, "obs2": 18, "obs3": 18, "obs4": 18, "obs_kei": 18, "obs_shap": 170, "obs_space_dim": 177, "obs_t": 9, "obs_tp": 9, "obs_tp1": 9, "observ": [2, 3, 4, 5, 6, 9, 10, 11, 12, 13, 15, 17, 19, 20, 148, 154, 155, 156, 160, 162, 163, 166, 167, 173, 177], "observation_mod": [11, 12], "observation_spac": [2, 5, 6, 7, 9, 10, 11, 12, 13, 14, 15, 18, 157, 158, 165, 167, 173, 177], "observation_structur": [137, 138, 139, 140, 141, 143, 146, 147], "observationwrapp": [14, 15, 18, 165, 169], "obstyp": [2, 3, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 155, 160, 177], "obswrapp": 154, "obtain": [15, 17, 135, 150, 158, 166], "obviou": 166, "obvious": 155, "occur": [2, 10, 131, 132, 165, 170], "octob": 136, "off": [17, 128, 129, 151, 153, 154, 155, 160, 163, 173], "offer": 148, "offici": 3, "offlin": [148, 155], "offscreen": [154, 155], "offset": [129, 140, 141], "offworld": 148, "often": [2, 5, 148, 155, 158, 165, 173], "oftentim": [158, 165], "old": [10, 16, 154, 155, 156, 159, 160], "old_spac": 14, "old_throughput": 9, "older": [136, 155, 160], "oldv21env": 159, "oleg": [126, 127, 128, 129], "olimoyo": 154, "olipinski": [154, 155], "omar": 155, "omega_1": 142, "omega_2": 142, "omesa": 136, "omg": 148, "omit": [137, 138, 139, 140, 141, 146, 147], "omniisaacgymenv": 148, "omnivers": 148, "onc": [1, 17, 153, 154, 158, 165, 170, 175], "one": [2, 5, 6, 7, 8, 9, 11, 15, 131, 132, 133, 134, 135, 137, 139, 142, 143, 146, 150, 153, 154, 155, 158, 159, 160, 163, 166, 167, 169, 170, 173, 175, 177], "oneof": [5, 6, 8, 13, 155], "ones": [14, 130, 154, 170, 175], "ongo": 170, "onli": [2, 4, 5, 6, 7, 8, 9, 17, 18, 131, 132, 133, 134, 137, 138, 139, 140, 141, 142, 143, 145, 146, 147, 154, 155, 158, 160, 161, 163, 165, 167, 169, 170, 173, 175], "onlin": [140, 141, 150, 151], "onward": 166, "op": [17, 155], "open": [1, 17, 136, 148, 154, 155, 165], "openai": [2, 19, 126, 154, 155, 156, 157, 160, 166, 175, 177], "opencv": [14, 17, 18, 20, 154], "opengl": [136, 155], "openmodelica": 148, "openreview": [14, 19], "oper": [8, 14, 136, 154], "opportun": 148, "opt": 169, "optim": [13, 14, 17, 127, 129, 140, 141, 148, 155, 163, 170, 173, 175, 177], "optimis": [10, 148, 154, 172], "option": [2, 3, 4, 5, 6, 7, 10, 11, 12, 14, 15, 18, 128, 131, 132, 133, 134, 135, 136, 148, 154, 155, 157, 158, 160, 165], "ord": [9, 158, 165], "order": [4, 5, 6, 15, 17, 129, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 157, 161, 162, 165, 173], "order_enforc": [4, 165], "ordereddict": 155, "orderenforc": [4, 15, 17, 20, 127, 128, 129, 131, 132, 133, 134, 135, 154, 155, 157, 158, 165], "orderli": 154, "org": [21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 155, 163, 170, 175, 177], "orient": [129, 137, 140, 141], "origin": [8, 10, 14, 16, 17, 18, 128, 131, 132, 142, 154, 155, 160], "orsini": 154, "other": [14, 16, 129, 135, 136, 138, 140, 142, 146, 147, 152, 153, 154, 155, 157, 158, 159, 161, 163, 165, 167, 169, 170, 173, 175, 177], "otherwis": [2, 4, 6, 7, 9, 11, 12, 14, 17, 18, 139, 154, 155, 158, 163, 173, 175], "our": [155, 157, 158, 163, 165, 169, 170, 173, 175], "out": [2, 3, 13, 14, 15, 148, 154, 155, 158, 165, 166, 167, 172, 177], "outcom": [150, 173], "outer": 131, "outermost": 14, "outlin": [150, 154, 157, 158, 160, 163, 175], "output": [1, 9, 10, 13, 17, 154, 155, 157, 158, 165, 170, 175, 177], "output_truncation_bool": 9, "outsid": [2, 14, 18, 128, 129, 154, 155, 166], "outweigh": 155, "over": [5, 9, 13, 17, 138, 140, 150, 152, 154, 155, 161, 163, 167, 169, 170, 175], "overal": 155, "overcom": 173, "overfit": [169, 170], "overflow": 155, "overhaul": 155, "overhead": 129, "overrid": [4, 11, 14, 155, 167], "overview": [163, 165, 177], "overwrit": [16, 18, 19], "overwritten": [15, 154, 155], "own": [11, 15, 148, 155, 158, 160, 167, 168, 169, 171], "p": [138, 148, 151, 152, 153, 154, 170, 173], "p_": [144, 145], "p_0": 148, "packag": [1, 10, 17, 126, 148, 154, 155, 173], "package_data": 154, "pad": [18, 129, 155], "padding_typ": [18, 155], "page": [16, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 151, 155, 157, 158, 162, 163, 172], "painlessli": 5, "pair": [136, 140, 141], "palett": 177, "pan": 144, "panda": [148, 173, 177], "paper": [131, 142, 148, 153, 155, 162], "paradigm": 2, "parallel": [10, 11, 12, 14, 148, 155, 158, 161, 162, 170], "param": [3, 7, 9, 155, 173], "paramet": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 157, 158, 160, 165, 167, 170, 177], "parameter": [2, 170, 177], "parameter_tun": 154, "parametr": [5, 177], "parent": [13, 173], "pars": 4, "parse_env_id": 4, "parser": 155, "part": [17, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 154, 155, 158, 166, 169, 173], "parti": [136, 137, 138, 139, 140, 141, 143, 146, 147, 154, 155, 158, 165], "partial": [2, 148, 154, 155], "particular": [5, 6, 9, 20, 129, 155, 157, 159, 162, 163, 166, 169, 173], "particularli": [9, 17, 18, 132, 155], "pass": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 128, 137, 138, 139, 140, 141, 146, 147, 154, 155, 157, 159, 160, 165, 167, 170], "passeng": 153, "passenger_loc": 153, "passidx": 153, "passiv": [17, 20, 154, 155], "passiveenvcheck": [4, 15, 17, 20, 127, 128, 129, 131, 132, 133, 134, 135, 155, 157, 158], "past": [14, 18, 19, 154], "patch": [154, 175], "path": [4, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 152, 155, 165, 170, 173], "pathlib": 173, "patienc": 154, "paulmest": 155, "paw": 138, "pcg64": 154, "pd": [173, 177], "pdf": [131, 150, 151], "peg": 154, "pelvi": [140, 141], "penal": [137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 167], "penalis": [133, 134], "penalti": 153, "pend": [11, 154], "pendulum": [11, 12, 18, 130, 131, 132, 136, 154, 157, 177], "pendulumenv": 135, "per": [7, 9, 10, 17, 129, 132, 153, 154, 162, 166, 169, 170], "percentag": [128, 163, 175], "peretz": 155, "perez": 155, "perf_count": 154, "perfect": 157, "perfectli": 165, "perform": [2, 5, 14, 145, 154, 155, 157, 159, 162, 177], "period": [139, 161], "periodiclli": 161, "perman": [144, 145, 158], "permiss": [154, 155], "perpendicular": [151, 152, 173], "perpendicularli": 142, "person": 131, "perspect": 148, "perturb": [137, 138, 139, 140, 141, 142, 143, 146, 147], "pettingzoo": [2, 148, 154], "pge": 148, "phase": [148, 170], "phd": [133, 134, 146], "philosoph": 9, "photorealist": 148, "phys2d": 17, "physic": [2, 126, 136, 142, 143, 144, 145, 148, 154, 155, 177], "pi": [129, 131, 133, 135], "pick": [153, 163, 169, 170, 173, 175], "pickl": [1, 13, 154, 155], "pickled_env_spec": 155, "pickup": 153, "pictur": 157, "piec": 165, "pierrecounath": 155, "pierremardon": 155, "pierr\u00e9": [155, 173], "pinocchio": 148, "pip": [1, 126, 130, 136, 154, 155, 157, 165, 169], "pip3": 154, "pipe": [11, 154], "pipx": 165, "pirata": 154, "pitfal": 127, "pix_square_s": 165, "pixel": [2, 18, 128, 148, 165], "pixelobservationwrapp": [18, 154, 155], "pixelobswrapp": 154, "pixels3d": 165, "place": [9, 17, 132, 133, 134, 155, 158, 165, 170], "plai": [9, 128, 148, 154, 155, 163, 175], "plan": [148, 154, 155], "platform": 148, "playablegam": 9, "player": [148, 150, 151, 152, 153, 163, 173, 175], "player_count": 175, "playfield": 128, "playplot": [9, 155], "pleas": [2, 5, 7, 18, 148, 154, 155, 158, 160, 163, 172], "plot": [1, 9, 173, 175], "plot_nam": 9, "plot_q_values_map": 173, "plot_states_actions_distribut": 173, "plot_steps_and_reward": 173, "plot_surfac": 175, "plotter": 9, "plt": [170, 173, 175, 177], "plu": [7, 151], "plugin": [154, 155], "pm": 129, "png": 173, "point": [4, 8, 9, 15, 127, 128, 129, 131, 132, 135, 136, 145, 150, 154, 155, 158, 161, 166], "pointless": 155, "pole": [2, 130, 136, 142, 143, 148, 177], "pole2": 142, "poli": 148, "polici": [2, 14, 18, 19, 130, 136, 137, 138, 139, 140, 141, 146, 147, 148, 154, 155, 156, 157, 160, 166, 169, 170, 173], "policy_grid": 175, "policy_mean_net": 177, "policy_network": 177, "policy_stddev_net": 177, "polygon": 154, "pomdp": 3, "pong": [17, 155], "pontryagin": 129, "pool": [17, 146], "pop": [2, 154, 155], "pop_fram": 17, "popular": [126, 148, 155, 163, 175], "pose": 148, "posit": [2, 6, 7, 8, 9, 11, 13, 17, 127, 128, 129, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 151, 152, 153, 155, 157, 158, 165, 167, 175], "position_fingertip": 145, "position_target": 145, "possibl": [2, 5, 6, 7, 8, 14, 129, 131, 132, 133, 134, 136, 138, 140, 143, 146, 151, 152, 153, 154, 155, 157, 158, 161, 163, 165, 169, 173, 175], "possibli": [7, 13, 15, 17, 157, 170], "post": [154, 155, 160, 173], "postprocess": 173, "potenti": 155, "pow": 170, "power": [17, 128, 129, 134, 142, 143, 148, 177], "pp": 153, "ppo": 172, "pprint": [14, 155], "pprint_registri": [2, 4, 155, 157, 158], "pr": [140, 148, 154, 155, 160], "practic": [148, 154, 166, 175], "pre": [4, 5, 152, 155, 165], "precis": [6, 154], "predefin": 165, "predict": [139, 148, 170, 177], "predominantli": 3, "prefix": 161, "preload": 152, "premad": 148, "prematur": 2, "prepend": [9, 17], "preprocess": [9, 17, 20, 154, 155], "present": [131, 148, 154, 166, 175], "preserv": 166, "press": [7, 9, 128, 131, 157], "pretti": [4, 155, 173], "prevent": [3, 7, 11, 154, 155, 158], "previou": [17, 155, 160, 166, 167], "previous": [4, 9, 18, 137, 138, 139, 140, 141, 143, 146, 147, 154, 155, 159], "primari": 160, "primarili": [148, 155], "primaryclass": 155, "principl": 129, "print": [4, 155, 161, 165, 167, 170, 173, 177], "print_registri": 4, "prior": 4, "priori": 148, "privat": 165, "prng": [2, 5, 6, 7], "proabil": [151, 153], "prob": [170, 177], "proba_frozen": 173, "probabl": [5, 16, 20, 151, 152, 153, 155, 163, 166, 170, 173, 175, 177], "problem": [17, 129, 132, 135, 137, 140, 141, 142, 143, 146, 148, 150, 153, 156, 161, 162, 166, 169, 170], "procedur": 177, "proceed": 131, "process": [9, 10, 11, 13, 14, 18, 131, 148, 155, 157, 165, 166, 170], "process_ev": 9, "produc": [2, 17, 20, 154, 155, 165], "product": [6, 7, 177], "program": [10, 11, 12, 148, 155, 157], "progress": [6, 9, 155, 170], "project": [9, 148, 154, 155, 157, 158, 160, 163, 165, 175], "prompt": 155, "propag": 173, "proper": 154, "properli": [1, 5, 154], "properti": [2, 5, 10, 11, 12, 14, 15, 18, 19, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 166], "proport": 170, "propos": [16, 155], "protocol": 17, "prototyp": 148, "prove": [148, 163], "prover": 148, "provid": [2, 5, 6, 7, 9, 10, 11, 14, 16, 17, 18, 20, 126, 127, 131, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 157, 158, 159, 161, 162, 163, 165, 172, 175], "psc": 154, "pseudo": [154, 155, 170], "pseudorandom": [5, 7], "publicli": 155, "publish": 155, "pull": [18, 19, 154], "pump": 165, "purpos": [3, 5, 154, 158, 177], "push": [129, 132, 136, 142, 143, 148, 155, 177], "pusher": [136, 155], "pusher_v5": 144, "put": [154, 165, 173], "puzzl": 148, "py": [1, 9, 126, 127, 128, 129, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 151, 154, 155, 165, 166, 167, 170, 173, 175, 177], "pybullet": 148, "pycharm": 155, "pydoc": 154, "pyflyt": [148, 155], "pygam": [2, 9, 126, 154, 155, 160, 165], "pyglet": 154, "pypi": [154, 155], "pyplot": [170, 173, 175, 177], "pyproject": 165, "pyright": [2, 154, 155], "pystk2": 148, "python": [1, 17, 126, 127, 128, 129, 148, 154, 155, 156, 157, 158, 161, 165, 166, 167, 170, 171, 173, 175, 177], "pytorch": [14, 17, 20, 155, 162, 177], "q": [153, 163, 166, 170, 171, 176], "q_": 166, "q_target": 166, "q_updat": 173, "q_valu": [153, 163, 175], "qfrc_actuat": [140, 141], "qfrc_constraint": 142, "qgallouedec": 155, "qlearn": 173, "qpo": [136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "qtabl": 173, "qtable_best_act": 173, "qtable_direct": 173, "qtable_directions_map": 173, "qtable_val_max": 173, "quad": 141, "quad_ctrl_cost": 141, "quad_impact_cost": 141, "quadcopt": 148, "quadrotor": 148, "quadrup": [136, 137, 155], "quagla": 154, "qualiti": [10, 148, 155], "quardup": 136, "quaternion": [137, 140, 141], "queen": 150, "question": [148, 165], "queue": [11, 161], "quicker": 170, "quickli": [133, 148, 163, 173, 175], "quit": [11, 165], "qvel": [136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "qwop": 148, "r": [1, 7, 14, 17, 19, 131, 135, 148, 150, 151, 153, 167, 173, 177], "r1etn1rtpb": [14, 19], "r_": 170, "r_elbow_flex_joint": 144, "r_forearm_roll_joint": 144, "r_shoulder_lift_joint": 144, "r_shoulder_pan_joint": 144, "r_t": 166, "r_upper_arm_roll_joint": 144, "r_wrist_flex_joint": 144, "r_wrist_roll_joint": 144, "race": [126, 148, 154], "racecar": 148, "racecar_gym": 148, "racing_dream": 148, "rad": [131, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147], "radian": [129, 131, 132, 135, 143], "rafaelcp": 155, "raghuspacerajan": 154, "rainbow": [172, 177], "rais": [2, 4, 7, 8, 9, 11, 12, 13, 17, 18, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 170], "ram": 162, "rand": 154, "randint": [154, 160], "randn": 154, "random": [2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 17, 18, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 145, 146, 147, 152, 153, 154, 155, 157, 158, 163, 165, 168, 171, 173, 175, 177], "randomize_domain": 170, "randomize_ob": 128, "randomli": [5, 6, 9, 129, 152, 154, 155, 158, 165, 170, 173], "randomnumbergener": 154, "randomst": 154, "rang": [9, 13, 14, 16, 17, 18, 20, 127, 130, 131, 132, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 163, 165, 167, 169, 170, 173, 175, 177], "rangefind": 127, "raphajan": 155, "rapidli": 148, "rare": [7, 154], "rate": [144, 145, 148, 163, 170, 173, 175, 177], "rather": [2, 4, 128, 154, 155, 160, 161, 163, 170, 175], "raw": 155, "rayerdyn": 155, "rcparam": 177, "rdnfn": 154, "re": [1, 154, 155, 165, 173, 177], "reach": [2, 4, 17, 18, 20, 130, 131, 132, 133, 134, 136, 148, 151, 152, 153, 155, 158, 160, 163, 165, 166, 172, 173, 175, 177], "reachabl": 153, "reacher": [136, 155, 167], "reacher_weighted_reward": 165, "reacherrewardwrapp": 167, "read": [2, 5, 13, 19, 131, 137, 140, 141, 154, 155, 158, 160, 163], "read_from_shared_memori": [5, 13], "readabl": [5, 6, 155], "reader": [162, 169], "readi": [169, 175], "readm": [154, 155, 165], "readthedoc": 18, "real": [9, 138, 148], "realist": [136, 148], "realli": [155, 163, 169, 175], "rear": 128, "reason": [2, 17, 129, 148, 154, 155, 160, 169], "rebuild": 1, "receiv": [9, 18, 19, 20, 128, 129, 134, 147, 153, 154, 156, 157, 163, 166, 170, 175, 177], "recent": [13, 14, 17, 18, 19, 155, 159], "recogn": 155, "recommend": [3, 4, 9, 10, 11, 17, 129, 136, 137, 140, 141, 154, 155, 158, 160, 163, 169, 170, 175], "reconstruct": 157, "record": [4, 9, 17, 20, 127, 148, 154, 155, 157, 170, 177], "recordconstructorutil": 155, "recorded_fram": 155, "recordepisodestat": 154, "recordepisodestatist": [14, 17, 20, 154, 155, 160, 161, 163, 170, 175, 177], "recordepisodicstatist": 161, "recordvideo": [17, 20, 154, 155, 160, 161], "recordvideov0": 155, "recreat": 155, "recreated_env": 155, "rect": 165, "red": [153, 154, 165], "redirect": [21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125], "redtachyon": [154, 155], "reduc": [132, 148, 162, 163, 170, 172, 175], "redund": 154, "refactor": [154, 155], "refer": [2, 4, 5, 7, 9, 155, 156, 157, 160, 162, 165, 166, 169], "referenc": 155, "reflect": [18, 132, 154, 155], "refresh": 173, "regard": 130, "regardless": [137, 138, 139, 140, 141, 146, 147, 169], "regex": 4, "reginald": 155, "regist": [13, 154, 155, 159], "register_env": [17, 155], "registr": [4, 154, 155, 165], "registri": [4, 154, 155], "regress": 9, "regularli": 155, "reimplement": 160, "reinforc": [2, 5, 131, 146, 148, 149, 150, 151, 153, 154, 155, 156, 157, 162, 163, 166, 170, 171, 173, 175, 176], "reinforce_invpend_gym_v26": 177, "reiniti": [154, 177], "rel": [131, 140, 141, 155, 165, 167, 170], "relat": [129, 132, 137, 139, 140, 141, 142, 143, 144, 145, 147, 148, 155, 177], "relative_posit": 165, "relativeposit": [165, 167], "releas": [10, 11, 12, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 165], "relev": [154, 159, 165], "relu": [14, 170], "remain": [18, 151, 154, 165, 166], "remak": 170, "rememb": [17, 128, 165, 167], "remind": [155, 169], "remot": [148, 155], "remov": [2, 4, 17, 18, 137, 138, 139, 140, 141, 145, 146, 147, 153, 154, 155], "renam": [14, 17, 18, 138, 140, 154, 155, 177], "render": [2, 3, 4, 10, 14, 15, 17, 18, 20, 126, 127, 137, 138, 139, 140, 146, 147, 148, 154, 155, 157, 158, 161, 163, 167, 173, 175], "render_clos": 3, "render_fp": [2, 9, 17, 155, 165], "render_imag": 3, "render_init": 3, "render_kei": 18, "render_mod": [2, 3, 9, 10, 15, 17, 18, 127, 128, 131, 132, 133, 134, 135, 136, 154, 155, 156, 157, 159, 160, 161, 165, 170, 173], "render_onli": 18, "render_st": 3, "rendercollect": [2, 17, 20, 155], "renderedfram": [17, 20], "renderfram": [2, 9, 10, 14, 15], "renderstatetyp": 3, "renorm": 129, "rep": 173, "repeat": [4, 16, 18, 20, 155, 157, 158, 173], "repeat_action_prob": 16, "rephras": 154, "replac": [17, 18, 137, 146, 150, 154, 155, 160, 161, 163, 165, 175], "replai": 10, "replay_buff": 155, "repo": [1, 154, 155], "report": [127, 137, 140, 141, 154, 155], "repositori": [136, 154], "repres": [2, 6, 7, 9, 18, 129, 131, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 151, 152, 153, 154, 155, 156, 157, 158, 160, 165, 169, 173, 177], "represent": [2, 155, 157, 166], "reproduc": [2, 6, 148, 155, 160, 173], "request": [18, 150, 155], "requir": [1, 2, 5, 6, 9, 10, 11, 14, 18, 136, 147, 154, 155, 157, 158, 162, 163, 165, 169, 170, 175, 177], "rerun": 155, "res_al": 173, "rescal": [14, 16, 18, 20], "rescale_act": 154, "rescaleact": [14, 15, 16, 20, 155, 157], "rescaleobserv": [14, 18, 20, 155], "research": [136, 142, 148, 153, 155, 162], "reseed_sampl": 155, "reseed_valu": 155, "reset": [2, 3, 4, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 146, 147, 151, 152, 153, 154, 155, 156, 157, 161, 163, 170, 173, 175, 177], "reset_async": 155, "reset_clean": 17, "reset_noise_scal": [137, 138, 139, 140, 141, 142, 143, 146, 147, 169], "reset_qt": 173, "reset_wait": 155, "resetneed": 17, "reshap": [14, 18, 20, 173], "reshape_env": 18, "reshapeobserv": [14, 18, 20], "resiz": [14, 17, 18, 20, 155], "resized_env": 18, "resizeobserv": [14, 18, 20, 155], "resolv": 154, "resourc": [10, 11, 12, 165, 175], "respect": [14, 15, 17, 129, 131, 137, 138, 139, 140, 141, 146, 147, 154, 155, 157, 167, 170], "respectivesli": 14, "respond": 175, "rest": [128, 129, 139, 147, 155, 158, 170], "restart": 157, "restor": [137, 138, 139, 140, 141, 146, 147], "restrict": 155, "result": [2, 6, 8, 10, 11, 12, 13, 17, 129, 131, 132, 136, 137, 140, 141, 142, 145, 149, 151, 153, 154, 155, 157, 158, 163, 170, 173, 175], "retriev": 4, "retro": [148, 155], "return": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 127, 137, 138, 139, 140, 141, 145, 146, 147, 150, 151, 152, 153, 154, 155, 157, 158, 160, 163, 165, 166, 167, 170, 173, 175, 177], "return_info": [2, 154], "return_queu": [14, 17, 161, 170, 175, 177], "reveal": 150, "revers": [5, 8, 150, 170, 177], "revert": [154, 155], "revisit": 17, "rew": [9, 14, 17, 19, 166], "reward": [2, 3, 4, 9, 10, 11, 12, 15, 17, 18, 20, 148, 154, 155, 156, 157, 158, 160, 161, 163, 165, 166, 167, 170, 173, 175, 177], "reward_al": 140, "reward_control": [144, 145], "reward_control_weight": [144, 145], "reward_ctrl": [14, 137, 139, 140, 144, 145, 147, 167], "reward_ctrl_weight": 167, "reward_dist": [144, 145, 167], "reward_dist_weight": [144, 145, 167], "reward_forward": [137, 138, 139, 140, 146, 147], "reward_fwd": 146, "reward_linvel": 140, "reward_moving_averag": 175, "reward_near": 144, "reward_near_weight": [144, 145], "reward_over_episod": 177, "reward_quadctrl": 140, "reward_rang": 155, "reward_run": [14, 138], "reward_surv": [137, 139, 140, 142, 143, 147], "reward_threshold": [4, 137, 138, 139, 140, 141, 144, 145, 146, 147, 155, 165], "rewards_df": 173, "rewards_over_se": 177, "rewards_to_plot": 177, "rewardtyp": [3, 155], "rewardwrapp": [14, 15, 19, 169], "rewrit": [154, 155], "rgb": [2, 14, 17, 18, 20, 128, 137, 138, 139, 140, 146, 147, 154], "rgb_arrai": [2, 3, 9, 17, 18, 20, 127, 128, 131, 132, 133, 134, 135, 136, 154, 161, 165, 170, 173], "rgb_array_list": [2, 9, 17, 154], "richard": 173, "right": [2, 7, 127, 128, 129, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 146, 147, 151, 152, 153, 155, 158, 165, 173, 175, 177], "right_back_leg": 137, "right_elbow": [140, 141], "right_foot": [140, 141], "right_hip_i": [140, 141], "right_hip_x": [140, 141], "right_hip_z": [140, 141], "right_kne": [140, 141], "right_lower_arm": [140, 141], "right_shoulder1": [140, 141], "right_shoulder2": [140, 141], "right_sin": [140, 141], "right_thigh": [140, 141], "right_upper_arm": [140, 141], "rigid": [140, 141], "rl": [9, 17, 148, 155, 156, 163, 166, 169, 175, 177], "rlbook2020": [150, 151], "rmsprop": 170, "rnd": [154, 155], "rng": [2, 3, 5, 6, 7, 9, 10, 154, 155, 165, 173], "road": [128, 155], "roadmap": 155, "robertoschiavon": 155, "robo": 148, "robot": [127, 133, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 157, 160, 166], "robust": [148, 170], "rocket": [129, 148], "rodrigo": 155, "rodrigodelazcano": [154, 155], "rogerjl": 155, "rohan138": 154, "roll": [14, 18, 19, 20, 144, 175], "rolling_length": [170, 175], "rollout": [153, 155], "rom": [154, 155], "ronald": 177, "root": [6, 137, 140, 141, 155, 173], "rooti": [138, 139, 147], "rootx": [138, 139, 147], "rootz": [138, 139, 147], "rotat": [129, 131, 137, 140, 141, 144, 154, 175], "rotor": [137, 138, 139, 140, 141, 146, 147], "roughli": 136, "roulett": 154, "roundtrip": 17, "row": [151, 152, 173], "rpg": 152, "rr": 155, "rstride": 175, "rule": [148, 150, 175], "run": [1, 2, 4, 10, 11, 12, 14, 17, 18, 19, 127, 129, 131, 136, 137, 138, 139, 140, 146, 147, 148, 151, 154, 155, 157, 158, 162, 165, 169, 170], "run_": 1, "run_env": 173, "run_gymnasium_env": 165, "runner": 136, "running_g": 177, "runningmeanstd": [14, 18, 19], "runtim": 170, "runtimeerror": [11, 12, 170], "rushivarora": 154, "rusu24edward": [154, 155], "ryanrud": 154, "r\u00e9mi": 146, "s_": 170, "sab": [150, 163, 175], "safe": [129, 148, 155, 157], "safeti": [148, 154, 155], "sai": [17, 154, 167], "said": [3, 140], "same": [4, 6, 8, 9, 10, 11, 12, 13, 14, 16, 18, 20, 127, 128, 131, 136, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 153, 154, 155, 158, 161, 162, 166, 169, 170, 175], "sampl": [2, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 17, 18, 19, 129, 144, 153, 154, 155, 156, 157, 158, 160, 161, 162, 163, 165, 169, 170, 173, 175, 177], "sample_act": 177, "sample_mask": 6, "sample_n": 5, "sample_phas": 170, "samplefactori": 155, "sander": 155, "saniti": 173, "sashashura": [154, 155], "satisfi": 2, "satur": 148, "save": [14, 17, 154, 155, 160, 161, 167, 173], "save_logg": 9, "save_video": [9, 154], "save_videos1": 17, "save_videos2": 17, "save_videos3": 17, "save_weight": 170, "savefig": 173, "savefig_fold": 173, "sb3": 154, "scalabl": 148, "scalar": [7, 14, 16, 154], "scale": [14, 17, 18, 19, 129, 137, 138, 139, 140, 141, 142, 143, 146, 147, 148, 157, 162, 170], "scale_and_shift": 14, "scale_ob": 17, "scenario": [128, 148], "scene": [2, 148, 155, 169], "sched": 148, "schedul": [9, 148, 152], "scheme": 128, "schulhoff": 155, "schulman": [137, 170], "scope": [2, 166], "score": [127, 129], "scoreboard": 154, "scratch": [170, 177], "screen": [17, 137, 138, 139, 140, 146, 147, 154, 155], "screen_siz": 17, "script": [1, 148, 161, 175], "seaborn": [173, 175, 177], "search": 155, "second": [1, 6, 9, 10, 17, 129, 132, 142, 145, 146, 148, 155, 161, 162, 169], "section": [16, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 157, 158, 160, 163, 165, 173, 175], "see": [1, 2, 5, 6, 9, 10, 14, 17, 19, 129, 131, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 151, 152, 154, 155, 157, 158, 160, 161, 163, 165, 169, 170, 173], "seed": [2, 3, 4, 5, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 131, 132, 133, 134, 135, 136, 154, 155, 156, 157, 158, 165, 170, 173, 177], "seeded_valu": 155, "seem": 173, "seen": 131, "segment": 146, "select": [5, 7, 9, 136, 155, 157, 163, 169, 170], "select_act": 170, "self": [5, 6, 7, 9, 14, 18, 19, 134, 148, 155, 158, 163, 165, 167, 170, 173, 175, 177], "semant": 154, "send": 14, "sens": [7, 169, 173], "sensor": [128, 169], "sent": 154, "separ": [5, 17, 155, 158, 160, 165, 170], "sepsi": 148, "sequenc": [2, 4, 5, 6, 7, 8, 11, 12, 13, 14, 18, 148, 154, 155, 157], "sequenti": [170, 177], "seri": [6, 157, 161], "serial": [12, 170], "serv": [3, 5, 137, 138, 139, 140, 141, 146, 147, 177], "server": 155, "set": [2, 4, 5, 6, 7, 9, 10, 11, 12, 15, 16, 17, 18, 20, 126, 129, 130, 131, 133, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 152, 154, 155, 158, 160, 163, 165, 166, 167, 169, 170, 173, 175, 177], "set_attr": [11, 12, 155], "set_color": 173, "set_linewidth": 173, "set_mod": 165, "set_np_random_through_se": [2, 10], "set_rotate_label": 175, "set_stat": 154, "set_them": 173, "set_titl": [170, 173, 175], "set_vis": 173, "set_wrapper_attr": [15, 155], "set_xlabel": [170, 175], "set_xtick": 173, "set_xticklabel": 175, "set_ylabel": 175, "set_yticklabel": 175, "set_zlabel": 175, "setup": 154, "seungjaeryanle": 154, "seven": 147, "sever": [2, 4, 7, 10, 17, 129, 154, 155, 157, 158, 160, 161, 173], "sfff": 152, "sfffffff": 152, "shape": [2, 5, 6, 7, 8, 10, 11, 13, 14, 16, 17, 18, 20, 131, 132, 133, 134, 135, 137, 140, 141, 150, 151, 152, 153, 154, 155, 157, 158, 165, 167, 170, 173, 177], "share": [5, 11, 155, 177], "shared_featur": 177, "shared_memori": [11, 13], "shared_net": 177, "shen": 155, "shift": 7, "shimmi": 155, "shin": [138, 140, 141], "shinrl": 148, "shoot": 11, "short": [136, 154, 158, 163], "shorthand": 154, "should": [1, 2, 5, 7, 9, 10, 11, 12, 13, 14, 17, 136, 145, 154, 155, 157, 158, 160, 161, 162, 163, 165, 167, 169, 170, 175], "shoulder": 144, "shouldn": [155, 160, 165], "show": [3, 136, 150, 154, 155, 160, 161, 167, 170, 173, 175, 177], "showcas": [163, 175], "shown": [128, 131, 160, 166], "shrink_act": 14, "siddarth": [155, 177], "side": [13, 129, 173], "sign": [142, 143, 177], "signal": [2, 17, 137, 139, 140, 147, 153, 157, 165, 166, 173], "signatur": 159, "signific": [148, 155, 159, 169], "significantli": [154, 155, 170], "silenc": 9, "silent": 155, "silver": [170, 173], "sim": [148, 173], "similar": [6, 10, 144, 154, 155, 158, 160, 165], "similarli": [129, 155, 157], "simlat": 136, "simonini": 173, "simoninithoma": 173, "simpl": [127, 148, 149, 156, 157, 166, 177], "simplegrid": 148, "simpler": 157, "simplest": 170, "simpli": [14, 16, 17, 18, 19, 158, 165], "simplic": 148, "simplifi": [135, 154, 155, 157], "simplist": [158, 165], "simul": [2, 136, 140, 141, 142, 143, 148, 163, 173, 175, 177], "simultan": 155, "sin": [129, 135, 142, 145], "sinc": [5, 14, 17, 126, 132, 141, 145, 153, 154, 155, 158, 159, 165, 166, 169, 173], "sine": [131, 142, 145], "singl": [2, 3, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 18, 20, 139, 147, 148, 154, 155, 157, 158, 162, 165, 175, 177], "single_action_spac": [10, 13, 14, 155, 170], "single_func": 14, "single_observation_spac": [10, 11, 14, 155, 170], "single_rgb_arrai": 154, "sinusoid": [133, 134], "site": 154, "situat": 153, "six": 147, "size": [5, 13, 14, 17, 152, 154, 155, 158, 165, 169], "skeleton": 165, "skip": [9, 17, 18, 20, 154, 155, 165], "skip_and_max_ob": 18, "skip_close_check": 9, "skip_render_check": 9, "sleep": 129, "slide": [138, 139, 142, 143, 144, 145, 146, 147], "slider": [142, 143], "slider1": 146, "slider2": 146, "slight": 127, "slightli": [9, 127, 136, 137, 140, 141, 142, 145], "slime": 148, "slimevolleygym": 148, "slipperi": [151, 152], "slow": [155, 163, 175], "slowdown": 9, "slower": [129, 170], "small": [127, 148, 149, 154, 155, 177], "smallest": 7, "smart": 148, "smoother": [154, 175], "sn": [173, 175, 177], "snake_cas": 1, "snippet": [9, 17], "snow": 152, "so": [1, 9, 11, 14, 17, 18, 20, 129, 130, 137, 138, 139, 140, 141, 144, 145, 146, 147, 151, 152, 154, 155, 157, 163, 165, 166, 167, 169, 170, 173, 175], "soft": 148, "softmax": 170, "softrobot": 148, "softwar": [2, 148, 155], "sokoban": 148, "sole": [154, 155, 159], "solut": [129, 155, 163, 175], "solv": [2, 127, 130, 132, 136, 142, 143, 148, 163, 165, 171, 173, 176, 177], "some": [2, 5, 6, 7, 11, 12, 14, 15, 128, 130, 131, 145, 146, 153, 154, 155, 157, 158, 160, 163, 165, 167, 169, 170, 173, 175], "someth": [154, 155, 157, 169], "sometim": [9, 137, 151, 152, 165, 167], "sonelu": 155, "soon": [154, 160, 165], "sort": 7, "sourc": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 136, 148, 154, 155, 165, 166, 167, 170, 171, 173, 175, 177], "south": 153, "space": [2, 9, 10, 11, 12, 14, 15, 16, 17, 18, 20, 136, 148, 149, 154, 155, 158, 163, 165, 167, 170, 173, 177], "spacecraft": 148, "spaces_kwarg": 6, "spaceship": 157, "span": 17, "spark": 148, "spars": [131, 165], "spawn": [11, 145, 170], "spec": [2, 3, 4, 10, 15, 17, 155], "special": [155, 159, 177], "specialis": 155, "specialist": [159, 162], "specif": [2, 4, 7, 10, 14, 17, 136, 154, 155, 166, 170, 177], "specifi": [2, 5, 6, 9, 14, 16, 17, 18, 20, 126, 127, 129, 130, 135, 137, 138, 139, 140, 141, 143, 146, 147, 149, 152, 153, 154, 155, 157, 158, 160, 161, 165, 167], "speed": [10, 127, 128, 155, 157, 161, 170], "speedup": 170, "spell": 154, "spent": [127, 129], "sphere": 144, "spheric": 145, "sphinx": 1, "spine": 173, "spinningup": 177, "split": [147, 155], "squar": [17, 144, 145, 153, 158, 165, 170, 173], "squeez": 170, "st": 173, "st_all": 173, "stabil": [14, 18, 19, 140, 141, 154, 167, 177], "stabl": [18, 148, 154, 155, 165], "stable_baselines3": 18, "stack": [6, 17, 18, 20, 155, 177], "stack_siz": 18, "stacked_env": 18, "stage": [17, 155], "stai": [158, 170], "stand": [127, 136, 137, 139, 140, 141, 142, 143, 147, 155, 163, 175], "standard": [14, 18, 19, 148, 154, 155, 156, 170, 177], "standard_norm": 154, "standup": 136, "start": [1, 2, 5, 7, 9, 11, 13, 17, 155, 156, 170, 173, 175, 177], "start_epsilon": [163, 175], "state": [2, 3, 4, 17, 18, 20, 130, 136, 148, 149, 154, 155, 157, 158, 160, 163, 166, 169, 170, 173, 175, 177], "state_dict": 170, "state_info": 3, "state_s": 173, "state_valu": [170, 175], "state_value_pr": 170, "statefulli": 3, "stateless": [3, 155], "statement": 155, "statetyp": [3, 155], "stationari": 170, "statist": [9, 14, 17, 18, 19, 129, 155, 161, 177], "stats_kei": [14, 17], "std": [14, 19, 177], "stderr": 154, "steer": 128, "stefanbschneid": 155, "step": [2, 3, 4, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 127, 129, 131, 132, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 151, 152, 153, 154, 155, 156, 157, 161, 162, 163, 166, 167, 170, 173, 175, 177], "step_api_compat": [9, 159], "step_async": 155, "step_index": 9, "step_info": 3, "step_return": 9, "step_starting_index": 9, "step_trigg": [9, 17], "step_wait": 155, "stepapicompat": [155, 159], "steppabl": 148, "steps_df": 173, "stick": [150, 175], "stickyact": [16, 20, 155], "still": [9, 154, 155, 160, 161, 167, 170, 175], "stochast": [130, 131, 133, 134, 136, 151, 153, 154, 173], "stock": 148, "stool": 152, "stop": [17, 150, 155, 157, 166, 175], "stopiter": 13, "storag": 148, "store": [4, 9, 10, 14, 17, 155, 177], "str": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 160, 173, 175, 177], "strang": 155, "strateg": [133, 134], "strategi": [162, 163, 175], "strict": [2, 154, 155], "strictli": 17, "striker": 154, "string": [2, 4, 5, 7, 9, 14, 18, 154, 155, 157, 158, 165], "stringio": 2, "stronger": 169, "structur": [3, 5, 6, 8, 155, 158, 165], "stuck": 160, "stump": 127, "style": [2, 4, 10, 148, 154, 155, 161, 177], "sub": [6, 10, 11, 12, 14, 155, 162], "subclass": [14, 154], "subject": 146, "submit": [148, 154], "subplot": [170, 173, 175], "subprocess": 11, "subse": 154, "subsect": 169, "subsequ": [6, 154], "subset": [7, 155, 167], "subspac": [5, 6, 7, 8, 14, 18, 154], "substanti": [155, 162], "subtract": 170, "succeed": 157, "success": [131, 153, 155, 166], "successfulli": [2, 153], "suffici": [163, 175], "suggest": [14, 19, 154, 170], "suit": 154, "suitabl": [148, 149], "sum": [6, 14, 17, 19, 129, 150, 155, 163, 173, 175, 177], "summari": 160, "summaris": 155, "sumo": 148, "super": [2, 14, 15, 148, 154, 158, 160, 165, 167, 170, 177], "superclass": [5, 16, 19], "supersuit": 155, "supertuxkart": 148, "suppli": 166, "support": [2, 5, 6, 8, 9, 17, 18, 20, 136, 137, 138, 139, 140, 141, 143, 146, 147, 148, 154, 155, 157, 165], "supportsfloat": [2, 7, 15, 19, 160, 167, 177], "suppos": 9, "suptitl": [170, 175], "sure": [1, 2, 9, 17, 158, 165, 167], "surfac": [142, 143, 147, 165], "surfarrai": 165, "surround": [17, 20], "survei": 162, "suspend": 146, "sutton": [2, 131, 132, 142, 143, 150, 151, 154, 163, 173, 175], "sutton_barto_reward": [132, 155], "swig": [126, 154, 155, 157], "swim": 136, "swimmer": [136, 147, 154], "swing": [131, 135], "swingup": 135, "switch": [154, 155], "symbol": 148, "sync": [4, 9, 10, 12, 14, 132, 155], "synchron": [10, 11, 12, 155], "syncvectorenv": [4, 10, 132, 155, 158, 162], "syntax": 4, "synthes": 148, "synthesi": [140, 141], "system": [131, 135, 136, 142, 154, 155, 160, 165], "system_check": 6, "t": [1, 2, 4, 6, 8, 9, 14, 15, 17, 19, 128, 129, 133, 134, 153, 154, 155, 158, 160, 161, 163, 165, 166, 170, 173, 175], "t_cov": [5, 13], "tabl": 154, "tabular": [148, 163], "tag": 148, "tai": 155, "tailkind": 9, "take": [1, 2, 3, 4, 7, 8, 9, 10, 11, 16, 17, 130, 132, 134, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 153, 154, 157, 160, 163, 167, 170, 173, 175, 177], "taken": [10, 17, 18, 20, 130, 132, 157, 161, 162, 165, 170, 172, 177], "talk": [162, 177], "tall": 137, "tan": 155, "tanh": [129, 177], "target": [18, 131, 136, 144, 145, 158, 165, 166, 167, 170], "target_dur": 9, "target_i": 145, "target_posit": 167, "target_x": 145, "task": [2, 6, 15, 128, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 153, 157, 165, 166, 167, 173, 177], "tassa": [139, 140, 141], "tau": 135, "taxi": [149, 154, 155], "taxi_col": 153, "taxi_row": 153, "td": [170, 173], "td3": 172, "td_error": 170, "team": [154, 155], "technic": [16, 155], "techniqu": [17, 20], "techreport": [133, 134], "tell": [163, 166, 175], "templat": [3, 165], "temporal_differ": [163, 175], "temporarili": 13, "ten": [150, 175], "tendon": [140, 141], "tendon_length": [140, 141], "tendon_veloc": [140, 141], "tensor": [14, 17, 20, 170, 177], "tensorboard": 161, "term": [2, 14, 18, 130, 131, 136, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 158, 165, 167, 170], "termin": [2, 3, 9, 10, 11, 12, 14, 17, 18, 19, 20, 131, 132, 133, 134, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 163, 165, 167, 170, 173, 175, 177], "terminal_observ": 154, "terminal_on_life_loss": 17, "terminaltyp": [3, 155], "terminate_when_unhealthi": [137, 139, 140, 147, 169], "terminatedtruncatedsteptyp": 9, "terminationwrapp": 169, "terrain": [127, 148], "terri": 155, "terribli": 154, "test": [9, 17, 127, 148, 154, 155, 165, 177], "tetri": 148, "text": [2, 5, 7, 8, 13, 150, 151, 152, 153, 154, 155, 157], "textencod": 154, "th": [17, 18, 20], "than": [1, 4, 129, 131, 132, 133, 134, 143, 144, 150, 154, 155, 160, 161, 163, 169, 170, 173, 175], "thank": 155, "theden": 154, "thei": [5, 11, 17, 20, 136, 149, 150, 151, 152, 154, 155, 158, 160, 162, 165, 169, 170], "theirfor": 170, "them": [3, 5, 14, 15, 17, 129, 137, 138, 145, 148, 153, 154, 155, 159, 167, 169, 170], "themselv": [148, 161], "theo": 155, "theorem": 148, "theoret": [148, 154, 155], "theori": [135, 142, 154, 157, 170, 173], "therefor": [2, 5, 17, 137, 140, 141, 145, 147, 154, 155, 158, 160, 161], "thesi": [133, 134, 146], "theta": [135, 154], "theta1": 131, "theta2": 131, "theta_dt": 135, "thi": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 165, 166, 167, 169, 170, 172, 173, 175, 177], "thigh": [138, 139, 140, 141, 147], "thigh_joint": [139, 147], "thigh_left_joint": 147, "thing": [13, 15, 154, 158, 167], "think": [13, 148], "third": [137, 138, 139, 140, 141, 143, 146, 147, 154, 155, 157, 165], "third_party_environ": 154, "thoma": 173, "those": [13, 14, 131, 154, 167, 170, 173], "though": [6, 11, 154], "thought": [154, 155], "thousand": [155, 161], "thousandth": 17, "thread": 155, "three": [5, 6, 15, 139, 140, 141, 146, 154, 158, 161, 165], "threshold": [4, 131, 132, 153, 165], "throttl": 129, "through": [2, 3, 4, 5, 10, 11, 12, 14, 17, 18, 19, 20, 140, 141, 148, 154, 155, 156, 158, 159, 160, 161, 162, 166], "throw": [155, 167], "thrower": 154, "thruster": 129, "thu": [11, 137, 140, 141, 142, 155, 170], "thytu": 155, "ti": 173, "tick": 165, "tight": 173, "tight_layout": [170, 173, 175], "tile": [128, 173], "tileset": 152, "till": [170, 175], "till2": 155, "tilleri": 152, "tilt": 129, "tim": 173, "time": [1, 2, 6, 9, 10, 14, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 127, 128, 132, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 148, 151, 153, 154, 155, 158, 160, 161, 162, 163, 165, 168, 169, 170, 171, 172, 175, 177], "time_limit": [152, 153], "time_queu": [14, 17, 161], "timeawareobserv": [10, 18, 20, 155, 157], "timelimit": [2, 4, 15, 17, 20, 127, 128, 129, 131, 132, 133, 134, 135, 155, 157, 158, 165, 166], "times10": [137, 140, 141], "timestamp": 2, "timestep": [2, 17, 18, 20, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 157, 158, 165, 169, 170, 177], "timgates42": 154, "timschneider42": 155, "tip": [138, 142, 146, 147], "tips_arm": 144, "titl": [9, 133, 134, 155, 173, 175, 177], "tmrl": 148, "to_json": [5, 155], "to_torch": 155, "tobirohr": 155, "toctre": 1, "todo": 153, "todorov": [139, 140, 141], "togeth": [5, 12, 136, 173], "toi": [126, 150, 151, 152, 153, 154], "tom": 153, "toml": 165, "ton": 154, "too": [137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 169, 173], "tool": [137, 138, 139, 140, 141, 143, 146, 147], "toolbox": 148, "toolkit": 148, "top": [128, 129, 130, 133, 134, 139, 142, 143, 147, 148, 154, 155, 158, 165, 173, 177], "torch": [2, 14, 17, 155, 170, 177], "torqu": [127, 129, 131, 135, 137, 138, 139, 140, 141, 144, 145, 146, 147], "torso": [137, 138, 139, 140, 141, 147, 169], "toss": 7, "total": [2, 127, 128, 129, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 153, 161, 170, 173, 177], "total_episod": 173, "total_num_episod": 177, "total_reward": 173, "total_timestep": 155, "touch": [14, 158, 165], "touretzki": 131, "toward": [146, 154], "tower": 155, "towers2024gymnasium": 155, "towers_gymnasium_2023": 155, "town": 155, "toy_text": [152, 154, 163, 173, 175], "tpu": 162, "tqdm": [163, 170, 173, 175], "trace": [127, 155], "traceback": [13, 17], "track": [9, 14, 17, 20, 128, 132, 137, 138, 139, 140, 146, 147, 148, 155, 161], "trackmania": 148, "trade": [148, 155], "tradeoff": 170, "traffic": 148, "train": [137, 140, 141, 142, 145, 148, 154, 155, 157, 160, 168, 172], "training_error": [163, 175], "training_error_moving_averag": 175, "training_period": 161, "trajectori": [14, 18, 19, 129, 140, 141], "transfer": [148, 162], "transform": [3, 5, 8, 9, 14, 15, 16, 18, 19, 20, 145, 157, 167], "transformact": [14, 16, 20], "transformobserv": [14, 18, 20], "transformreward": [14, 19, 20], "transit": [3, 9, 140, 141, 151, 152, 153, 155, 156, 157, 166, 170], "transition_info": 3, "translat": [137, 140, 141, 155, 158, 165], "transport": 148, "transpos": [9, 165], "treasur": 173, "treatment": 148, "tremend": 154, "tri": 155, "trigaten": 154, "trigger": [9, 17, 153], "tristan": 155, "tristandeleu": 154, "trivial": 165, "true": [2, 4, 6, 8, 9, 10, 11, 12, 14, 16, 17, 18, 19, 127, 128, 132, 136, 137, 138, 139, 140, 141, 146, 147, 150, 152, 154, 155, 157, 159, 160, 161, 165, 169, 170, 173, 175, 177], "true_": [16, 18], "trunc": [14, 18], "truncat": [2, 4, 9, 10, 11, 12, 14, 17, 18, 19, 20, 128, 131, 132, 133, 134, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 163, 165, 167, 169, 170, 173, 175, 177], "trunk": 169, "try": [141, 148, 154, 155, 163, 175], "tuckerbmorgan": 154, "tune": [148, 177], "tupl": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 20, 137, 139, 140, 147, 150, 154, 155, 157, 158, 160, 163, 165, 170, 175, 177], "turbotimon": 155, "turbul": [129, 154], "turbulence_pow": [129, 170], "turn": [9, 11, 17, 128, 129, 142, 148, 154, 155, 160], "tutori": [2, 15, 155, 158, 160, 161, 163, 167, 169, 170, 173, 175, 177], "tutorials_jupyt": 171, "tutorials_python": 171, "tweak": [136, 146], "two": [2, 5, 6, 7, 9, 10, 17, 18, 20, 127, 129, 131, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 145, 146, 147, 148, 150, 154, 155, 158, 160, 161, 162, 163, 165, 170, 175], "txt": [1, 154], "type": [2, 4, 5, 6, 7, 10, 14, 16, 17, 18, 20, 128, 132, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 158, 160, 161, 165, 167, 173, 177], "typeerror": 155, "typic": [2, 9, 153, 169], "typo": [154, 155], "u": [5, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 158, 163, 165, 167, 175, 177], "uav": 148, "uint8": [18, 128], "un": 132, "unattach": [144, 145], "unbound": [7, 18], "unclear": 155, "uncommon": 155, "undefin": 2, "under": [2, 155, 163, 175, 177], "underli": [2, 15, 17, 131, 155, 157], "underneath": [15, 132, 157], "underpow": 130, "underscor": 154, "understand": [2, 8, 10, 144, 155, 177], "undocu": 155, "uneven": 127, "unexpect": [11, 129, 154], "unflatten": [5, 8, 155], "unhealthi": [137, 139, 140, 142, 143, 147], "unicod": 9, "uniform": [5, 7, 8, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 173], "uniformli": [7, 131, 132, 153, 158, 165], "unintend": 155, "union": [6, 7, 19, 155], "uniqu": [6, 10, 126, 130, 136, 142, 154], "unit": [5, 14, 18, 19, 20, 129, 131, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148], "uniti": 148, "unitless": [142, 145], "unitre": 169, "unitree_go1": 169, "univers": [133, 134, 173], "unknown": [2, 9, 10, 148, 157], "unless": [7, 15, 150, 151, 152, 153, 154, 165], "unlik": [6, 177], "unnecessari": [154, 155], "unnescesari": 154, "unord": 5, "unpickl": [9, 13, 154, 155], "unplay": 9, "unsupport": 7, "untermin": 132, "until": [17, 144, 150, 151, 152, 154, 155, 157, 158, 165, 170, 173, 177], "unus": 154, "unwrap": [2, 10, 15, 131, 155, 157, 158], "up": [2, 6, 7, 10, 127, 129, 136, 137, 139, 140, 141, 147, 150, 151, 152, 153, 155, 157, 158, 161, 163, 165, 170, 173, 175], "updat": [1, 2, 3, 9, 14, 16, 17, 18, 19, 137, 139, 140, 147, 153, 154, 155, 157, 158, 159, 160, 163, 165, 166, 167, 170, 173, 175, 177], "update_paramet": 170, "update_running_mean": [18, 19], "upgrad": [154, 155], "uph": 141, "uph_cost": 141, "uph_cost_weight": 141, "upon": [133, 134], "upper": [7, 14, 19, 20, 140, 141, 154, 157], "upright": [132, 135, 142, 143], "upstream": 155, "urandom": 2, "urdf": 169, "url": [154, 155, 170], "urldat": 155, "us": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 126, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 152, 153, 154, 155, 157, 159, 160, 161, 162, 163, 166, 167, 169, 171, 175, 176], "usabl": [150, 154, 163, 175], "usable_ac": 175, "usag": [9, 155, 158, 163, 167], "use_contact_forc": [137, 155], "use_cuda": 170, "useless": 169, "user": [2, 3, 4, 5, 9, 10, 12, 14, 18, 20, 131, 132, 133, 134, 135, 154, 155, 157, 160, 165, 177], "usual": [2, 6, 7, 17, 139, 158, 165, 167, 169, 170, 175], "util": [18, 148, 154, 155, 157, 159, 165, 170], "utilis": [10, 154, 160], "utli": 155, "v": [4, 133, 170], "v0": [2, 4, 9, 10, 14, 16, 17, 18, 19, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 157, 158, 165, 166, 177], "v1": [2, 4, 9, 10, 11, 12, 14, 16, 17, 18, 19, 127, 128, 129, 131, 132, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 154, 157, 159, 161, 163, 173, 175], "v2": [127, 128, 129, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 167], "v21": [154, 155], "v24": 154, "v25": [154, 160], "v26": [2, 155, 177], "v28": 155, "v2g": 148, "v3": [9, 14, 17, 18, 127, 128, 129, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 153, 154, 155, 156, 157, 160, 170], "v4": [14, 15, 16, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 177], "v5": [17, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 169], "vairodp": 155, "val": 173, "valentin": 155, "valid": [2, 4, 5, 7, 10, 13, 14, 157, 160, 170, 175], "vallei": [133, 134], "valu": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 20, 127, 129, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 154, 155, 157, 158, 160, 163, 165, 166, 169, 170, 173, 175, 177], "value_grid": 175, "value_pr": 170, "valueerror": [7, 8, 11, 12, 13, 17], "vampir": 148, "var": 19, "vari": [132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 177], "variabl": [2, 5, 7, 10, 11, 13, 14, 15, 17, 136, 137, 139, 146, 147, 148, 154, 155, 157, 161, 163, 165, 166, 175, 177], "varianc": [14, 18, 19, 20, 170], "variant": [128, 158, 165], "variat": 136, "varieti": [9, 148], "variou": [5, 141, 148, 154, 177], "ve": [154, 155], "vec_env": 9, "vecenvwrapp": 154, "vector": [2, 4, 7, 9, 11, 12, 16, 17, 18, 19, 127, 129, 142, 143, 145, 148, 154, 155, 158, 161, 168, 171, 177], "vector_entry_point": [4, 132, 155], "vector_env": 14, "vector_envs_tutori": 170, "vector_kwarg": 4, "vector_list_info": 154, "vectoractionwrapp": [14, 155], "vectorenv": [4, 5, 14, 154, 155], "vectorenvcr": 4, "vectorenvwrapp": 155, "vectoris": [5, 148], "vectorization_mod": [4, 9, 10, 11, 12, 14, 132, 155], "vectorizemod": 4, "vectorizetransformact": [14, 20], "vectorizetransformobserv": [14, 20], "vectorizetransformreward": [14, 20], "vectorizor": [4, 132], "vectorlistinfo": [14, 154, 155], "vectorobservationwrapp": [14, 155], "vectorrewardwrapp": [14, 155], "vectorwrapp": [14, 155], "vehicl": 148, "veloc": [2, 8, 13, 127, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148], "velocity_penalti": 142, "veri": [6, 129, 144, 154, 155, 157, 158, 162, 165, 170], "verifi": 9, "vermouth1992": [154, 155], "versa": [13, 17, 18], "version": [1, 2, 4, 5, 14, 16, 17, 18, 19, 148, 154, 155, 158, 163, 165, 166, 170, 175], "vertic": [127, 139, 141, 142, 143, 147, 158, 165], "vf_next_stat": 166, "vf_target": 166, "via": [5, 9, 14, 15, 17, 126, 130, 136, 148, 149, 157, 158, 165, 177], "viabl": [163, 175], "vice": [13, 17, 18], "vicent": 155, "video": [1, 17, 20, 127, 154, 155, 161, 173], "video_fold": [9, 17, 161], "video_length": [9, 17], "videorecord": [17, 154, 155], "view": [2, 155], "view_init": 175, "viewer": [10, 11, 12, 155], "viewport": 129, "viewport_h": 129, "viewport_w": 129, "virgilt": 154, "viridi": 175, "viscou": 146, "visibl": 165, "visit": [128, 173], "visual": 157, "visual_opt": [136, 155], "visualis": 2, "vizdoom": 148, "vmoen": [154, 155], "voidflight": 155, "vol": [131, 153], "volleybal": 148, "volunt": 155, "vscode": 155, "vwxyzjn": 154, "vx": [14, 17, 129], "vy": 129, "w": [9, 137, 140, 141, 170], "w_": [137, 138, 139, 140, 141, 144, 145, 146, 147], "wa": [2, 4, 6, 9, 10, 14, 17, 18, 19, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 154, 155, 159, 160, 163, 166, 167, 170, 177], "wai": [2, 13, 15, 17, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 157, 158, 163, 166, 167, 170, 173, 175], "wait": [9, 10, 11, 154, 166], "wait_on_play": 9, "wake": 129, "walk": [136, 140, 147, 149, 152, 155, 158, 165, 169], "walker": [126, 145, 147, 154], "walker2d": [136, 155], "walker2d_v5": 147, "wall": [133, 134, 153], "wandb": 161, "want": [1, 2, 4, 6, 11, 15, 17, 137, 140, 141, 154, 155, 157, 158, 161, 163, 165, 167, 169, 170, 173, 175], "ward": 177, "warehous": 148, "warn": [6, 9, 11, 17, 144, 154, 155], "wasn": 17, "watch": 1, "watkin": 163, "wawrzy\u0144ski": 138, "we": [1, 2, 3, 4, 5, 7, 9, 10, 13, 18, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 165, 166, 167, 169, 170, 175, 177], "web": 148, "websit": [21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 136, 154, 155], "weigh": 137, "weight": [137, 138, 139, 140, 141, 144, 145, 146, 147, 148, 154, 166, 167, 177], "well": [2, 5, 14, 131, 148, 154, 165, 170, 173], "were": [126, 131, 136, 154, 155, 160, 165, 169, 170], "west": 153, "what": [2, 5, 14, 19, 148, 151, 155, 162], "wheel": [126, 128, 154], "when": [2, 4, 5, 6, 8, 9, 10, 11, 12, 13, 14, 17, 18, 20, 128, 129, 137, 139, 140, 141, 142, 143, 144, 147, 151, 152, 153, 154, 155, 157, 158, 159, 161, 165, 166, 167, 169, 170, 173, 177], "whenev": [17, 129, 160, 165], "where": [1, 4, 6, 7, 8, 9, 14, 17, 128, 129, 131, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 156, 157, 158, 160, 162, 163, 165, 166, 167, 169, 170, 173, 175, 177], "wherea": 173, "whether": [2, 3, 4, 5, 7, 9, 17, 129, 137, 138, 139, 140, 141, 146, 147, 148, 150, 160, 163, 165, 166, 175, 177], "which": [1, 2, 4, 5, 6, 8, 9, 10, 14, 17, 18, 127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 150, 151, 152, 153, 154, 155, 157, 158, 160, 162, 165, 166, 167, 169, 170, 172, 173, 177], "while": [4, 8, 9, 11, 14, 17, 18, 19, 129, 131, 132, 135, 137, 140, 141, 146, 150, 151, 154, 157, 158, 160, 161, 163, 165, 166, 170, 173, 175, 177], "white": 154, "who": 155, "whole": [4, 154, 155, 169], "whose": [137, 150], "why": 129, "wide": [148, 157], "width": [136, 155, 165], "wikipedia": 173, "willdudlei": 155, "william": [133, 134, 177], "win": [150, 163, 175], "wind": [129, 154, 155], "wind_pow": [129, 154, 170], "window": [2, 9, 128, 136, 154, 155, 165], "window_s": 165, "wing": 148, "wire": 148, "wireless": 148, "wise": 177, "wish": [9, 154, 155, 157, 160, 161], "within": [2, 5, 9, 13, 14, 16, 17, 18, 20, 129, 130, 143, 154, 155, 157, 158, 160], "without": [14, 15, 17, 19, 140, 148, 150, 152, 154, 155, 157, 158, 160, 163, 165, 169, 175], "won": [2, 163, 175], "word": [144, 177], "work": [5, 9, 128, 129, 131, 138, 139, 142, 143, 148, 154, 155], "worker": 11, "workflow": 155, "workshop": 148, "world": [129, 148, 151, 152, 153, 155, 173], "worldbodi": [137, 140, 141], "worri": 165, "wors": 154, "would": [6, 8, 16, 18, 19, 137, 138, 139, 140, 146, 147, 154, 155, 156, 158, 165, 167, 170], "wouldn": 155, "wrap": [2, 4, 9, 11, 12, 14, 15, 16, 17, 18, 19, 20, 148, 157, 165, 167, 170, 177], "wrapped_env": [14, 15, 16, 17, 18, 157, 158, 165, 167, 177], "wrapped_env_ob": 16, "wrapped_obs0": 18, "wrapped_obs1": 18, "wrappedenvacttyp": 155, "wrappedenvobstyp": 155, "wrapper": [2, 4, 6, 7, 9, 10, 13, 137, 138, 139, 140, 141, 143, 146, 147, 148, 152, 153, 154, 155, 156, 157, 159, 161, 163, 168, 170, 171, 175, 177], "wrapper_spec": 15, "wrapperacttyp": [15, 16], "wrapperobstyp": [15, 18], "wrapperspec": [4, 15], "wrist": 144, "write": [5, 11, 13, 155, 165, 169], "write_to_shared_memori": [5, 13], "written": [9, 155, 160, 162], "wrong": [154, 155], "www": [128, 150, 151, 152, 154], "x": [2, 5, 8, 14, 129, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 147, 148, 151, 155, 158, 161, 165, 170, 173, 177], "x_": [137, 138, 139, 140, 142, 146, 147], "x_init": 133, "x_posit": [14, 137, 138, 139, 140, 141, 146, 147], "x_veloc": 14, "xml": [137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 154, 155, 169], "xml_file": [136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 155, 169], "xpo": [137, 140, 144, 145], "xtick": 175, "xticklabel": 173, "xuanhien070594": 155, "xuehaipan": [154, 155], "xx": 173, "xy": 142, "y": [2, 129, 135, 137, 138, 139, 140, 141, 142, 144, 145, 146, 153, 158, 173, 177], "y_": 142, "y_coordin": 142, "y_init": 133, "y_posit": [137, 138, 139, 140, 141, 146, 147], "yaniv": 155, "year": [133, 134, 154, 155], "yellow": 153, "yet": [148, 155], "yield": 13, "ylabel": 173, "you": [1, 2, 5, 6, 7, 8, 9, 11, 15, 16, 17, 18, 19, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 127, 128, 137, 140, 141, 148, 154, 155, 156, 157, 158, 161, 163, 165, 166, 167, 169, 170, 172, 173], "youjiacheng": 154, "youni": [154, 155], "younik": [154, 155], "your": [1, 2, 5, 9, 11, 15, 16, 17, 18, 128, 137, 140, 141, 148, 154, 155, 156, 157, 158, 161, 162, 163, 167, 168, 170, 171, 175], "your_env": 9, "yourself": [11, 128, 163, 165, 175], "ytick": 175, "yticklabel": 173, "z": [137, 138, 139, 140, 141, 142, 144, 145, 147], "z_": 141, "z_distance_from_origin": [139, 141, 147], "zaxi": 175, "zemann": [170, 175], "zenodo": 155, "zero": [7, 13, 18, 135, 137, 139, 140, 141, 147, 155, 158, 163, 165, 166, 170, 173, 175, 177], "zero_grad": [170, 177], "zhiqingxiao": 154, "zip": 171, "zlig": 154, "zoom": 9, "zuoxingdong": 154, "zzyunzhi": 154}, "titles": ["404 - Page Not Found", "Gymnasium-docs", "Env", "Functional Env", "Make and register", "Spaces", "Composite Spaces", "Fundamental Spaces", "Spaces Utils", "Utility functions", "Vectorize", "AsyncVectorEnv", "SyncVectorEnv", "Utility functions", "Wrappers", "Wrappers", "Action Wrappers", "Misc Wrappers", "Observation Wrappers", "Reward Wrappers", "List of Wrappers", "Atari", "Adventure", "Air Raid", "Alien", "Amidar", "Assault", "Asterix", "Asteroids", "Atlantis", "Atlantis2", "Backgammon", "Bank Heist", "Basic Math", "Battle Zone", "Beam Rider", "Berzerk", "Blackjack", "Bowling", "Boxing", "Breakout", "Carnival", "Casino", "Centipede", "Chopper Command", "Crazy Climber", "Crossbow", "Darkchambers", "Defender", "Demon Attack", "Donkey Kong", "Double Dunk", "Earthworld", "Elevator Action", "Enduro", "Entombed", "Et", "Fishing Derby", "Flag Capture", "Freeway", "Frogger", "Frostbite", "Galaxian", "Gopher", "Gravitar", "Hangman", "Haunted House", "Hero", "Human Cannonball", "Ice Hockey", "Jamesbond", "Journey Escape", "Kaboom", "Kangaroo", "Keystone Kapers", "King Kong", "Klax", "Koolaid", "Krull", "Kung Fu Master", "Laser Gates", "Lost Luggage", "Mario Bros", "Miniature Golf", "Montezuma Revenge", "Mr Do", "Ms Pacman", "Name This Game", "Othello", "Pacman", "Phoenix", "Pitfall", "Pitfall2", "Pong", "Pooyan", "Private Eye", "Qbert", "Riverraid", "Road Runner", "Robotank", "Seaquest", "Sir Lancelot", "Skiing", "Solaris", "Space Invaders", "Space War", "Star Gunner", "Superman", "Surround", "Tennis", "Tetris", "Tic Tac Toe 3D", "Time Pilot", "Trondead", "Turmoil", "Tutankham", "Up N Down", "Venture", "Video Checkers", "Video Chess", "Video Cube", "Video Pinball", "Wizard Of Wor", "Word Zapper", "Yars Revenge", "Zaxxon", "Box2D", "Bipedal Walker", "Car Racing", "Lunar Lander", "Classic Control", "Acrobot", "Cart Pole", "Mountain Car", "Mountain Car Continuous", "Pendulum", "MuJoCo", "Ant", "Half Cheetah", "Hopper", "Humanoid", "Humanoid Standup", "Inverted Double Pendulum", "Inverted Pendulum", "Pusher", "Reacher", "Swimmer", "Walker2D", "External Environments", "Toy Text", "Blackjack", "Cliff Walking", "Frozen Lake", "Taxi", "Gym Release Notes", "Gymnasium Release Notes", "<no title>", "Basic Usage", "Create a Custom Environment", "Compatibility with Gym", "Migration Guide - v0.21 to v1.0.0", "Recording Agents", "Speeding Up Training", "Training an Agent", "Gymnasium Basics", "Make your own custom environment", "Handling Time Limits", "Implementing Custom Wrappers", "Gymnasium Basics", "Load custom quadruped robot environments", "Training A2C with Vector Envs and Domain Randomization", "Tutorials", "Third-Party Tutorials", "Frozenlake benchmark", "Training Agents", "Solving Blackjack with Q-Learning", "Training Agents", "Training using REINFORCE for Mujoco"], "titleterms": {"0": [154, 155, 160, 169], "0a1": 155, "0a2": 155, "1": [154, 155, 169], "11": 173, "12": 154, "19": 154, "2": [154, 155, 169], "20": 154, "21": [154, 159, 160], "22": 154, "23": 154, "24": 154, "25": 154, "26": [154, 155, 160], "27": 155, "28": 155, "29": 155, "3": [154, 155, 169], "3d": 111, "4": [154, 169, 173], "404": 0, "5": 154, "6": 154, "7": [154, 173], "9": [154, 173], "Not": 0, "Of": 122, "The": [0, 173], "__init__": 158, "a2c": 170, "about": 172, "acrobot": 131, "across": 136, "action": [14, 16, 53, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 157, 163, 175], "actionwrapp": 167, "actor": 170, "ad": 1, "addit": [4, 10, 11, 12], "advantag": 170, "adventur": 22, "agent": [161, 163, 169, 170, 171, 174, 175, 176, 177], "agilerl": 172, "air": 23, "algorithm": 162, "alien": 24, "alpha": 155, "alpha1": 155, "altern": 165, "amidar": 25, "an": [1, 163, 175, 177], "analysi": 170, "ant": 137, "api": [9, 159], "argument": [127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153], "assault": 26, "asterix": 27, "asteroid": 28, "asynchron": 170, "asyncvectorenv": 11, "atari": 21, "atlanti": 29, "atlantis2": 30, "attack": 49, "attribut": [2, 5, 10, 15], "autonom": 148, "avail": 16, "backend": 136, "backgammon": 31, "bank": 32, "base": [3, 16], "basic": [33, 157, 164, 168, 171], "battl": 34, "beam": 35, "benchmark": [9, 173], "berzerk": 36, "better": 175, "biolog": 148, "biped": 127, "blackjack": [37, 150, 175], "bowl": 38, "box": 39, "box2d": 126, "breakout": 40, "bro": 82, "build": [1, 163, 175, 177], "can": 175, "cannonbal": 68, "captur": 58, "car": [128, 133, 134], "carniv": 41, "cart": 132, "casino": 42, "centiped": 43, "check": 9, "checker": 118, "cheetah": 138, "chess": 119, "chopper": 44, "class": 16, "classic": 130, "cliff": 151, "climber": 45, "close": 165, "code": [157, 160, 166], "command": 44, "common": [14, 17], "compar": 136, "compat": [9, 159], "composit": [5, 6], "comput": 148, "construct": [158, 165], "continu": 134, "control": 130, "convers": [14, 17], "convert": 3, "core": 4, "could": 0, "crazi": 45, "creat": [2, 158, 165, 173], "credit": [127, 128, 129], "critic": 170, "crossbow": 46, "cube": 120, "curv": 177, "custom": [158, 165, 167, 169], "darkchamb": 47, "data": [14, 17], "declar": 165, "defend": 48, "demon": 49, "depend": 173, "derbi": 57, "descript": [127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153], "do": [85, 175], "doc": 1, "document": 1, "domain": 170, "donkei": 50, "doubl": [51, 142], "down": 116, "download": 169, "drive": 148, "dunk": 51, "dure": 161, "dynam": [133, 134], "earthworld": 52, "econom": 148, "edit": 1, "electr": 148, "elev": 53, "end": [131, 132, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153], "enduro": 54, "energi": 148, "entomb": 55, "env": [2, 3, 165, 170], "environ": [1, 2, 3, 9, 10, 132, 148, 157, 158, 159, 160, 162, 165, 169, 170, 173, 175], "epilogu": 169, "episod": [127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161], "escap": 71, "et": 56, "everi": 161, "exact": 136, "exampl": 160, "execut": [163, 175], "explain": 157, "extern": 148, "ey": 95, "financi": 148, "first": 148, "fish": 57, "flag": 58, "found": 0, "freewai": 59, "frogger": 60, "from": [165, 167], "frostbit": 61, "frozen": 152, "frozenlak": 173, "fu": 79, "function": [3, 4, 5, 9, 13, 158], "fundament": [5, 7], "galaxian": 62, "game": [87, 148], "gate": 80, "gener": [160, 165], "golf": 83, "gopher": 63, "gravitar": 64, "guid": 160, "gunner": 106, "gym": [148, 154, 159], "gymnasium": [1, 10, 148, 155, 164, 165, 167, 168, 171, 172], "half": 138, "handl": 166, "hangman": 65, "haunt": 66, "heist": 32, "hero": 67, "heurist": 162, "histori": [127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153], "hockei": 69, "hopper": 139, "hous": 66, "human": 68, "humanoid": [140, 141], "ic": 69, "implement": [2, 14, 18, 19, 167], "import": [166, 175], "includ": 160, "inform": [150, 151, 152, 153, 157, 161], "inherit": 167, "initi": [155, 157, 165], "instanc": 165, "instruct": 1, "interact": 157, "introduct": 170, "invad": 104, "invert": [142, 143], "jamesbond": 70, "jax": 3, "journei": 71, "kaboom": 72, "kangaroo": 73, "kaper": 74, "keyston": 74, "king": 75, "klax": 76, "kong": [50, 75], "koolaid": 77, "krull": 78, "kung": 79, "lake": 152, "lancelot": 101, "lander": 129, "laser": 80, "later": 160, "learn": [166, 175, 177], "librari": 172, "limit": 166, "list": 20, "ll": 173, "load": [159, 169, 170], "lost": 81, "luggag": 81, "lunar": 129, "m": 86, "make": [4, 10, 158, 165], "map": 173, "mario": 82, "master": 79, "math": 33, "mathemat": 148, "medic": 148, "memori": 13, "method": [2, 5, 10, 11, 12, 15], "migrat": 160, "miniatur": 83, "misc": 17, "miscellan": 13, "model": 169, "modifi": [1, 157], "montezuma": 84, "more": [157, 161], "mountain": [133, 134], "mr": 85, "mujoco": [136, 177], "n": 116, "name": 87, "network": 177, "new": [1, 9], "note": [129, 146, 154, 155], "notic": 170, "number": 160, "observ": [14, 18, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 157, 158, 165, 169, 175], "observationwrapp": 167, "old": 9, "onli": [14, 20], "openai": 159, "optim": 162, "othello": 88, "other": [1, 148], "own": 165, "packag": 165, "pacman": [86, 89], "page": [0, 1], "paramet": [169, 173], "parti": [148, 172], "pendulum": [135, 142, 143], "perform": [9, 136, 170], "phoenix": 90, "pickl": 9, "pilot": 112, "pinbal": 121, "pitfal": 91, "pitfall2": 92, "plai": 170, "plot": [170, 177], "pole": 132, "polici": [163, 175, 177], "pong": 93, "pooyan": 94, "privat": 95, "pusher": 144, "q": [173, 175], "qbert": 96, "quadrup": 169, "race": 128, "raid": 23, "random": [160, 170], "reacher": 145, "recommend": 165, "record": 161, "refer": [128, 131, 150, 151, 153, 170, 173, 177], "regist": [4, 158, 165], "reinforc": 177, "releas": [154, 155], "remov": 160, "render": [9, 136, 160, 165], "reproduc": 136, "request": 0, "reset": [128, 158, 160, 165], "reveng": [84, 124], "reward": [14, 19, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 169], "rewardwrapp": 167, "rider": 35, "riverraid": 97, "road": 98, "robot": [148, 169], "robotank": 99, "run": 173, "runner": 98, "runtim": 9, "save": [9, 170], "seaquest": 100, "seed": [9, 160], "setup": [165, 169, 170, 175], "share": 13, "showcas": 170, "simul": 169, "sir": 101, "size": 173, "ski": 102, "solari": 103, "solut": [165, 166], "solv": 175, "space": [5, 6, 7, 8, 13, 104, 105, 127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 157], "speed": 162, "standard": 3, "standup": 141, "star": 106, "start": [127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153], "state": [127, 128, 129, 131, 132, 133, 134, 135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 165], "step": [1, 9, 158, 159, 160, 165, 169], "subclass": 165, "superman": 107, "surround": 108, "swimmer": 146, "synchron": 170, "syncvectorenv": 12, "system": 148, "tabl": 173, "tac": 111, "taxi": 153, "telecommun": 148, "tenni": 109, "termin": [127, 128, 129, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 166, 169], "tetri": 110, "text": 149, "thi": 87, "think": 175, "third": [148, 172], "tic": 111, "time": [112, 166, 173], "timelimit": 160, "toe": 111, "toi": 149, "train": [136, 161, 162, 163, 169, 170, 171, 174, 175, 176, 177], "transit": [133, 134], "trondead": 113, "truncat": [135, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 166], "try": 170, "turmoil": 114, "tutankham": 115, "tutori": [1, 171, 172], "tweak": 169, "uncommon": 17, "up": [116, 162], "us": [148, 158, 165, 170, 172, 173, 177], "usag": 157, "util": [5, 8, 9, 13], "v0": [154, 155, 159, 160], "v1": [155, 160], "variabl": 4, "vector": [5, 10, 13, 14, 20, 132, 162, 170], "vectorenv": 10, "ventur": 117, "version": [127, 128, 129, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153], "video": [9, 118, 119, 120, 121, 148], "visual": [9, 173, 175], "visualis": [163, 175], "walk": 151, "walker": 127, "walker2d": 147, "war": 105, "we": 173, "weight": 170, "wizard": 122, "wor": 122, "word": 123, "wrapper": [14, 15, 16, 17, 18, 19, 20, 158, 160, 165, 167], "write": 1, "yar": 124, "you": 175, "your": [165, 169], "yourself": 170, "zapper": 123, "zaxxon": 125, "zone": 34}}) \ No newline at end of file