From 79aca4ec595bae9c5aef1a94cd947835c2d151ae Mon Sep 17 00:00:00 2001 From: ffelten Date: Fri, 3 May 2024 16:51:29 +0000 Subject: [PATCH] =?UTF-8?q?Deploying=20to=20gh-pages=20from=20@=20Farama-F?= =?UTF-8?q?oundation/momaland@6582254055ea1972eb79cddf3f00856c9558f6e9=20?= =?UTF-8?q?=F0=9F=9A=80?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- .buildinfo | 2 +- environments/all-envs/index.html | 80 ++++++++++++++++---------------- release_notes/index.html | 39 ---------------- searchindex.js | 2 +- 4 files changed, 42 insertions(+), 81 deletions(-) diff --git a/.buildinfo b/.buildinfo index 8f08983b..1fc3adcb 100644 --- a/.buildinfo +++ b/.buildinfo @@ -1,4 +1,4 @@ # Sphinx build info version 1 # This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done. -config: d1a85c866a751920eccef77da5d322f7 +config: eddc55e6a04db0bf4d19fa6417b66437 tags: d77d1c0d9ca2f4c8421862c7c5a0d620 diff --git a/environments/all-envs/index.html b/environments/all-envs/index.html index ea3e7852..11c66241 100644 --- a/environments/all-envs/index.html +++ b/environments/all-envs/index.html @@ -323,6 +323,42 @@

All Environments

mo-beach-v0

+

Any

+

Discrete / Discrete

+

[occupation, mixture]

+

Taken from Mannion_2018. MO-Beach is a game with two objectives, reflecting the enjoyment of tourists (agents) on their respective beach sections in terms of crowdedness and diversity of attendees. Each beach section is characterised by a capacity and each agent is characterised by a type.

+ +

mo-item-gathering-v0

+

Adversarial

+

Discrete / Discrete

+

[#objects] (configurable)

+

Adapted from Kallstrom_2019, is a multi-agent grid world, containing items of different colours. Each colour represents a different objective and the goal of the agents is to collect as many objects as possible.

+ +

mo-gem-mining-v0

+

Cooperative

+

- / Discrete

+

[#gems] (configurable)

+

MO version of Gem Mining Bargiacchi_2018. Agents go to different mines to extract different gems (objectives). There are restrictions on which mines can be reached for each agent. Agents also influence each other’s producitivity.

+ +

mo-congestion-v0

+

Adversarial

+

- / Discrete

+

[travel time, cost]

+

MO-RouteChoice is a multi-objective extension of the route choice problem Thomasini_2023, where a number of self-interested drivers (agents) must navigate a road network.

+ +

mo-pistonball-v0

+

Cooperative

+

Continuous / Any

+

[agent_#n_reward] (configurable)

+

An MO version of PZ’s Pistonball where the reward of each agent is kept separate.

+ +

mo-multiwalker-stability-v0

+

Cooperative

+

Continuous / Continuous

+

[progress right, package stability]

+

A MO version of PZ’s MultiWalker introduced in Gupta_2017, where the agents also seek to keep the package steady.

+

catch-v0

Cooperative

Continuous / Continuous

@@ -341,65 +377,29 @@

All Environments[distance_target, distance_other_drones]

Agents must surround a fixed target point while maintaining distance between themselves.

-

mo-beach-v0

-

Any

-

Continuous / Discrete

-

[occupation, mixture]

-

Taken from Mannion_2018. MO-Beach is a game with two objectives, reflecting the enjoyment of tourists (agents) on their respective beach sections in terms of crowdedness and diversity of attendees. Each beach section is characterised by a capacity and each agent is characterised by a type.

- -

mo-breakthrough-v0

+

mo-breakthrough-v0

Adversarial

Discrete / Discrete

[win, fast win, capturing opponent's pieces, avoiding capture]

Multi-objective version of the two-player, turn-based, board game Breakthrough.

-

mo-congestion-v0

-

Mixed

-

Continuous / Discrete

-

[travel time, cost]

-

MO-RouteChoice is a multi-objective extension of the route choice problem Thomasini_2023, where a number of self-interested drivers (agents) must navigate a road network.

-

mo-connect4-v0

Adversarial

Discrete / Discrete

[win, fast win, [column #n]]

MO version of Connect 4. Additional objectives are fast win and optionally one objective per column.

-

mo-gem-mining-v0

-

Cooperative

-

Continuous / Discrete

-

[#gems] (configurable)

-

MO version of Gem Mining Bargiacchi_2018. Agents go to different mines to extract different gems (objectives). There are restrictions on which mines can be reached for each agent. Agents also influence each other’s producitivity.

- -

mo-ingenious-v0

+

mo-ingenious-v0

Any

Discrete / Discrete

[#colors] (configurable)

-

MO adaptation of the zero-sum, turn-based board game Ingenious. The game’s original rules support 2-4 players collecting scores in multiple colors (objectives), with the goal of winning by maximizing the minimum score over all colors.

- -

mo-item-gathering-v0

-

Adversarial

-

Discrete / Discrete

-

[#objects] (configurable)

-

Adapted from Kallstrom_2019, is a multi-agent grid world, containing items of different colours. Each colour represents a different objective and the goal of the agents is to collect as many objects as possible.

- -

mo-multiwalker-stability-v0

-

Cooperative

-

Continuous / Continuous

-

[progress right, package stability]

-

A MO version of PZ’s MultiWalker introduced in Gupta_2017, where the agents also seek to keep the package steady.

- -

mo-pistonball-v0

-

Cooperative

-

Continuous / Any

-

[agent_#n_reward] (configurable)

-

An MO version of PZ’s Pistonball where the reward of each agent is kept separate.

+

MO adaptation of the zero-sum, turn-based board game Ingenious. The game’s original rules support 2-4 players collecting scores in multiple colors (objectives), with the goal of winning by maximizing the minimum score over all colors. In MO-Ingenious, we leave the utility wrapper up to the users and only return the vector of scores in each colour objective.

mo-same-game-v0

Any

Discrete / Discrete

[colors_n] (configurable)

-

MO-SameGame is a multi-objective, multi-agent variant of the single-player, single-objective turn-based puzzle game called SameGame Baier_2015. The original single-player, single-objective SameGame rewards the player with \(n^2\) points for removing any group of n tiles. MO-SameGame can extend this in two ways. Agents can either only get points for their own actions, leading to competition between them, or all rewards can be shared in ``team reward’’ mode. Additionally, points for every colour can be counted as separate objectives, allowing for different trade-offs between colours, or they can be accumulated in a single objective like in the default game variant, essentially providing a single-objective wrapper for the game.

+

MO-SameGame is a multi-objective, multi-agent variant of the single-player, single-objective turn-based puzzle game called SameGame Baier_2015. The original single-player, single-objective SameGame rewards the player with \(n^2\) points for removing any group of \(n\) tiles. MO-SameGame can extend this in two ways. Agents can either only get points for their own actions, leading to competition between them, or all rewards can be shared in ``team reward’’ mode. Additionally, points for every colour can be counted as separate objectives, allowing for different trade-offs between colours, or they can be accumulated in a single objective like in the default game variant, essentially providing a single-objective wrapper for the game.

diff --git a/release_notes/index.html b/release_notes/index.html index 7d890f70..4e20b297 100644 --- a/release_notes/index.html +++ b/release_notes/index.html @@ -308,44 +308,6 @@

Release Notes

-
-

0.1.0

-

Released on 2024-05-03 - GitHub - PyPI

-

What's Changed

- -

New Contributors

- -

Full Changelog: v0.0.2...0.1.0

v0.0.2: Dummy release 2

Released on 2023-12-20 - GitHub - PyPI

@@ -404,7 +366,6 @@

v0.0.1: Dummy release
  • Release Notes diff --git a/searchindex.js b/searchindex.js index a249743b..35df1531 100644 --- a/searchindex.js +++ b/searchindex.js @@ -1 +1 @@ -Search.setIndex({"alltitles": {"0.1.0": [[21, "release-0-1-0"]], "404 - Page Not Found": [[0, "page-not-found"]], "AEC": [[2, "aec"], [4, "aec"]], "AEC to Parallel": [[4, "aec-to-parallel"]], "API": [[19, "api"]], "Action Space": [[6, "action-space"], [7, "action-space"], [8, "action-space"], [9, "action-space"], [10, "action-space"], [11, "action-space"], [12, "action-space"], [14, "action-space"], [15, "action-space"], [16, "action-space"], [17, "action-space"], [18, "action-space"]], "All Environments": [[5, "all-environments"]], "Arguments": [[6, "arguments"], [7, "arguments"], [8, "arguments"], [9, "arguments"], [10, "arguments"], [11, "arguments"], [12, "arguments"], [14, "arguments"], [15, "arguments"], [16, "arguments"], [17, "arguments"], [18, "arguments"]], "Attributes": [[2, "attributes"], [3, "attributes"]], "Build the Documentation": [[1, "build-the-documentation"]], "Catch": [[6, "catch"]], "Citing": [[19, "citing"]], "Conversion": [[4, "conversion"]], "Credits": [[6, "credits"], [7, "credits"], [10, "credits"], [12, "credits"], [16, "credits"], [18, "credits"]], "Episode Termination": [[6, "episode-termination"], [7, "episode-termination"], [8, "episode-termination"], [10, "episode-termination"], [12, "episode-termination"], [14, "episode-termination"], [15, "episode-termination"], [16, "episode-termination"], [18, "episode-termination"]], "Episode Truncation": [[6, "episode-truncation"], [7, "episode-truncation"], [8, "episode-truncation"], [10, "episode-truncation"], [12, "episode-truncation"], [14, "episode-truncation"], [16, "episode-truncation"], [18, "episode-truncation"]], "Escort": [[7, "escort"]], "Install": [[19, "install"], [20, "install"]], "Learning algorithms": [[23, "learning-algorithms"]], "Legal Actions Mask": [[9, "legal-actions-mask"], [11, "legal-actions-mask"], [17, "legal-actions-mask"]], "MO-Beach": [[8, "mo-beach"]], "MO-Breakthrough": [[9, "mo-breakthrough"]], "MO-Congestion": [[10, "mo-congestion"]], "MO-Connect4": [[11, "mo-connect4"]], "MO-GemMining": [[12, "mo-gemmining"]], "MO-Ingenious": [[13, "mo-ingenious"]], "MO-ItemGathering": [[14, "mo-itemgathering"]], "MO-MultiwalkerStability": [[15, "mo-multiwalkerstability"]], "MO-Pistonball": [[16, "mo-pistonball"]], "MO-SameGame": [[17, "mo-samegame"]], "MOAECEnv": [[2, "moaecenv"]], "MOMAland documentation": [[1, "momaland-documentation"]], "MOParallelEnv": [[3, "moparallelenv"]], "Making a Custom Environment": [[22, "making-a-custom-environment"]], "Methods": [[2, "methods"], [3, "methods"]], "Observation Space": [[6, "observation-space"], [7, "observation-space"], [8, "observation-space"], [9, "observation-space"], [10, "observation-space"], [11, "observation-space"], [12, "observation-space"], [14, "observation-space"], [15, "observation-space"], [16, "observation-space"], [17, "observation-space"], [18, "observation-space"]], "Parallel": [[3, "parallel"], [4, "parallel"]], "Parallel to AEC": [[4, "parallel-to-aec"]], "Release Notes": [[21, "release-notes"]], "Reward Space": [[6, "reward-space"], [7, "reward-space"], [8, "reward-space"], [10, "reward-space"], [12, "reward-space"], [14, "reward-space"], [15, "reward-space"], [16, "reward-space"], [18, "reward-space"]], "Rewards": [[9, "rewards"], [11, "rewards"], [17, "rewards"]], "Setup pre-commit hooks": [[22, "setup-pre-commit-hooks"]], "Starting State": [[6, "starting-state"], [7, "starting-state"], [8, "starting-state"], [9, "starting-state"], [10, "starting-state"], [11, "starting-state"], [12, "starting-state"], [14, "starting-state"], [16, "starting-state"], [17, "starting-state"], [18, "starting-state"]], "Surround": [[18, "surround"]], "The requested page could not be found.": [[0, "the-requested-page-could-not-be-found"]], "Usage": [[2, "usage"], [3, "usage"]], "Version History": [[9, "version-history"], [11, "version-history"], [17, "version-history"]], "Wrappers and Utils": [[4, "wrappers-and-utils"]], "v0.0.1: Dummy release": [[21, "release-v0-0-1"]], "v0.0.2: Dummy release 2": [[21, "release-v0-0-2"]]}, "docnames": ["404", "README", "api/aec", "api/parallel", "api/wrappers", "environments/all-envs", "environments/catch", "environments/escort", "environments/mobeach", "environments/mobreakthrough", "environments/mocongestion", "environments/moconnect4", "environments/mogem_mining", "environments/moingenious", "environments/moitem_gathering", "environments/momultiwalker_stability", "environments/mopistonball", "environments/mosame_game", "environments/surround", "index", "introduction/install", "release_notes", "tutorials/custom_env", "tutorials/learning_algorithms"], "envversion": {"sphinx": 61, "sphinx.domains.c": 3, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 9, "sphinx.domains.index": 1, "sphinx.domains.javascript": 3, "sphinx.domains.math": 2, "sphinx.domains.python": 4, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx.ext.viewcode": 1}, "filenames": ["404.md", "README.md", "api/aec.md", "api/parallel.md", "api/wrappers.md", "environments/all-envs.md", "environments/catch.md", "environments/escort.md", "environments/mobeach.md", "environments/mobreakthrough.md", "environments/mocongestion.md", "environments/moconnect4.md", "environments/mogem_mining.md", "environments/moingenious.md", "environments/moitem_gathering.md", "environments/momultiwalker_stability.md", "environments/mopistonball.md", "environments/mosame_game.md", "environments/surround.md", "index.md", "introduction/install.md", "release_notes.md", "tutorials/custom_env.md", "tutorials/learning_algorithms.md"], "indexentries": {"action_space() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.action_space", false]], "action_space() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.action_space", false]], "action_spaces (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.action_spaces", false]], "action_spaces (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.action_spaces", false]], "agent_selection (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.agent_selection", false]], "agents (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.agents", false]], "agents (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.agents", false]], "close() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.close", false]], "close() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.close", false]], "infos (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.infos", false]], "linearizereward (class in momaland.utils.aec_wrappers)": [[4, "momaland.utils.aec_wrappers.LinearizeReward", false]], "linearizereward (class in momaland.utils.parallel_wrappers)": [[4, "momaland.utils.parallel_wrappers.LinearizeReward", false]], "max_num_agents (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.max_num_agents", false]], "max_num_agents (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.max_num_agents", false]], "mo_aec_to_parallel_wrapper (class in momaland.utils.conversions)": [[4, "momaland.utils.conversions.mo_aec_to_parallel_wrapper", false]], "mo_parallel_to_aec_wrapper (class in momaland.utils.conversions)": [[4, "momaland.utils.conversions.mo_parallel_to_aec_wrapper", false]], "moaecenv (class in momaland.utils.env)": [[2, "momaland.utils.env.MOAECEnv", false]], "moparallelenv (class in momaland.utils.env)": [[3, "momaland.utils.env.MOParallelEnv", false]], "normalizereward (class in momaland.utils.aec_wrappers)": [[4, "momaland.utils.aec_wrappers.NormalizeReward", false]], "normalizereward (class in momaland.utils.parallel_wrappers)": [[4, "momaland.utils.parallel_wrappers.NormalizeReward", false]], "num_agents (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.num_agents", false]], "num_agents (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.num_agents", false]], "observation_space() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.observation_space", false]], "observation_space() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.observation_space", false]], "observation_spaces (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.observation_spaces", false]], "observation_spaces (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.observation_spaces", false]], "observe() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.observe", false]], "possible_agents (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.possible_agents", false]], "possible_agents (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.possible_agents", false]], "recordepisodestatistics (class in momaland.utils.parallel_wrappers)": [[4, "momaland.utils.parallel_wrappers.RecordEpisodeStatistics", false]], "render() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.render", false]], "render() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.render", false]], "reset() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.reset", false]], "reset() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.reset", false]], "reward_space() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.reward_space", false]], "reward_space() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.reward_space", false]], "reward_spaces (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.reward_spaces", false]], "reward_spaces (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.reward_spaces", false]], "rewards (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.rewards", false]], "state() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.state", false]], "step() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.step", false]], "step() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.step", false]], "terminations (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.terminations", false]], "truncations (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.truncations", false]]}, "objects": {"momaland.utils.aec_wrappers": [[4, 0, 1, "", "LinearizeReward"], [4, 0, 1, "", "NormalizeReward"]], "momaland.utils.conversions": [[4, 0, 1, "", "mo_aec_to_parallel_wrapper"], [4, 0, 1, "", "mo_parallel_to_aec_wrapper"]], "momaland.utils.env": [[2, 0, 1, "", "MOAECEnv"], [3, 0, 1, "", "MOParallelEnv"]], "momaland.utils.env.MOAECEnv": [[2, 1, 1, "", "action_space"], [2, 2, 1, "", "action_spaces"], [2, 2, 1, "", "agent_selection"], [2, 2, 1, "", "agents"], [2, 1, 1, "", "close"], [2, 2, 1, "", "infos"], [2, 2, 1, "", "max_num_agents"], [2, 2, 1, "", "num_agents"], [2, 1, 1, "", "observation_space"], [2, 2, 1, "", "observation_spaces"], [2, 1, 1, "", "observe"], [2, 2, 1, "", "possible_agents"], [2, 1, 1, "", "render"], [2, 1, 1, "", "reset"], [2, 1, 1, "", "reward_space"], [2, 2, 1, "", "reward_spaces"], [2, 2, 1, "", "rewards"], [2, 1, 1, "", "step"], [2, 2, 1, "", "terminations"], [2, 2, 1, "", "truncations"]], "momaland.utils.env.MOParallelEnv": [[3, 1, 1, "", "action_space"], [3, 2, 1, "", "action_spaces"], [3, 2, 1, "", "agents"], [3, 1, 1, "", "close"], [3, 2, 1, "", "max_num_agents"], [3, 2, 1, "", "num_agents"], [3, 1, 1, "", "observation_space"], [3, 2, 1, "", "observation_spaces"], [3, 2, 1, "", "possible_agents"], [3, 1, 1, "", "render"], [3, 1, 1, "", "reset"], [3, 1, 1, "", "reward_space"], [3, 2, 1, "", "reward_spaces"], [3, 1, 1, "", "state"], [3, 1, 1, "", "step"]], "momaland.utils.parallel_wrappers": [[4, 0, 1, "", "LinearizeReward"], [4, 0, 1, "", "NormalizeReward"], [4, 0, 1, "", "RecordEpisodeStatistics"]]}, "objnames": {"0": ["py", "class", "Python class"], "1": ["py", "method", "Python method"], "2": ["py", "attribute", "Python attribute"]}, "objtypes": {"0": "py:class", "1": "py:method", "2": "py:attribute"}, "terms": {"": [2, 3, 4, 5, 6, 7, 8, 9, 11, 16, 17, 18, 19, 21, 23], "0": [2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19], "00e": 16, "01": 16, "01567": 15, "02": 15, "03": 21, "04": 16, "05": 21, "08": 4, "1": [2, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18], "10": [6, 7, 12, 14, 17, 18], "100": [8, 15, 16], "100e": 15, "11": [12, 13], "110": 15, "12": [8, 12, 21], "120": 16, "125": 16, "13": 12, "14": 12, "15": [6, 7, 12, 17, 18], "16": 12, "17": 12, "18": [12, 13], "19": [12, 16], "192": 9, "1e": 4, "2": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19], "20": [9, 11, 12, 16, 21], "200": [6, 7, 18], "2019": 14, "2023": 21, "2024": 21, "209": 15, "21": 21, "225": 17, "23": 12, "24": 21, "25": 21, "255": 16, "26": 21, "27": 21, "28": 21, "2880": 13, "29": 21, "2d": [6, 7, 8, 10, 14, 15, 18], "3": [2, 3, 6, 7, 8, 9, 10, 12, 14, 16, 17, 18, 19], "30": 17, "31": [15, 21], "35": 21, "37": 21, "38": 21, "39": 21, "3d": [6, 7, 16, 18], "4": [5, 9, 11, 12, 14, 15, 21], "40": 21, "41": 21, "4199": 10, "42": [2, 3, 12, 19, 21], "43": 21, "44": 21, "45": 21, "457": 16, "46": [15, 21], "48": 21, "49": 21, "5": [2, 6, 7, 8, 9, 12, 14, 16, 17, 18, 19], "50625": 17, "51": 21, "52": 21, "53": 21, "54": [15, 21], "567e": 15, "6": [8, 11, 12, 13, 16], "60": 16, "7": [2, 3, 8, 11, 12, 19], "75": 16, "79e": 16, "8": [2, 4, 9, 11, 12, 14, 16, 19], "881808": 8, "8x8": [9, 14], "9": [4, 11, 12], "99": [4, 8], "A": [2, 3, 4, 5, 6, 7, 8, 10, 14, 15, 18, 19], "And": 3, "As": [12, 16], "At": [9, 11], "For": [1, 2, 3, 4, 19], "If": [8, 9, 11, 15, 17, 19], "In": [3, 9, 10, 11], "It": 8, "On": 4, "The": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 23], "There": [5, 9], "These": [2, 3], "To": [1, 19, 20, 23], "With": 11, "_build": 1, "_env": [2, 3, 19], "_of_a_of_current_typ": 8, "_scalartype_co": 2, "a_typ": 8, "about": 1, "abov": [16, 17], "absolut": 21, "accept": 2, "access": [2, 4], "accord": 8, "accumul": [2, 5, 17], "action": [2, 3, 5, 13, 19, 23], "action_mask": [9, 11, 13, 17], "action_spac": [2, 3, 19], "actiontyp": [2, 3], "ad": [2, 3, 12], "adapt": [5, 6, 7, 10, 15, 16, 18], "add": [4, 21, 22], "addit": [5, 9, 11], "addition": [5, 9, 11, 17, 22], "adversari": 5, "aec": [16, 19, 20, 21, 22], "aec_env": 4, "aec_wrapp": 4, "aecwrapp": 4, "after": [2, 8, 10, 12], "agent": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 21, 22], "agent_": 5, "agent_0": 4, "agent_1": 4, "agent_i": [6, 7, 8, 10, 12, 13, 14, 17, 18], "agent_it": [2, 19], "agent_select": 2, "agentid": [2, 3], "aka": [2, 19], "algorithm": [19, 20], "aliv": 15, "all": [2, 3, 8, 9, 11, 12, 14, 17, 19, 20], "all_modul": 22, "allow": 5, "also": [2, 4, 5, 8, 9, 19], "altern": 3, "alternatingli": 9, "alwai": 10, "amount": [6, 7, 16, 18], "an": [2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 14, 16, 17, 18, 19, 22], "analysi": [2, 19], "angl": 15, "ani": [2, 5, 9, 11, 12, 17, 23], "anoth": [12, 17], "ansi": [2, 3], "api": [2, 3, 4, 20], "appli": 4, "appropri": 3, "ar": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 15, 16, 17, 18, 22, 23], "area": [6, 7, 18], "around": 5, "arrai": [2, 3, 4, 6, 7, 18, 19], "assign": [2, 19], "attende": 5, "autobuild": 1, "automat": [1, 2], "avail": [5, 9, 11, 14, 15, 17], "averag": [4, 6, 7, 18], "avoid": [4, 5, 6, 7, 9, 10, 15, 18], "award": [9, 11], "b": 1, "baier_2015": 5, "ball": 16, "ball_elast": 16, "ball_frict": 16, "ball_mass": 16, "bandit": 12, "bargiacchi": 12, "bargiacchi_2018": 5, "base": [2, 3, 4, 5, 9, 10, 11, 12, 15, 17, 19], "baselin": 23, "basic": [19, 20], "beach": 5, "becom": 17, "been": [11, 14], "befor": 16, "behavior": 4, "below": [9, 11, 17], "bernoulli": 12, "better": 21, "between": [4, 5, 16, 17, 19], "binari": [9, 11, 17], "board": [5, 9, 11, 13, 17, 21], "board_height": [9, 11, 17], "board_siz": 11, "board_width": [9, 11, 17], "bonu": 12, "bool": [2, 3], "both": [9, 11, 15, 21], "bottom": 11, "bound": 15, "box": [6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18], "bpd": 21, "braess": 10, "break": 5, "breakthrough": [5, 21], "bump": 21, "call": [2, 5, 17], "can": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 16, 17, 18, 19, 20], "cannot": [2, 3, 11], "capac": [5, 8], "captur": [5, 9], "case": [9, 10, 11], "catch": 5, "catch_v0": 6, "cbox": 12, "cd": 1, "cell": [9, 11, 14], "central": [3, 23], "certain": [19, 20], "chang": [1, 2, 3, 4, 8, 15, 16, 21], "changelog": 21, "characteris": 5, "check": [2, 4, 19, 21, 22], "choic": 5, "choos": 10, "circl": 5, "class": [2, 3, 4, 19, 22], "classic": [2, 3], "cleanrl": 23, "close": [2, 3, 16, 17, 19, 23], "closer": 16, "code": [4, 6, 7, 10, 12, 16, 18], "codebas": 10, "colab": [2, 19], "collect": 5, "collid": [6, 7, 18], "collis": [6, 7, 18], "color": [5, 17], "color_reward": 17, "colors_n": 5, "colour": 5, "column": [5, 11, 17], "column_object": 11, "commun": 19, "compar": 19, "compat": 23, "competit": 5, "compliant": 19, "compon": [16, 19, 20], "condit": [6, 7, 18], "configur": 5, "conflict": 11, "congest": [5, 21], "connect": [2, 5, 11, 12, 17, 21], "connect4": 5, "constant": 10, "constructor": 22, "consumpt": 8, "contain": [1, 2, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 23], "continu": [5, 6, 7, 8, 15, 16, 18, 23], "contribut": [1, 21], "contributor": 21, "control": 2, "convers": 22, "convert": 4, "cooper": [5, 12, 16], "coordin": [6, 7, 9, 17, 18, 19], "corner": 5, "correctli": 4, "correl": 12, "correlated_object": 12, "correspond": [2, 8, 9, 10, 11, 12], "cost": [5, 10], "could": [2, 3], "count": [5, 17], "crazyrl": 21, "creat": [2, 19, 22], "crowded": 5, "current": [2, 3, 8, 9, 11, 17, 23], "customiz": 12, "cycl": 16, "d": 17, "data": 3, "decentr": 3, "decis": [2, 19], "default": [2, 3, 5, 8, 11, 12, 14, 15, 16, 17], "defaultli": 12, "defin": 22, "definit": [2, 19], "demonstr": 21, "depend": [4, 10, 19, 20], "describ": [9, 11, 17], "descript": 5, "destin": [5, 10], "detail": [2, 3, 19], "determin": 14, "develop": 19, "diagon": [9, 11], "dict": [2, 3, 4, 9, 11, 13, 17], "dictionari": [3, 9, 11, 17, 22], "diederik": 12, "differ": [2, 5, 8, 10, 12, 17, 19], "dimens": [9, 11, 17], "direct": [6, 7, 18], "directli": 2, "directori": [22, 23], "dirhtml": 1, "disappear": 17, "discord": 19, "discount": 4, "discret": [5, 8, 9, 10, 11, 12, 13, 14, 16, 17, 21, 23], "disentangl": 16, "displai": [2, 3, 6, 7, 10, 18], "distanc": [5, 6, 7, 15, 18], "distance_other_dron": 5, "distance_target": 5, "distribut": 8, "divers": 5, "divid": 9, "do": [9, 11], "doc": [1, 21], "document": [2, 3, 15, 19, 20, 21, 22], "doe": [2, 19, 20], "domain": [8, 12], "don": 22, "down": [14, 16, 17], "draw": [9, 11], "driver": [5, 10], "drone": [5, 6, 7, 18], "drop": [11, 15, 16], "dtype": 2, "due": 12, "duplic": 4, "dure": 15, "dynam": 21, "e": [1, 2, 3, 12, 22], "each": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18, 19, 22, 23], "edg": 16, "either": [5, 9, 10, 11, 17], "elast": 16, "element": [9, 11, 15, 17], "els": [2, 19], "empti": [9, 11, 14, 17], "encod": [9, 11, 17], "end": [4, 11, 15], "enforc": [2, 3], "enjoy": 5, "entir": 17, "entri": 14, "env": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 22], "env_nam": 22, "environ": [2, 3, 4, 6, 7, 8, 10, 12, 13, 14, 15, 16, 18, 19, 21, 23], "episod": 4, "epsilon": 4, "equal": [8, 11, 12], "equival": [2, 3], "escap": 6, "escort": 5, "escort_v0": 7, "essenti": [5, 19], "etc": 3, "eugenio": 12, "evalu": [21, 23], "even": 15, "everi": [1, 2, 3, 5, 11, 17], "everyth": [19, 20, 22, 23], "exampl": [2, 4, 6, 7, 8, 18, 19, 22], "except": [9, 11, 17], "exclud": 12, "exclus": [9, 11, 17], "execut": [2, 3], "exert": 15, "exist": 11, "expect": 12, "experiment": 21, "exponenti": 4, "export": 22, "extend": [2, 3, 5, 9, 11, 17, 22], "extens": 5, "extract": 5, "ezpickl": 21, "factor": [4, 11], "factori": 22, "fall": [11, 15, 17], "fals": [14, 17], "farama": 19, "fast": 5, "faster": [9, 11], "featur": 21, "felten": [6, 7, 18], "few": 4, "ffelten": 21, "file": 23, "fill": [9, 11, 17], "final": [7, 18], "final_target_loc": [7, 18], "find": 12, "finish": 16, "first": [2, 9, 15, 21], "fix": [4, 5, 21], "float": [2, 3, 4, 6, 7, 10, 18], "float32": [6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18], "folder": 1, "follow": [3, 6, 7, 15, 18, 19], "forc": 15, "fork": 1, "format": 5, "forward": 9, "found": [9, 11, 12, 17], "four": 11, "fourth": 9, "frame": 3, "friction": 16, "from": [2, 3, 5, 6, 7, 9, 10, 11, 12, 15, 16, 17, 18, 19, 22, 23], "front": 9, "full": [11, 21], "fulli": 12, "function": [2, 3, 12, 19, 22], "g": 22, "game": [5, 9, 11, 13, 17, 21], "gamma": 4, "gap": 17, "gather": [5, 14, 21], "gem": [5, 12, 21], "gener": [2, 3, 12, 21], "get": [5, 9, 11, 17], "github": 21, "give": 15, "given": [8, 9, 11, 12, 16, 17, 22], "global": [3, 8, 16], "go": [1, 5], "goal": 5, "greater": 12, "grid": [5, 9, 11, 14, 17], "ground": [6, 7, 18], "group": [5, 17], "guid": 20, "gupta_2017": 5, "gym": 3, "gymnasium": [2, 3], "ha": [2, 3, 4, 8, 9, 11, 12, 17], "handl": 4, "have": [4, 9, 10, 11, 14, 19, 22], "height": [9, 11, 17], "heintz": 14, "henc": 12, "here": [2, 19, 23], "heterogen": 10, "hiazmani": 21, "higher": 15, "hip": 15, "hold": [9, 11, 17], "home": 9, "horizont": [11, 17], "how": [1, 6, 7, 16, 18], "http": 19, "human": [2, 3, 6, 7, 10, 11, 16, 18, 19], "i": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 23], "id": [8, 12, 14], "ident": 9, "idx": 4, "imag": 16, "immedi": 4, "implement": [2, 3, 21, 23], "import": [2, 3, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19], "incentiv": [9, 11], "includ": [5, 19, 20], "independ": 21, "index": [4, 9, 11, 17], "indic": [9, 11, 17, 23], "individu": [17, 23], "inf": [6, 7, 15, 18], "influenc": 5, "info": [2, 3, 19], "inform": [1, 5], "ingeni": [5, 21], "inherit": 4, "init_flying_po": [6, 7, 18], "init_target_loc": [6, 7, 18], "initi": [4, 6, 7, 8, 14, 17, 18, 22], "initial_map": 14, "inproceed": 19, "input": 4, "insert": [2, 3, 19], "insid": 22, "instal": [1, 22], "instanc": [2, 4, 12, 19], "instantan": 2, "instanti": 4, "instead": [19, 22], "int": [2, 3, 6, 7, 8, 10, 16, 18], "int32": 13, "int64": 14, "int8": [9, 11, 13, 17], "integ": [2, 3, 9, 11, 12, 14, 17], "interact": [2, 3, 19], "interest": 5, "intermedi": [7, 18], "intern": 2, "introduc": 5, "iql": 23, "isn": 12, "item": [5, 14, 21], "its": 5, "join": 19, "joint": 15, "just": 12, "kallstrom_2019": 5, "keep": [4, 5, 23], "kei": [2, 3, 4], "kept": 5, "knee": 15, "k\u00e4llstr\u00f6m": 14, "land": 11, "languag": 12, "last": [2, 8, 9, 12, 19], "latenc": 10, "launch": 21, "lead": 5, "learn": [6, 7, 10, 18, 19, 20, 21], "least": 17, "left": [8, 9, 14, 17], "length": [2, 3, 6, 7, 8, 12, 18], "less": 12, "level": [8, 15], "librari": [19, 23], "like": [2, 3, 4, 5, 17, 23], "limit": [12, 16], "linear": [4, 23], "linearizereward": [2, 4, 19], "linearreward": 3, "list": [2, 3, 23], "literatur": 5, "live": 12, "local": [8, 12, 16], "locat": [9, 11, 17], "log": 21, "look": 2, "loop": 21, "lose": [9, 11], "lost": 9, "lower": [15, 16], "made": [1, 11, 21], "mai": [2, 3], "main": [9, 11, 17, 21], "maintain": 5, "make": [1, 2, 19], "mani": 5, "mannion_2018": 5, "map": 14, "margin": 10, "markdown": 21, "marl": 5, "mass": 16, "max": 9, "max_connect": 12, "max_cycl": 16, "max_mov": 9, "max_num_ag": [2, 3], "max_prob": 12, "max_work": 12, "maxim": [5, 6, 7, 15, 18], "maximis": 16, "maximum": [8, 12, 14, 16], "mct": 10, "md": 1, "mean": [9, 11, 17], "member": [2, 3, 22], "merg": 21, "met": [6, 7, 18], "method": 22, "mgoulao": 21, "min_connect": 12, "min_prob": 12, "min_work": 12, "mine": [5, 12], "minim": [6, 7, 10, 16, 18], "minimis": 10, "minimum": [5, 12], "miss": 21, "mix": 5, "mixtur": [5, 8], "mo": [4, 5, 21], "mo_aec_to_parallel_wrapp": 4, "mo_parallel_to_aec": 22, "mo_parallel_to_aec_wrapp": 4, "moaec": 2, "moaecenv": 22, "mobeach_v0": [8, 22], "mobil": 5, "mobreakthrough_v0": 9, "mocongestion_v0": 10, "moconnect4_v0": 11, "mode": [2, 3, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18], "model": [2, 19], "modifi": 9, "modul": 4, "mogem_mining_v0": 12, "moingenious_v0": 13, "moitem_gathering_v0": 14, "momaland": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 23], "momappo": [21, 23], "momarl": 19, "momultiwalk": 21, "momultiwalker_st": [2, 3, 19], "momultiwalker_stability_v0": [2, 3, 15, 19], "monetari": 10, "moparallel": 3, "moparallelenv": 22, "mopistonball_v0": 16, "more": [1, 2, 3, 5, 11, 19], "morl": 23, "mosame_game_v0": 17, "most": 12, "move": [4, 6, 7, 8, 9, 11, 16, 17, 18], "move_count": [9, 11], "multi": [2, 3, 5, 9, 10, 11, 14, 17, 19, 23], "multibinari": 9, "multipl": 5, "multipli": 12, "multiwalk": [5, 15, 21], "must": [2, 3, 4, 5], "n": [2, 5, 17], "n_piston": 16, "n_reward": 5, "name": [2, 3, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 23], "nativ": 4, "navig": 5, "ndarrai": [2, 3], "need": [10, 16], "neg": [9, 12, 14], "negat": [9, 11], "network": [2, 5, 10], "nevertheless": 23, "new": [4, 21, 22], "newli": 4, "next": [2, 16], "non": 12, "none": [2, 3, 6, 7, 8, 10, 14, 16, 18, 19], "normal": 4, "normalizereward": 4, "note": [2, 12], "notebook": [2, 19, 21], "np": [2, 3, 4, 19], "nparrai": [6, 7, 18], "nth": 2, "num_ag": [2, 3, 8, 10, 12, 17], "num_color": 17, "num_dron": [6, 7, 18], "num_intermediate_point": [7, 18], "num_object": [9, 12, 17], "num_timestep": [8, 10, 12, 14], "number": [5, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18], "numpi": [2, 3, 19], "ob": 5, "object": [2, 3, 4, 5, 8, 9, 10, 11, 12, 14, 15, 17, 19, 23], "observ": [2, 3, 13, 19, 23], "observation_spac": [2, 3], "obstyp": [2, 3], "occup": [5, 8], "occupi": 8, "occur": 14, "od": 10, "off": 5, "ol": [21, 23], "onc": 1, "one": [5, 6, 7, 9, 11, 15, 17, 18], "onli": [5, 8, 9, 17], "open": 19, "oppon": [5, 9, 11], "oppos": [2, 3], "option": [2, 3, 5, 6, 7, 9, 10, 11, 16, 18, 19], "order": [6, 7, 17, 18], "org": 19, "origin": [5, 10, 16], "other": [2, 5, 6, 7, 9, 11, 18, 23], "our": [1, 22], "outer": 21, "output": 4, "outsid": [2, 3], "over": [5, 8], "overrid": [2, 3, 4, 22], "own": [4, 5, 9, 17], "packag": [1, 5, 15, 22], "pair": 10, "paper": [5, 23], "paradox": 10, "parallel": [6, 7, 8, 10, 14, 18, 20, 22], "parallel_env": [3, 4, 22], "parallel_to_aec": 22, "parallel_wrapp": 4, "paramet": 4, "past": 4, "penal": 15, "penalti": [15, 16, 21], "per": [2, 3, 5, 12, 19], "percentag": [8, 10], "perform": 23, "pettingzoo": [2, 3, 5, 15, 19, 22], "philosophi": 23, "piec": [5, 9], "pip": [1, 19, 20, 22], "piston": 16, "piston_i": 16, "pistonbal": 5, "place": [8, 9, 10, 11, 16], "placement": [9, 11], "plai": [2, 3, 17], "plane": [9, 11, 17], "player": [5, 9, 11, 17], "player_i": [9, 11], "pleas": [12, 19], "point": [5, 7, 9, 10, 11, 17, 18], "polici": [2, 3, 4, 19, 23], "posit": [6, 7, 8, 14, 18], "position_distribut": 8, "possibl": [5, 10, 23], "possible_ag": [2, 3, 4], "practic": 22, "prefer": 10, "previou": 12, "print": [2, 3, 4], "probabl": 12, "problem": [5, 8, 14], "problem_nam": 10, "problemat": [19, 20], "producit": 5, "program": 12, "progress": [2, 3, 5], "proport": 16, "provid": [5, 12, 19, 23], "public": 19, "pure": 22, "puzzl": [5, 17], "py": 22, "pypi": 21, "pytest": 22, "python": [2, 19], "pz": [2, 3, 4, 5, 19, 22], "q": 21, "qmix": 3, "r": 1, "rack": 13, "rais": 16, "random": [8, 10, 16], "random_drop": 16, "random_toll_percentag": 10, "randomis": 14, "randomli": [8, 10, 12, 17], "rang": 4, "rather": 2, "raw_env": 22, "reach": [5, 6, 7, 8, 9, 11, 14, 16, 18], "reachabl": 12, "realli": 12, "rebuild": 1, "receiv": [3, 8, 12], "recent": 4, "record": 4, "recordepisodestatist": 4, "recordstatisticswrapp": 21, "rectangular": 17, "reflect": 5, "reimplement": 12, "reinforc": 19, "releas": 2, "remov": [2, 3, 5, 17, 21], "render": [2, 3, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18], "render_mod": [2, 3, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18, 19], "repositori": [19, 22], "repres": [4, 5, 6, 7, 9, 10, 11, 12, 14, 15, 17, 18], "requir": [1, 19, 20, 21], "research": 19, "reset": [2, 3, 19], "resourc": 2, "respect": [3, 5, 11], "restrict": 5, "return": [2, 3, 19, 22], "reus": 23, "reward": [2, 3, 4, 5, 13, 19, 21, 22, 23], "reward_schem": 8, "reward_spac": [2, 3, 4, 22], "rgb": 16, "rgb_arrai": [2, 3, 11, 16], "right": [5, 8, 9, 14, 16, 17], "rl": [2, 9, 11, 17, 19], "road": [5, 10], "roijer": 12, "rout": [5, 10], "routechoic": 5, "row": 9, "rradul": 21, "rule": 5, "run": [14, 22], "same": [2, 3, 5, 9, 11, 12, 17], "samegam": [5, 21], "sampl": [2, 3, 19], "scalar": [2, 3, 4, 16, 19, 23], "scale": [4, 11], "scheme": 8, "score": [5, 13], "screen": 11, "screen_scal": 11, "script": 21, "second": [2, 9, 11, 15], "section": [5, 8, 9, 11, 17], "section_capac": 8, "section_consumpt": 8, "section_id": 8, "see": [2, 3, 12, 15, 19, 20, 22], "seed": [2, 3, 12, 19, 21], "seek": 5, "select": [2, 10, 12, 17], "self": [2, 5, 22, 23], "send": 12, "separ": [5, 17], "sequenc": 11, "sequenti": 17, "server": 19, "set": [9, 11, 12, 16, 17, 19, 23], "shake": 15, "shape": [4, 6, 7, 10, 18, 22], "share": [5, 17], "shift": [2, 19], "should": [2, 6, 7, 11, 12, 18, 22], "side": [6, 7, 18], "similar": [2, 4, 19], "simpl": [2, 19], "singl": [2, 5, 9, 11, 15, 17, 19, 21, 23], "sister": 15, "size": [6, 7, 17, 18, 21], "so": [3, 4, 10], "some": [4, 19, 20], "sourc": [2, 3, 4, 6, 7, 10, 18, 19], "space": [2, 3, 4, 5, 13, 22, 23], "specif": [2, 3, 9, 11, 12, 17], "specifi": [2, 16, 22], "speed": [6, 7, 18], "sphinx": 1, "split": 4, "squar": [9, 17], "stabil": [4, 5, 21], "stai": [8, 14, 16], "stand": 11, "standard": [2, 3, 19], "start": 2, "state": [2, 3], "stateless": [10, 12], "static": 18, "statist": 4, "steadi": 5, "step": [2, 3, 6, 7, 15, 18, 19], "still": [15, 16], "store": 2, "str": [2, 3, 6, 7, 8, 10, 18], "straight": 9, "string": [2, 3], "structur": 2, "stuff": 21, "subprocess": 2, "successfulli": 11, "sum": [5, 8], "suppli": [19, 20], "support": [2, 3, 5], "surround": [5, 6], "surround_v0": 18, "survei": [2, 19], "switch": 2, "system": [19, 20], "t": [4, 12, 22], "take": [2, 3, 4, 10, 16], "taken": [2, 5], "target": [5, 6, 7, 18], "target_spe": [6, 7, 18], "tax": 10, "team": [5, 17, 23], "team_reward": 17, "tensor": [9, 11, 17], "term": 5, "termin": [2, 3, 19], "terminate_on_fal": 15, "test": [19, 20, 22], "than": 11, "thei": [5, 9, 11, 17], "them": [2, 4, 5, 9, 11, 16, 19], "themselv": 5, "therefor": [10, 12], "thi": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 16, 17, 18, 19, 20, 22], "third": 9, "thomasini_2023": 5, "three": 9, "threepwoodi": 21, "through": [2, 3], "tile": [5, 17], "time": [1, 2, 5, 9, 10, 11, 16], "time_penalti": 16, "timestep": [6, 7, 8, 10, 12, 14, 18], "todo": [5, 19], "togeth": 12, "token": 11, "toll": 10, "toll_mod": 10, "tool": 23, "top": [4, 11], "total": 12, "tourist": 5, "toward": [6, 7, 15, 16, 18], "trade": 5, "train": 3, "trajectori": [4, 7, 18], "transform": 4, "travel": [5, 6, 7, 10, 15, 18], "true": [12, 15, 16, 17], "trunc_prob": 12, "truncat": [2, 3, 19], "try": [6, 9, 21], "tupl": [3, 8, 14], "turn": [5, 9, 11, 17], "two": [5, 8, 9, 10, 11, 16, 17], "txt": 1, "type": [2, 3, 4, 5, 8, 12, 14], "type_distribut": 8, "typic": [2, 3], "uint8": 16, "umutucak": 21, "unchang": 16, "under": 10, "undocu": 21, "uniform": [8, 21], "until": 11, "up": [9, 14, 16], "updat": 21, "upper": 12, "us": [2, 4, 8, 10, 11, 12, 16, 19, 20, 22], "usag": [19, 20], "usual": [9, 11, 17], "util": [2, 3, 19, 22, 23], "v0": 5, "valid": 4, "valu": [2, 3, 4, 6, 7, 8, 11, 12, 15, 16, 18, 19], "variabl": 2, "varianc": 4, "variant": [5, 9, 11, 17], "vec_reward": [2, 3, 19], "vector": [2, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 22], "vectori": 4, "veri": [2, 19], "version": [5, 10, 21, 22], "vertic": [11, 17], "via": 8, "view": 3, "villag": 12, "visual": 21, "w": 12, "w_bonu": 12, "wa": [4, 6, 7, 10, 12, 16, 18], "wai": [5, 17], "walker_0": [2, 19], "walker_1": [2, 19], "walker_2": [2, 19], "walker_i": 15, "wall": 16, "wandb": 21, "we": [2, 19, 23], "webpag": 21, "websit": 19, "weight": [2, 3, 4, 19, 21], "well": [5, 19, 22], "were": 17, "what": 21, "when": [6, 7, 11, 16, 17, 18], "where": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19], "whether": [9, 11, 14, 16, 17], "which": [2, 3, 5, 6, 7, 9, 11, 12, 15, 17, 18, 19, 22], "while": [3, 5, 6, 7, 10, 18], "whose": [4, 9, 11, 17], "width": [9, 11, 17], "win": [5, 9, 11], "window": [2, 3, 16], "without": [5, 11], "work": [19, 22], "worker": 12, "workflow": 21, "world": 5, "would": [2, 3, 19], "wrapper": [5, 21, 23], "x": [9, 11, 17], "x_0": [6, 7, 18], "x_1": [6, 7, 18], "x_n": [6, 7, 18], "x_targ": [6, 7, 18], "xyz": [6, 7, 18], "y": [9, 17], "y_0": [6, 7, 18], "y_1": [6, 7, 18], "y_n": [6, 7, 18], "y_targ": [6, 7, 18], "you": [2, 3, 19, 22], "your": [1, 2, 3, 19, 22], "z": 9, "z_0": [6, 7, 18], "z_1": [6, 7, 18], "z_n": [6, 7, 18], "z_targ": [6, 7, 18], "zero": [5, 9, 11, 17]}, "titles": ["404 - Page Not Found", "MOMAland documentation", "AEC", "Parallel", "Wrappers and Utils", "All Environments", "Catch", "Escort", "MO-Beach", "MO-Breakthrough", "MO-Congestion", "MO-Connect4", "MO-GemMining", "MO-Ingenious", "MO-ItemGathering", "MO-MultiwalkerStability", "MO-Pistonball", "MO-SameGame", "Surround", "API", "Install", "Release Notes", "Making a Custom Environment", "Learning algorithms"], "titleterms": {"0": 21, "1": 21, "2": 21, "404": 0, "Not": 0, "The": 0, "action": [6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18], "aec": [2, 4], "algorithm": 23, "all": 5, "api": 19, "argument": [6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18], "attribut": [2, 3], "beach": 8, "breakthrough": 9, "build": 1, "catch": 6, "cite": 19, "commit": 22, "congest": 10, "connect4": 11, "convers": 4, "could": 0, "credit": [6, 7, 10, 12, 16, 18], "custom": 22, "document": 1, "dummi": 21, "environ": [5, 22], "episod": [6, 7, 8, 10, 12, 14, 15, 16, 18], "escort": 7, "found": 0, "gemmin": 12, "histori": [9, 11, 17], "hook": 22, "ingeni": 13, "instal": [19, 20], "itemgath": 14, "learn": 23, "legal": [9, 11, 17], "make": 22, "mask": [9, 11, 17], "method": [2, 3], "mo": [8, 9, 10, 11, 12, 13, 14, 15, 16, 17], "moaecenv": 2, "momaland": 1, "moparallelenv": 3, "multiwalkerst": 15, "note": 21, "observ": [6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18], "page": 0, "parallel": [3, 4], "pistonbal": 16, "pre": 22, "releas": 21, "request": 0, "reward": [6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18], "samegam": 17, "setup": 22, "space": [6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18], "start": [6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18], "state": [6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18], "surround": 18, "termin": [6, 7, 8, 10, 12, 14, 15, 16, 18], "truncat": [6, 7, 8, 10, 12, 14, 16, 18], "usag": [2, 3], "util": 4, "v0": 21, "version": [9, 11, 17], "wrapper": 4}}) \ No newline at end of file +Search.setIndex({"alltitles": {"404 - Page Not Found": [[0, "page-not-found"]], "AEC": [[2, "aec"], [4, "aec"]], "AEC to Parallel": [[4, "aec-to-parallel"]], "API": [[19, "api"]], "Action Space": [[6, "action-space"], [7, "action-space"], [8, "action-space"], [9, "action-space"], [10, "action-space"], [11, "action-space"], [12, "action-space"], [14, "action-space"], [15, "action-space"], [16, "action-space"], [17, "action-space"], [18, "action-space"]], "All Environments": [[5, "all-environments"]], "Arguments": [[6, "arguments"], [7, "arguments"], [8, "arguments"], [9, "arguments"], [10, "arguments"], [11, "arguments"], [12, "arguments"], [14, "arguments"], [15, "arguments"], [16, "arguments"], [17, "arguments"], [18, "arguments"]], "Attributes": [[2, "attributes"], [3, "attributes"]], "Build the Documentation": [[1, "build-the-documentation"]], "Catch": [[6, "catch"]], "Citing": [[19, "citing"]], "Conversion": [[4, "conversion"]], "Credits": [[6, "credits"], [7, "credits"], [10, "credits"], [12, "credits"], [16, "credits"], [18, "credits"]], "Episode Termination": [[6, "episode-termination"], [7, "episode-termination"], [8, "episode-termination"], [10, "episode-termination"], [12, "episode-termination"], [14, "episode-termination"], [15, "episode-termination"], [16, "episode-termination"], [18, "episode-termination"]], "Episode Truncation": [[6, "episode-truncation"], [7, "episode-truncation"], [8, "episode-truncation"], [10, "episode-truncation"], [12, "episode-truncation"], [14, "episode-truncation"], [16, "episode-truncation"], [18, "episode-truncation"]], "Escort": [[7, "escort"]], "Install": [[19, "install"], [20, "install"]], "Learning algorithms": [[23, "learning-algorithms"]], "Legal Actions Mask": [[9, "legal-actions-mask"], [11, "legal-actions-mask"], [17, "legal-actions-mask"]], "MO-Beach": [[8, "mo-beach"]], "MO-Breakthrough": [[9, "mo-breakthrough"]], "MO-Congestion": [[10, "mo-congestion"]], "MO-Connect4": [[11, "mo-connect4"]], "MO-GemMining": [[12, "mo-gemmining"]], "MO-Ingenious": [[13, "mo-ingenious"]], "MO-ItemGathering": [[14, "mo-itemgathering"]], "MO-MultiwalkerStability": [[15, "mo-multiwalkerstability"]], "MO-Pistonball": [[16, "mo-pistonball"]], "MO-SameGame": [[17, "mo-samegame"]], "MOAECEnv": [[2, "moaecenv"]], "MOMAland documentation": [[1, "momaland-documentation"]], "MOParallelEnv": [[3, "moparallelenv"]], "Making a Custom Environment": [[22, "making-a-custom-environment"]], "Methods": [[2, "methods"], [3, "methods"]], "Observation Space": [[6, "observation-space"], [7, "observation-space"], [8, "observation-space"], [9, "observation-space"], [10, "observation-space"], [11, "observation-space"], [12, "observation-space"], [14, "observation-space"], [15, "observation-space"], [16, "observation-space"], [17, "observation-space"], [18, "observation-space"]], "Parallel": [[3, "parallel"], [4, "parallel"]], "Parallel to AEC": [[4, "parallel-to-aec"]], "Release Notes": [[21, "release-notes"]], "Reward Space": [[6, "reward-space"], [7, "reward-space"], [8, "reward-space"], [10, "reward-space"], [12, "reward-space"], [14, "reward-space"], [15, "reward-space"], [16, "reward-space"], [18, "reward-space"]], "Rewards": [[9, "rewards"], [11, "rewards"], [17, "rewards"]], "Setup pre-commit hooks": [[22, "setup-pre-commit-hooks"]], "Starting State": [[6, "starting-state"], [7, "starting-state"], [8, "starting-state"], [9, "starting-state"], [10, "starting-state"], [11, "starting-state"], [12, "starting-state"], [14, "starting-state"], [16, "starting-state"], [17, "starting-state"], [18, "starting-state"]], "Surround": [[18, "surround"]], "The requested page could not be found.": [[0, "the-requested-page-could-not-be-found"]], "Usage": [[2, "usage"], [3, "usage"]], "Version History": [[9, "version-history"], [11, "version-history"], [17, "version-history"]], "Wrappers and Utils": [[4, "wrappers-and-utils"]], "v0.0.1: Dummy release": [[21, "release-v0-0-1"]], "v0.0.2: Dummy release 2": [[21, "release-v0-0-2"]]}, "docnames": ["404", "README", "api/aec", "api/parallel", "api/wrappers", "environments/all-envs", "environments/catch", "environments/escort", "environments/mobeach", "environments/mobreakthrough", "environments/mocongestion", "environments/moconnect4", "environments/mogem_mining", "environments/moingenious", "environments/moitem_gathering", "environments/momultiwalker_stability", "environments/mopistonball", "environments/mosame_game", "environments/surround", "index", "introduction/install", "release_notes", "tutorials/custom_env", "tutorials/learning_algorithms"], "envversion": {"sphinx": 61, "sphinx.domains.c": 3, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 9, "sphinx.domains.index": 1, "sphinx.domains.javascript": 3, "sphinx.domains.math": 2, "sphinx.domains.python": 4, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx.ext.viewcode": 1}, "filenames": ["404.md", "README.md", "api/aec.md", "api/parallel.md", "api/wrappers.md", "environments/all-envs.md", "environments/catch.md", "environments/escort.md", "environments/mobeach.md", "environments/mobreakthrough.md", "environments/mocongestion.md", "environments/moconnect4.md", "environments/mogem_mining.md", "environments/moingenious.md", "environments/moitem_gathering.md", "environments/momultiwalker_stability.md", "environments/mopistonball.md", "environments/mosame_game.md", "environments/surround.md", "index.md", "introduction/install.md", "release_notes.md", "tutorials/custom_env.md", "tutorials/learning_algorithms.md"], "indexentries": {"action_space() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.action_space", false]], "action_space() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.action_space", false]], "action_spaces (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.action_spaces", false]], "action_spaces (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.action_spaces", false]], "agent_selection (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.agent_selection", false]], "agents (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.agents", false]], "agents (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.agents", false]], "close() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.close", false]], "close() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.close", false]], "infos (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.infos", false]], "linearizereward (class in momaland.utils.aec_wrappers)": [[4, "momaland.utils.aec_wrappers.LinearizeReward", false]], "linearizereward (class in momaland.utils.parallel_wrappers)": [[4, "momaland.utils.parallel_wrappers.LinearizeReward", false]], "max_num_agents (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.max_num_agents", false]], "max_num_agents (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.max_num_agents", false]], "mo_aec_to_parallel_wrapper (class in momaland.utils.conversions)": [[4, "momaland.utils.conversions.mo_aec_to_parallel_wrapper", false]], "mo_parallel_to_aec_wrapper (class in momaland.utils.conversions)": [[4, "momaland.utils.conversions.mo_parallel_to_aec_wrapper", false]], "moaecenv (class in momaland.utils.env)": [[2, "momaland.utils.env.MOAECEnv", false]], "moparallelenv (class in momaland.utils.env)": [[3, "momaland.utils.env.MOParallelEnv", false]], "normalizereward (class in momaland.utils.aec_wrappers)": [[4, "momaland.utils.aec_wrappers.NormalizeReward", false]], "normalizereward (class in momaland.utils.parallel_wrappers)": [[4, "momaland.utils.parallel_wrappers.NormalizeReward", false]], "num_agents (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.num_agents", false]], "num_agents (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.num_agents", false]], "observation_space() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.observation_space", false]], "observation_space() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.observation_space", false]], "observation_spaces (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.observation_spaces", false]], "observation_spaces (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.observation_spaces", false]], "observe() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.observe", false]], "possible_agents (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.possible_agents", false]], "possible_agents (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.possible_agents", false]], "recordepisodestatistics (class in momaland.utils.parallel_wrappers)": [[4, "momaland.utils.parallel_wrappers.RecordEpisodeStatistics", false]], "render() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.render", false]], "render() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.render", false]], "reset() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.reset", false]], "reset() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.reset", false]], "reward_space() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.reward_space", false]], "reward_space() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.reward_space", false]], "reward_spaces (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.reward_spaces", false]], "reward_spaces (momaland.utils.env.moparallelenv attribute)": [[3, "momaland.utils.env.MOParallelEnv.reward_spaces", false]], "rewards (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.rewards", false]], "state() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.state", false]], "step() (momaland.utils.env.moaecenv method)": [[2, "momaland.utils.env.MOAECEnv.step", false]], "step() (momaland.utils.env.moparallelenv method)": [[3, "momaland.utils.env.MOParallelEnv.step", false]], "terminations (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.terminations", false]], "truncations (momaland.utils.env.moaecenv attribute)": [[2, "momaland.utils.env.MOAECEnv.truncations", false]]}, "objects": {"momaland.utils.aec_wrappers": [[4, 0, 1, "", "LinearizeReward"], [4, 0, 1, "", "NormalizeReward"]], "momaland.utils.conversions": [[4, 0, 1, "", "mo_aec_to_parallel_wrapper"], [4, 0, 1, "", "mo_parallel_to_aec_wrapper"]], "momaland.utils.env": [[2, 0, 1, "", "MOAECEnv"], [3, 0, 1, "", "MOParallelEnv"]], "momaland.utils.env.MOAECEnv": [[2, 1, 1, "", "action_space"], [2, 2, 1, "", "action_spaces"], [2, 2, 1, "", "agent_selection"], [2, 2, 1, "", "agents"], [2, 1, 1, "", "close"], [2, 2, 1, "", "infos"], [2, 2, 1, "", "max_num_agents"], [2, 2, 1, "", "num_agents"], [2, 1, 1, "", "observation_space"], [2, 2, 1, "", "observation_spaces"], [2, 1, 1, "", "observe"], [2, 2, 1, "", "possible_agents"], [2, 1, 1, "", "render"], [2, 1, 1, "", "reset"], [2, 1, 1, "", "reward_space"], [2, 2, 1, "", "reward_spaces"], [2, 2, 1, "", "rewards"], [2, 1, 1, "", "step"], [2, 2, 1, "", "terminations"], [2, 2, 1, "", "truncations"]], "momaland.utils.env.MOParallelEnv": [[3, 1, 1, "", "action_space"], [3, 2, 1, "", "action_spaces"], [3, 2, 1, "", "agents"], [3, 1, 1, "", "close"], [3, 2, 1, "", "max_num_agents"], [3, 2, 1, "", "num_agents"], [3, 1, 1, "", "observation_space"], [3, 2, 1, "", "observation_spaces"], [3, 2, 1, "", "possible_agents"], [3, 1, 1, "", "render"], [3, 1, 1, "", "reset"], [3, 1, 1, "", "reward_space"], [3, 2, 1, "", "reward_spaces"], [3, 1, 1, "", "state"], [3, 1, 1, "", "step"]], "momaland.utils.parallel_wrappers": [[4, 0, 1, "", "LinearizeReward"], [4, 0, 1, "", "NormalizeReward"], [4, 0, 1, "", "RecordEpisodeStatistics"]]}, "objnames": {"0": ["py", "class", "Python class"], "1": ["py", "method", "Python method"], "2": ["py", "attribute", "Python attribute"]}, "objtypes": {"0": "py:class", "1": "py:method", "2": "py:attribute"}, "terms": {"": [2, 3, 4, 5, 6, 7, 8, 9, 11, 16, 17, 18, 19, 23], "0": [2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19], "00e": 16, "01": 16, "01567": 15, "02": 15, "04": 16, "08": 4, "1": [2, 4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18], "10": [6, 7, 12, 14, 17, 18], "100": [8, 15, 16], "100e": 15, "11": [12, 13], "110": 15, "12": [8, 12, 21], "120": 16, "125": 16, "13": 12, "14": 12, "15": [6, 7, 12, 17, 18], "16": 12, "17": 12, "18": [12, 13], "19": [12, 16], "192": 9, "1e": 4, "2": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19], "20": [9, 11, 12, 16, 21], "200": [6, 7, 18], "2019": 14, "2023": 21, "209": 15, "225": 17, "23": 12, "255": 16, "2880": 13, "2d": [6, 7, 8, 10, 14, 15, 18], "3": [2, 3, 6, 7, 8, 9, 10, 12, 14, 16, 17, 18, 19], "30": 17, "31": 15, "3d": [6, 7, 16, 18], "4": [5, 9, 11, 12, 14, 15], "4199": 10, "42": [2, 3, 12, 19], "457": 16, "46": 15, "5": [2, 6, 7, 8, 9, 12, 14, 16, 17, 18, 19], "50625": 17, "54": 15, "567e": 15, "6": [8, 11, 12, 13, 16], "60": 16, "7": [2, 3, 8, 11, 12, 19], "75": 16, "79e": 16, "8": [2, 4, 9, 11, 12, 14, 16, 19], "881808": 8, "8x8": [9, 14], "9": [4, 11, 12], "99": [4, 8], "A": [2, 3, 4, 5, 6, 7, 8, 10, 14, 15, 18, 19], "And": 3, "As": [12, 16], "At": [9, 11], "For": [1, 2, 3, 4, 19], "If": [8, 9, 11, 15, 17, 19], "In": [3, 5, 9, 10, 11], "It": 8, "On": 4, "The": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 23], "There": [5, 9], "These": [2, 3], "To": [1, 19, 20, 23], "With": 11, "_build": 1, "_env": [2, 3, 19], "_of_a_of_current_typ": 8, "_scalartype_co": 2, "a_typ": 8, "about": 1, "abov": [16, 17], "accept": 2, "access": [2, 4], "accord": 8, "accumul": [2, 5, 17], "action": [2, 3, 5, 13, 19, 23], "action_mask": [9, 11, 13, 17], "action_spac": [2, 3, 19], "actiontyp": [2, 3], "ad": [2, 3, 12], "adapt": [5, 6, 7, 10, 15, 16, 18], "add": [4, 22], "addit": [5, 9, 11], "addition": [5, 9, 11, 17, 22], "adversari": 5, "aec": [16, 19, 20, 22], "aec_env": 4, "aec_wrapp": 4, "aecwrapp": 4, "after": [2, 8, 10, 12], "agent": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 22], "agent_": 5, "agent_0": 4, "agent_1": 4, "agent_i": [6, 7, 8, 10, 12, 13, 14, 17, 18], "agent_it": [2, 19], "agent_select": 2, "agentid": [2, 3], "aka": [2, 19], "algorithm": [19, 20], "aliv": 15, "all": [2, 3, 8, 9, 11, 12, 14, 17, 19, 20], "all_modul": 22, "allow": 5, "also": [2, 4, 5, 8, 9, 19], "altern": 3, "alternatingli": 9, "alwai": 10, "amount": [6, 7, 16, 18], "an": [2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 14, 16, 17, 18, 19, 22], "analysi": [2, 19], "angl": 15, "ani": [2, 5, 9, 11, 12, 17, 23], "anoth": [12, 17], "ansi": [2, 3], "api": [2, 3, 4, 20], "appli": 4, "appropri": 3, "ar": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 15, 16, 17, 18, 22, 23], "area": [6, 7, 18], "around": 5, "arrai": [2, 3, 4, 6, 7, 18, 19], "assign": [2, 19], "attende": 5, "autobuild": 1, "automat": [1, 2], "avail": [5, 9, 11, 14, 15, 17], "averag": [4, 6, 7, 18], "avoid": [4, 5, 6, 7, 9, 10, 15, 18], "award": [9, 11], "b": 1, "baier_2015": 5, "ball": 16, "ball_elast": 16, "ball_frict": 16, "ball_mass": 16, "bandit": 12, "bargiacchi": 12, "bargiacchi_2018": 5, "base": [2, 3, 4, 5, 9, 10, 11, 12, 15, 17, 19], "baselin": 23, "basic": [19, 20], "beach": 5, "becom": 17, "been": [11, 14], "befor": 16, "behavior": 4, "below": [9, 11, 17], "bernoulli": 12, "between": [4, 5, 16, 17, 19], "binari": [9, 11, 17], "board": [5, 9, 11, 13, 17], "board_height": [9, 11, 17], "board_siz": 11, "board_width": [9, 11, 17], "bonu": 12, "bool": [2, 3], "both": [9, 11, 15], "bottom": 11, "bound": 15, "box": [6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18], "braess": 10, "break": 5, "breakthrough": 5, "bump": 21, "call": [2, 5, 17], "can": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 16, 17, 18, 19, 20], "cannot": [2, 3, 11], "capac": [5, 8], "captur": [5, 9], "case": [9, 10, 11], "catch": 5, "catch_v0": 6, "cbox": 12, "cd": 1, "cell": [9, 11, 14], "central": [3, 23], "certain": [19, 20], "chang": [1, 2, 3, 4, 8, 15, 16], "characteris": 5, "check": [2, 4, 19, 21, 22], "choic": 5, "choos": 10, "circl": 5, "class": [2, 3, 4, 19, 22], "classic": [2, 3], "cleanrl": 23, "close": [2, 3, 16, 17, 19, 23], "closer": 16, "code": [4, 6, 7, 10, 12, 16, 18], "codebas": 10, "colab": [2, 19], "collect": 5, "collid": [6, 7, 18], "collis": [6, 7, 18], "color": [5, 17], "color_reward": 17, "colors_n": 5, "colour": 5, "column": [5, 11, 17], "column_object": 11, "commun": 19, "compar": 19, "compat": 23, "competit": 5, "compliant": 19, "compon": [16, 19, 20], "condit": [6, 7, 18], "configur": 5, "conflict": 11, "congest": 5, "connect": [2, 5, 11, 12, 17], "connect4": 5, "constant": 10, "constructor": 22, "consumpt": 8, "contain": [1, 2, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 23], "continu": [5, 6, 7, 8, 15, 16, 18, 23], "contribut": 1, "control": 2, "convers": 22, "convert": 4, "cooper": [5, 12, 16], "coordin": [6, 7, 9, 17, 18, 19], "corner": 5, "correctli": 4, "correl": 12, "correlated_object": 12, "correspond": [2, 8, 9, 10, 11, 12], "cost": [5, 10], "could": [2, 3], "count": [5, 17], "creat": [2, 19, 22], "crowded": 5, "current": [2, 3, 8, 9, 11, 17, 23], "customiz": 12, "cycl": 16, "d": 17, "data": 3, "decentr": 3, "decis": [2, 19], "default": [2, 3, 5, 8, 11, 12, 14, 15, 16, 17], "defaultli": 12, "defin": 22, "definit": [2, 19], "depend": [4, 10, 19, 20], "describ": [9, 11, 17], "descript": 5, "destin": [5, 10], "detail": [2, 3, 19], "determin": 14, "develop": 19, "diagon": [9, 11], "dict": [2, 3, 4, 9, 11, 13, 17], "dictionari": [3, 9, 11, 17, 22], "diederik": 12, "differ": [2, 5, 8, 10, 12, 17, 19], "dimens": [9, 11, 17], "direct": [6, 7, 18], "directli": 2, "directori": [22, 23], "dirhtml": 1, "disappear": 17, "discord": 19, "discount": 4, "discret": [5, 8, 9, 10, 11, 12, 13, 14, 16, 17, 23], "disentangl": 16, "displai": [2, 3, 6, 7, 10, 18], "distanc": [5, 6, 7, 15, 18], "distance_other_dron": 5, "distance_target": 5, "distribut": 8, "divers": 5, "divid": 9, "do": [9, 11], "doc": 1, "document": [2, 3, 15, 19, 20, 22], "doe": [2, 19, 20], "domain": [8, 12], "don": 22, "down": [14, 16, 17], "draw": [9, 11], "driver": [5, 10], "drone": [5, 6, 7, 18], "drop": [11, 15, 16], "dtype": 2, "due": 12, "duplic": 4, "dure": 15, "e": [1, 2, 3, 12, 22], "each": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18, 19, 22, 23], "edg": 16, "either": [5, 9, 10, 11, 17], "elast": 16, "element": [9, 11, 15, 17], "els": [2, 19], "empti": [9, 11, 14, 17], "encod": [9, 11, 17], "end": [4, 11, 15], "enforc": [2, 3], "enjoy": 5, "entir": 17, "entri": 14, "env": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 22], "env_nam": 22, "environ": [2, 3, 4, 6, 7, 8, 10, 12, 13, 14, 15, 16, 18, 19, 23], "episod": 4, "epsilon": 4, "equal": [8, 11, 12], "equival": [2, 3], "escap": 6, "escort": 5, "escort_v0": 7, "essenti": [5, 19], "etc": 3, "eugenio": 12, "evalu": 23, "even": 15, "everi": [1, 2, 3, 5, 11, 17], "everyth": [19, 20, 22, 23], "exampl": [2, 4, 6, 7, 8, 18, 19, 22], "except": [9, 11, 17], "exclud": 12, "exclus": [9, 11, 17], "execut": [2, 3], "exert": 15, "exist": 11, "expect": 12, "exponenti": 4, "export": 22, "extend": [2, 3, 5, 9, 11, 17, 22], "extens": 5, "extract": 5, "factor": [4, 11], "factori": 22, "fall": [11, 15, 17], "fals": [14, 17], "farama": 19, "fast": 5, "faster": [9, 11], "felten": [6, 7, 18], "few": 4, "file": 23, "fill": [9, 11, 17], "final": [7, 18], "final_target_loc": [7, 18], "find": 12, "finish": 16, "first": [2, 9, 15], "fix": [4, 5], "float": [2, 3, 4, 6, 7, 10, 18], "float32": [6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18], "folder": 1, "follow": [3, 6, 7, 15, 18, 19], "forc": 15, "fork": 1, "format": 5, "forward": 9, "found": [9, 11, 12, 17], "four": 11, "fourth": 9, "frame": 3, "friction": 16, "from": [2, 3, 5, 6, 7, 9, 10, 11, 12, 15, 16, 17, 18, 19, 22, 23], "front": 9, "full": 11, "fulli": 12, "function": [2, 3, 12, 19, 22], "g": 22, "game": [5, 9, 11, 13, 17], "gamma": 4, "gap": 17, "gather": [5, 14], "gem": [5, 12], "gener": [2, 3, 12], "get": [5, 9, 11, 17], "github": 21, "give": 15, "given": [8, 9, 11, 12, 16, 17, 22], "global": [3, 8, 16], "go": [1, 5], "goal": 5, "greater": 12, "grid": [5, 9, 11, 14, 17], "ground": [6, 7, 18], "group": [5, 17], "guid": 20, "gupta_2017": 5, "gym": 3, "gymnasium": [2, 3], "ha": [2, 3, 4, 8, 9, 11, 12, 17], "handl": 4, "have": [4, 9, 10, 11, 14, 19, 22], "height": [9, 11, 17], "heintz": 14, "henc": 12, "here": [2, 19, 23], "heterogen": 10, "higher": 15, "hip": 15, "hold": [9, 11, 17], "home": 9, "horizont": [11, 17], "how": [1, 6, 7, 16, 18], "http": 19, "human": [2, 3, 6, 7, 10, 11, 16, 18, 19], "i": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 23], "id": [8, 12, 14], "ident": 9, "idx": 4, "imag": 16, "immedi": 4, "implement": [2, 3, 23], "import": [2, 3, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19], "incentiv": [9, 11], "includ": [5, 19, 20], "index": [4, 9, 11, 17], "indic": [9, 11, 17, 23], "individu": [17, 23], "inf": [6, 7, 15, 18], "influenc": 5, "info": [2, 3, 19], "inform": [1, 5], "ingeni": 5, "inherit": 4, "init_flying_po": [6, 7, 18], "init_target_loc": [6, 7, 18], "initi": [4, 6, 7, 8, 14, 17, 18, 22], "initial_map": 14, "inproceed": 19, "input": 4, "insert": [2, 3, 19], "insid": 22, "instal": [1, 22], "instanc": [2, 4, 12, 19], "instantan": 2, "instanti": 4, "instead": [19, 22], "int": [2, 3, 6, 7, 8, 10, 16, 18], "int32": 13, "int64": 14, "int8": [9, 11, 13, 17], "integ": [2, 3, 9, 11, 12, 14, 17], "interact": [2, 3, 19], "interest": 5, "intermedi": [7, 18], "intern": 2, "introduc": 5, "iql": 23, "isn": 12, "item": [5, 14], "its": 5, "join": 19, "joint": 15, "just": 12, "kallstrom_2019": 5, "keep": [4, 5, 23], "kei": [2, 3, 4], "kept": 5, "knee": 15, "k\u00e4llstr\u00f6m": 14, "land": 11, "languag": 12, "last": [2, 8, 9, 12, 19], "latenc": 10, "lead": 5, "learn": [6, 7, 10, 18, 19, 20], "least": 17, "leav": 5, "left": [8, 9, 14, 17], "length": [2, 3, 6, 7, 8, 12, 18], "less": 12, "level": [8, 15], "librari": [19, 23], "like": [2, 3, 4, 5, 17, 23], "limit": [12, 16], "linear": [4, 23], "linearizereward": [2, 4, 19], "linearreward": 3, "list": [2, 3, 23], "literatur": 5, "live": 12, "local": [8, 12, 16], "locat": [9, 11, 17], "look": 2, "lose": [9, 11], "lost": 9, "lower": [15, 16], "made": [1, 11], "mai": [2, 3], "main": [9, 11, 17], "maintain": 5, "make": [1, 2, 19], "mani": 5, "mannion_2018": 5, "map": 14, "margin": 10, "marl": 5, "mass": 16, "max": 9, "max_connect": 12, "max_cycl": 16, "max_mov": 9, "max_num_ag": [2, 3], "max_prob": 12, "max_work": 12, "maxim": [5, 6, 7, 15, 18], "maximis": 16, "maximum": [8, 12, 14, 16], "mct": 10, "md": 1, "mean": [9, 11, 17], "member": [2, 3, 22], "met": [6, 7, 18], "method": 22, "min_connect": 12, "min_prob": 12, "min_work": 12, "mine": [5, 12], "minim": [6, 7, 10, 16, 18], "minimis": 10, "minimum": [5, 12], "mixtur": [5, 8], "mo": [4, 5], "mo_aec_to_parallel_wrapp": 4, "mo_parallel_to_aec": 22, "mo_parallel_to_aec_wrapp": 4, "moaec": 2, "moaecenv": 22, "mobeach_v0": [8, 22], "mobil": 5, "mobreakthrough_v0": 9, "mocongestion_v0": 10, "moconnect4_v0": 11, "mode": [2, 3, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18], "model": [2, 19], "modifi": 9, "modul": 4, "mogem_mining_v0": 12, "moingenious_v0": 13, "moitem_gathering_v0": 14, "momaland": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 23], "momappo": 23, "momarl": 19, "momultiwalker_st": [2, 3, 19], "momultiwalker_stability_v0": [2, 3, 15, 19], "monetari": 10, "moparallel": 3, "moparallelenv": 22, "mopistonball_v0": 16, "more": [1, 2, 3, 5, 11, 19], "morl": 23, "mosame_game_v0": 17, "most": 12, "move": [4, 6, 7, 8, 9, 11, 16, 17, 18], "move_count": [9, 11], "multi": [2, 3, 5, 9, 10, 11, 14, 17, 19, 23], "multibinari": 9, "multipl": 5, "multipli": 12, "multiwalk": [5, 15], "must": [2, 3, 4, 5], "n": [2, 5, 17], "n_piston": 16, "n_reward": 5, "name": [2, 3, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 23], "nativ": 4, "navig": 5, "ndarrai": [2, 3], "need": [10, 16], "neg": [9, 12, 14], "negat": [9, 11], "network": [2, 5, 10], "nevertheless": 23, "new": [4, 22], "newli": 4, "next": [2, 16], "non": 12, "none": [2, 3, 6, 7, 8, 10, 14, 16, 18, 19], "normal": 4, "normalizereward": 4, "note": [2, 12], "notebook": [2, 19], "np": [2, 3, 4, 19], "nparrai": [6, 7, 18], "nth": 2, "num_ag": [2, 3, 8, 10, 12, 17], "num_color": 17, "num_dron": [6, 7, 18], "num_intermediate_point": [7, 18], "num_object": [9, 12, 17], "num_timestep": [8, 10, 12, 14], "number": [5, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18], "numpi": [2, 3, 19], "ob": 5, "object": [2, 3, 4, 5, 8, 9, 10, 11, 12, 14, 15, 17, 19, 23], "observ": [2, 3, 13, 19, 23], "observation_spac": [2, 3], "obstyp": [2, 3], "occup": [5, 8], "occupi": 8, "occur": 14, "od": 10, "off": 5, "ol": 23, "onc": 1, "one": [5, 6, 7, 9, 11, 15, 17, 18], "onli": [5, 8, 9, 17], "open": 19, "oppon": [5, 9, 11], "oppos": [2, 3], "option": [2, 3, 5, 6, 7, 9, 10, 11, 16, 18, 19], "order": [6, 7, 17, 18], "org": 19, "origin": [5, 10, 16], "other": [2, 5, 6, 7, 9, 11, 18, 23], "our": [1, 22], "output": 4, "outsid": [2, 3], "over": [5, 8], "overrid": [2, 3, 4, 22], "own": [4, 5, 9, 17], "packag": [1, 5, 15, 22], "pair": 10, "paper": [5, 23], "paradox": 10, "parallel": [6, 7, 8, 10, 14, 18, 20, 22], "parallel_env": [3, 4, 22], "parallel_to_aec": 22, "parallel_wrapp": 4, "paramet": 4, "past": 4, "penal": 15, "penalti": [15, 16], "per": [2, 3, 5, 12, 19], "percentag": [8, 10], "perform": 23, "pettingzoo": [2, 3, 5, 15, 19, 22], "philosophi": 23, "piec": [5, 9], "pip": [1, 19, 20, 22], "piston": 16, "piston_i": 16, "pistonbal": 5, "place": [8, 9, 10, 11, 16], "placement": [9, 11], "plai": [2, 3, 17], "plane": [9, 11, 17], "player": [5, 9, 11, 17], "player_i": [9, 11], "pleas": [12, 19], "point": [5, 7, 9, 10, 11, 17, 18], "polici": [2, 3, 4, 19, 23], "posit": [6, 7, 8, 14, 18], "position_distribut": 8, "possibl": [5, 10, 23], "possible_ag": [2, 3, 4], "practic": 22, "prefer": 10, "previou": 12, "print": [2, 3, 4], "probabl": 12, "problem": [5, 8, 14], "problem_nam": 10, "problemat": [19, 20], "producit": 5, "program": 12, "progress": [2, 3, 5], "proport": 16, "provid": [5, 12, 19, 23], "public": 19, "pure": 22, "puzzl": [5, 17], "py": 22, "pypi": 21, "pytest": 22, "python": [2, 19], "pz": [2, 3, 4, 5, 19, 22], "qmix": 3, "r": 1, "rack": 13, "rais": 16, "random": [8, 10, 16], "random_drop": 16, "random_toll_percentag": 10, "randomis": 14, "randomli": [8, 10, 12, 17], "rang": 4, "rather": 2, "raw_env": 22, "reach": [5, 6, 7, 8, 9, 11, 14, 16, 18], "reachabl": 12, "realli": 12, "rebuild": 1, "receiv": [3, 8, 12], "recent": 4, "record": 4, "recordepisodestatist": 4, "rectangular": 17, "reflect": 5, "reimplement": 12, "reinforc": 19, "releas": 2, "remov": [2, 3, 5, 17], "render": [2, 3, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18], "render_mod": [2, 3, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18, 19], "repositori": [19, 22], "repres": [4, 5, 6, 7, 9, 10, 11, 12, 14, 15, 17, 18], "requir": [1, 19, 20], "research": 19, "reset": [2, 3, 19], "resourc": 2, "respect": [3, 5, 11], "restrict": 5, "return": [2, 3, 5, 19, 22], "reus": 23, "reward": [2, 3, 4, 5, 13, 19, 22, 23], "reward_schem": 8, "reward_spac": [2, 3, 4, 22], "rgb": 16, "rgb_arrai": [2, 3, 11, 16], "right": [5, 8, 9, 14, 16, 17], "rl": [2, 9, 11, 17, 19], "road": [5, 10], "roijer": 12, "rout": [5, 10], "routechoic": 5, "row": 9, "rule": 5, "run": [14, 22], "same": [2, 3, 5, 9, 11, 12, 17], "samegam": 5, "sampl": [2, 3, 19], "scalar": [2, 3, 4, 16, 19, 23], "scale": [4, 11], "scheme": 8, "score": [5, 13], "screen": 11, "screen_scal": 11, "second": [2, 9, 11, 15], "section": [5, 8, 9, 11, 17], "section_capac": 8, "section_consumpt": 8, "section_id": 8, "see": [2, 3, 12, 15, 19, 20, 22], "seed": [2, 3, 12, 19], "seek": 5, "select": [2, 10, 12, 17], "self": [2, 5, 22, 23], "send": 12, "separ": [5, 17], "sequenc": 11, "sequenti": 17, "server": 19, "set": [9, 11, 12, 16, 17, 19, 23], "shake": 15, "shape": [4, 6, 7, 10, 18, 22], "share": [5, 17], "shift": [2, 19], "should": [2, 6, 7, 11, 12, 18, 22], "side": [6, 7, 18], "similar": [2, 4, 19], "simpl": [2, 19], "singl": [2, 5, 9, 11, 15, 17, 19, 23], "sister": 15, "size": [6, 7, 17, 18], "so": [3, 4, 10], "some": [4, 19, 20], "sourc": [2, 3, 4, 6, 7, 10, 18, 19], "space": [2, 3, 4, 5, 13, 22, 23], "specif": [2, 3, 9, 11, 12, 17], "specifi": [2, 16, 22], "speed": [6, 7, 18], "sphinx": 1, "split": 4, "squar": [9, 17], "stabil": [4, 5], "stai": [8, 14, 16], "stand": 11, "standard": [2, 3, 19], "start": 2, "state": [2, 3], "stateless": [10, 12], "static": 18, "statist": 4, "steadi": 5, "step": [2, 3, 6, 7, 15, 18, 19], "still": [15, 16], "store": 2, "str": [2, 3, 6, 7, 8, 10, 18], "straight": 9, "string": [2, 3], "structur": 2, "subprocess": 2, "successfulli": 11, "sum": [5, 8], "suppli": [19, 20], "support": [2, 3, 5], "surround": [5, 6], "surround_v0": 18, "survei": [2, 19], "switch": 2, "system": [19, 20], "t": [4, 12, 22], "take": [2, 3, 4, 10, 16], "taken": [2, 5], "target": [5, 6, 7, 18], "target_spe": [6, 7, 18], "tax": 10, "team": [5, 17, 23], "team_reward": 17, "tensor": [9, 11, 17], "term": 5, "termin": [2, 3, 19], "terminate_on_fal": 15, "test": [19, 20, 22], "than": 11, "thei": [5, 9, 11, 17], "them": [2, 4, 5, 9, 11, 16, 19], "themselv": 5, "therefor": [10, 12], "thi": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 16, 17, 18, 19, 20, 22], "third": 9, "thomasini_2023": 5, "three": 9, "through": [2, 3], "tile": [5, 17], "time": [1, 2, 5, 9, 10, 11, 16], "time_penalti": 16, "timestep": [6, 7, 8, 10, 12, 14, 18], "todo": [5, 19], "togeth": 12, "token": 11, "toll": 10, "toll_mod": 10, "tool": 23, "top": [4, 11], "total": 12, "tourist": 5, "toward": [6, 7, 15, 16, 18], "trade": 5, "train": 3, "trajectori": [4, 7, 18], "transform": 4, "travel": [5, 6, 7, 10, 15, 18], "true": [12, 15, 16, 17], "trunc_prob": 12, "truncat": [2, 3, 19], "try": [6, 9, 21], "tupl": [3, 8, 14], "turn": [5, 9, 11, 17], "two": [5, 8, 9, 10, 11, 16, 17], "txt": 1, "type": [2, 3, 4, 5, 8, 12, 14], "type_distribut": 8, "typic": [2, 3], "uint8": 16, "unchang": 16, "under": 10, "uniform": 8, "until": 11, "up": [5, 9, 14, 16], "upper": 12, "us": [2, 4, 8, 10, 11, 12, 16, 19, 20, 22], "usag": [19, 20], "user": 5, "usual": [9, 11, 17], "util": [2, 3, 5, 19, 22, 23], "v0": 5, "valid": 4, "valu": [2, 3, 4, 6, 7, 8, 11, 12, 15, 16, 18, 19], "variabl": 2, "varianc": 4, "variant": [5, 9, 11, 17], "vec_reward": [2, 3, 19], "vector": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 22], "vectori": 4, "veri": [2, 19], "version": [5, 10, 21, 22], "vertic": [11, 17], "via": 8, "view": 3, "villag": 12, "w": 12, "w_bonu": 12, "wa": [4, 6, 7, 10, 12, 16, 18], "wai": [5, 17], "walker_0": [2, 19], "walker_1": [2, 19], "walker_2": [2, 19], "walker_i": 15, "wall": 16, "we": [2, 5, 19, 23], "websit": 19, "weight": [2, 3, 4, 19], "well": [5, 19, 22], "were": 17, "when": [6, 7, 11, 16, 17, 18], "where": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19], "whether": [9, 11, 14, 16, 17], "which": [2, 3, 5, 6, 7, 9, 11, 12, 15, 17, 18, 19, 22], "while": [3, 5, 6, 7, 10, 18], "whose": [4, 9, 11, 17], "width": [9, 11, 17], "win": [5, 9, 11], "window": [2, 3, 16], "without": [5, 11], "work": [19, 22], "worker": 12, "workflow": 21, "world": 5, "would": [2, 3, 19], "wrapper": [5, 23], "x": [9, 11, 17], "x_0": [6, 7, 18], "x_1": [6, 7, 18], "x_n": [6, 7, 18], "x_targ": [6, 7, 18], "xyz": [6, 7, 18], "y": [9, 17], "y_0": [6, 7, 18], "y_1": [6, 7, 18], "y_n": [6, 7, 18], "y_targ": [6, 7, 18], "you": [2, 3, 19, 22], "your": [1, 2, 3, 19, 22], "z": 9, "z_0": [6, 7, 18], "z_1": [6, 7, 18], "z_n": [6, 7, 18], "z_targ": [6, 7, 18], "zero": [5, 9, 11, 17]}, "titles": ["404 - Page Not Found", "MOMAland documentation", "AEC", "Parallel", "Wrappers and Utils", "All Environments", "Catch", "Escort", "MO-Beach", "MO-Breakthrough", "MO-Congestion", "MO-Connect4", "MO-GemMining", "MO-Ingenious", "MO-ItemGathering", "MO-MultiwalkerStability", "MO-Pistonball", "MO-SameGame", "Surround", "API", "Install", "Release Notes", "Making a Custom Environment", "Learning algorithms"], "titleterms": {"0": 21, "1": 21, "2": 21, "404": 0, "Not": 0, "The": 0, "action": [6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18], "aec": [2, 4], "algorithm": 23, "all": 5, "api": 19, "argument": [6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18], "attribut": [2, 3], "beach": 8, "breakthrough": 9, "build": 1, "catch": 6, "cite": 19, "commit": 22, "congest": 10, "connect4": 11, "convers": 4, "could": 0, "credit": [6, 7, 10, 12, 16, 18], "custom": 22, "document": 1, "dummi": 21, "environ": [5, 22], "episod": [6, 7, 8, 10, 12, 14, 15, 16, 18], "escort": 7, "found": 0, "gemmin": 12, "histori": [9, 11, 17], "hook": 22, "ingeni": 13, "instal": [19, 20], "itemgath": 14, "learn": 23, "legal": [9, 11, 17], "make": 22, "mask": [9, 11, 17], "method": [2, 3], "mo": [8, 9, 10, 11, 12, 13, 14, 15, 16, 17], "moaecenv": 2, "momaland": 1, "moparallelenv": 3, "multiwalkerst": 15, "note": 21, "observ": [6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18], "page": 0, "parallel": [3, 4], "pistonbal": 16, "pre": 22, "releas": 21, "request": 0, "reward": [6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18], "samegam": 17, "setup": 22, "space": [6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18], "start": [6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18], "state": [6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18], "surround": 18, "termin": [6, 7, 8, 10, 12, 14, 15, 16, 18], "truncat": [6, 7, 8, 10, 12, 14, 16, 18], "usag": [2, 3], "util": 4, "v0": 21, "version": [9, 11, 17], "wrapper": 4}}) \ No newline at end of file