diff --git a/main/.buildinfo b/main/.buildinfo
index e841302f..91db9f28 100644
--- a/main/.buildinfo
+++ b/main/.buildinfo
@@ -1,4 +1,4 @@
# Sphinx build info version 1
# This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done.
-config: c36cab8f46d25d045284105903a971ff
+config: 2438baf5055f5ae6b23aa6ccd3bcb8b8
tags: d77d1c0d9ca2f4c8421862c7c5a0d620
diff --git a/main/_images/deep-sea-treasure-mirrored.gif b/main/_images/deep-sea-treasure-mirrored.gif
new file mode 100644
index 00000000..5de3b3d2
Binary files /dev/null and b/main/_images/deep-sea-treasure-mirrored.gif differ
diff --git a/main/_static/videos/deep-sea-treasure-mirrored.gif b/main/_static/videos/deep-sea-treasure-mirrored.gif
new file mode 100644
index 00000000..5de3b3d2
Binary files /dev/null and b/main/_static/videos/deep-sea-treasure-mirrored.gif differ
diff --git a/main/environments/all-environments/index.html b/main/environments/all-environments/index.html
index 211cdb0e..1cee3bfc 100644
--- a/main/environments/all-environments/index.html
+++ b/main/environments/all-environments/index.html
@@ -336,77 +336,82 @@
Available environmentsAgent is a submarine that must collect a treasure while taking into account a time penalty. Treasures values taken from Vamplew et al. 2010 .
-resource-gathering-v0
+deep-sea-treasure-mirrored-v0
+Discrete / Discrete
+[treasure, time_penalty]
+Harder version of the concave DST. From Felten et al. 2022 .
+
+resource-gathering-v0
Discrete / Discrete
[enemy, gold, gem]
Agent must collect gold or gem. Enemies have a 10% chance of killing the agent. From Barret & Narayanan 2008 .
-fishwood-v0
+fishwood-v0
Discrete / Discrete
[fish_amount, wood_amount]
ESR environment, the agent must collect fish and wood to light a fire and eat. From Roijers et al. 2018 .
-breakable-bottles-v0
+breakable-bottles-v0
Discrete (Dictionary) / Discrete
[time_penalty, bottles_delivered, potential]
Gridworld with 5 cells. The agents must collect bottles from the source location and deliver to the destination. From Vamplew et al. 2021 .
-fruit-tree-v0
+fruit-tree-v0
Discrete / Discrete
[nutri1, ..., nutri6]
Full binary tree of depth d=5,6 or 7. Every leaf contains a fruit with a value for the nutrients Protein, Carbs, Fats, Vitamins, Minerals and Water. From Yang et al. 2019 .
-water-reservoir-v0
+water-reservoir-v0
Continuous / Continuous
[cost_flooding, deficit_water]
A Water reservoir environment. The agent executes a continuous action, corresponding to the amount of water released by the dam. From Pianosi et al. 2013 .
-four-room-v0
+four-room-v0
Discrete / Discrete
[item1, item2, item3]
Agent must collect three different types of items in the map and reach the goal. From Alegre et al. 2022 .
-mo-mountaincar-v0
+mo-mountaincar-v0
Continuous / Discrete
[time_penalty, reverse_penalty, forward_penalty]
Classic Mountain Car env, but with extra penalties for the forward and reverse actions. From Vamplew et al. 2011 .
-mo-mountaincarcontinuous-v0
+mo-mountaincarcontinuous-v0
Continuous / Continuous
[time_penalty, fuel_consumption_penalty]
Continuous Mountain Car env, but with penalties for fuel consumption.
-mo-lunar-lander-v2
+mo-lunar-lander-v2
Continuous / Discrete or Continuous
[landed, shaped_reward, main_engine_fuel, side_engine_fuel]
MO version of the LunarLander-v2
environment . Objectives defined similarly as in Hung et al. 2022 .
-minecart-v0
+minecart-v0
Continuous or Image / Discrete
[ore1, ore2, fuel]
Agent must collect two types of ores and minimize fuel consumption. From Abels et al. 2019 .
-mo-highway-v0
and mo-highway-fast-v0
+mo-highway-v0
and mo-highway-fast-v0
Continuous / Discrete
[speed, right_lane, collision]
The agent’s objective is to reach a high speed while avoiding collisions with neighbouring vehicles and staying on the rightest lane. From highway-env .
-mo-supermario-v0
+mo-supermario-v0
Image / Discrete
[x_pos, time, death, coin, enemy]
[:warning: SuperMarioBrosEnv support is limited.] Multi-objective version of SuperMarioBrosEnv . Objectives are defined similarly as in Yang et al. 2019 .
-mo-reacher-v4
+mo-reacher-v4
Continuous / Discrete
[target_1, target_2, target_3, target_4]
Mujoco version of mo-reacher-v0
, based on Reacher-v4
environment .
-mo-hopper-v4
+mo-hopper-v4
Continuous / Continuous
[velocity, height, energy]
Multi-objective version of Hopper-v4 env.
-mo-halfcheetah-v4
+mo-halfcheetah-v4
Continuous / Continuous
[velocity, energy]
Multi-objective version of HalfCheetah-v4 env. Similar to Xu et al. 2020 .
diff --git a/main/environments/deep-sea-treasure-mirrored/index.html b/main/environments/deep-sea-treasure-mirrored/index.html
index 09cbe437..66db7835 100644
--- a/main/environments/deep-sea-treasure-mirrored/index.html
+++ b/main/environments/deep-sea-treasure-mirrored/index.html
@@ -307,7 +307,7 @@
Deep-Sea-Treasure-Mirrored
-
+
diff --git a/main/searchindex.js b/main/searchindex.js
index 843f35ab..b129854e 100644
--- a/main/searchindex.js
+++ b/main/searchindex.js
@@ -1 +1 @@
-Search.setIndex({"docnames": ["404", "README", "citing/citing", "community/community", "environments/all-environments", "environments/breakable-bottles", "environments/deep-sea-treasure", "environments/deep-sea-treasure-concave", "environments/deep-sea-treasure-mirrored", "environments/fishwood", "environments/four-room", "environments/fruit-tree", "environments/minecart", "environments/minecart-deterministic", "environments/mo-halfcheetah", "environments/mo-highway", "environments/mo-highway-fast", "environments/mo-hopper", "environments/mo-hopper-2d", "environments/mo-lunar-lander", "environments/mo-lunar-lander-continuous", "environments/mo-mountaincar", "environments/mo-mountaincarcontinuous", "environments/mo-reacher", "environments/mo-supermario", "environments/resource-gathering", "environments/water-reservoir", "examples/citation", "examples/morl_baselines", "examples/publications", "index", "introduction/api", "introduction/install", "release_notes", "tutorials/custom_env", "wrappers/wrappers"], "filenames": ["404.md", "README.md", "citing/citing.md", "community/community.md", "environments/all-environments.md", "environments/breakable-bottles.md", "environments/deep-sea-treasure.md", "environments/deep-sea-treasure-concave.md", "environments/deep-sea-treasure-mirrored.md", "environments/fishwood.md", "environments/four-room.md", "environments/fruit-tree.md", "environments/minecart.md", "environments/minecart-deterministic.md", "environments/mo-halfcheetah.md", "environments/mo-highway.md", "environments/mo-highway-fast.md", "environments/mo-hopper.md", "environments/mo-hopper-2d.md", "environments/mo-lunar-lander.md", "environments/mo-lunar-lander-continuous.md", "environments/mo-mountaincar.md", "environments/mo-mountaincarcontinuous.md", "environments/mo-reacher.md", "environments/mo-supermario.md", "environments/resource-gathering.md", "environments/water-reservoir.md", "examples/citation.md", "examples/morl_baselines.md", "examples/publications.md", 
"index.md", "introduction/api.md", "introduction/install.md", "release_notes.md", "tutorials/custom_env.md", "wrappers/wrappers.md"], "titles": ["404 - Page Not Found", "MO-Gymnasium documentation", "<no title>", "Community", "Available environments", "Breakable-Bottles", "Deep-Sea-Treasure", "Deep-Sea-Treasure-Concave", "Deep-Sea-Treasure-Mirrored", "Fishwood", "Four-Room", "Fruit-Tree", "Minecart", "Minecart-Deterministic", "MO-Halfcheetah", "MO-Highway", "MO-Highway-Fast", "MO-Hopper", "MO-Hopper-2D", "MO-Lunar-Lander", "MO-Lunar-Lander-Continuous", "MO-Mountaincar", "MO-Mountaincarcontinuous", "MO-Reacher", "MO-Supermario", "Resource-Gathering", "Water-Reservoir", "Citation", "MORL Baselines", "List of Publications", "API", "API", "Install", "Release Notes", "Creating a custom environment", "Wrappers"], "terms": {"thi": [1, 5, 27, 30, 31, 32, 33, 34, 35], "folder": 1, "contain": [1, 4, 5, 10, 11, 12, 13, 21, 22, 23, 28], "For": [1, 23, 30, 31, 33, 35], "more": [1, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 24, 30, 31], "inform": [1, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 24], "about": 1, "how": [1, 12, 13, 24, 28], "contribut": [1, 3, 5, 33], "go": [1, 9, 11, 17, 18, 21], "our": [1, 33, 34], "md": [1, 33], "instal": 1, "requir": 1, "packag": [1, 33], "your": [1, 27, 30], "fork": 1, "pip": [1, 30, 32], "r": [1, 35], "doc": [1, 33], "txt": 1, "e": [1, 5, 6, 7, 8, 22, 27, 30, 33], "To": [1, 6, 7, 8, 30, 32], "onc": 1, "cd": 1, "make": [1, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 30, 31, 33, 34, 35], "dirhtml": 1, "rebuild": 1, "automat": [1, 33, 34], "everi": [1, 4, 11], "time": [1, 4, 5, 6, 7, 8, 12, 13, 21, 22, 24, 35], "chang": [1, 6, 7, 8, 33], "i": [1, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 28, 30, 31, 33, 34, 35], "made": [1, 33], "sphinx": 1, "autobuild": 1, "b": 1, "_build": 1, "If": [3, 12, 13, 17, 18, 26, 27, 30, 34], "you": [3, 27, 30, 31, 32, 33, 34], "want": 3, 
"help": 3, "u": 3, "out": [3, 26], "reach": [3, 4, 6, 7, 8, 10, 11, 21, 22, 24, 33], "simpli": 3, "ask": 3, "question": 3, "can": [3, 5, 9, 11, 12, 13, 26, 30, 31, 32, 33], "join": [3, 30], "farama": [3, 12, 13, 30, 33], "discord": [3, 30], "server": [3, 30], "here": [3, 10, 30, 31, 33], "asid": 3, "from": [3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 25, 26, 33, 34, 35], "main": [3, 12, 13, 19, 20], "contributor": [3, 33], "some": [3, 30, 32, 34], "peopl": 3, "have": [3, 4, 5, 30, 33], "also": [3, 30, 31, 33], "project": 3, "variou": [3, 28], "wai": [3, 10], "we": [3, 30, 33, 34], "would": 3, "like": [3, 30, 32, 35], "thank": 3, "them": [3, 5, 30, 31], "all": [3, 25, 30, 32, 33, 35], "The": [3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 30, 33, 34], "minecart": [3, 4, 30, 31, 33], "v0": [3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, 21, 22, 23, 24, 25, 26, 30, 31], "env": [3, 4, 12, 13, 14, 15, 17, 18, 19, 20, 21, 30, 31, 33, 34, 35], "refactor": [3, 12, 13], "http": [3, 5, 6, 7, 8, 12, 13, 25, 26, 30], "github": [3, 12, 13, 33], "com": [3, 12, 13], "axelabel": 3, "dynmorl": 3, "deep": [3, 4, 33], "sea": [3, 4, 33], "treasur": [3, 4, 5, 25, 33], "fruit": [3, 4, 33], "tree": [3, 4, 26, 33], "mo": [3, 4, 12, 13, 27, 28, 29, 30, 31, 32, 34, 35], "supermario": [3, 4], "ar": [3, 4, 5, 12, 13, 25, 26, 33, 34, 35], "base": [3, 4, 5, 12, 13, 23, 26, 33], "runzheyang": 3, "morl": [3, 4, 6, 7, 8, 9, 30, 31, 33], "four": [3, 4, 23], "room": [3, 4], "mike": [3, 10], "gimelfarb": [3, 10], "successor": [3, 10], "featur": [3, 10, 33], "transfer": [3, 10], "fishwood": [3, 4, 33], "code": [3, 6, 7, 8, 9, 10, 12, 13, 26, 33], "wa": [3, 5, 6, 7, 8, 12, 13, 29, 33], "provid": [3, 9, 30, 33], "deni": [3, 9], "steckelmach": [3, 9], "conor": 3, "f": [3, 26], "hay": 3, "water": [3, 4, 11, 33], "reservoir": [3, 4, 33], "mathieu": [3, 26], "reymond": [3, 26], "gymnasium": [4, 12, 13, 14, 17, 18, 19, 20, 21, 28, 29, 30, 31, 32, 34, 35], "includ": [4, 30, 32], 
"taken": 4, "literatur": [4, 33], "well": [4, 30, 33, 34], "multi": [4, 9, 14, 15, 16, 17, 18, 19, 20, 21, 24, 26, 27, 29, 30, 31, 33, 34], "object": [4, 9, 14, 15, 16, 17, 18, 19, 20, 21, 24, 26, 27, 29, 30, 31, 33, 34], "version": [4, 10, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 33], "classic": [4, 6, 7, 8, 34], "mujoco": [4, 23, 30, 32], "ob": [4, 30, 31, 33], "action": [4, 14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 33, 34], "space": [4, 16, 33, 34], "descript": 4, "discret": [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, 19, 21, 23, 24, 25, 33], "time_penalti": [4, 5], "agent": [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 25, 26, 33], "submarin": [4, 6, 7, 8], "must": [4, 12, 13, 34], "collect": [4, 9, 10, 11, 12, 13, 24, 25, 33], "while": [4, 5, 10], "take": 4, "account": 4, "penalti": [4, 5, 6, 7, 8, 21, 22, 24], "valu": [4, 6, 7, 8, 11], "yang": [4, 6, 7, 8, 11], "et": [4, 6, 7, 8, 10, 11, 12, 13, 29], "al": [4, 6, 7, 8, 10, 11, 12, 13, 29], "2019": [4, 6, 7, 8, 11, 12, 13], "concav": [4, 33], "vamplew": 4, "2010": 4, "resourc": [4, 33], "gather": [4, 33], "enemi": [4, 5, 24, 25], "gold": [4, 5, 25], "gem": [4, 5, 25], "10": [4, 6, 7, 8, 11, 25, 26], "chanc": 4, "kill": [4, 24, 25], "barret": 4, "narayanan": [4, 25], "2008": [4, 25], "fish_amount": 4, "wood_amount": 4, "esr": [4, 33], "fish": [4, 9], "wood": [4, 9], "light": 4, "fire": 4, "eat": 4, "roijer": 4, "2018": 4, "breakabl": 4, "bottl": 4, "dictionari": [4, 5], "bottles_deliv": [4, 5], "potenti": [4, 5, 33], "gridworld": [4, 10], "5": [4, 5, 11, 12, 13, 15, 16, 19, 20, 24, 25, 33], "cell": 4, "sourc": [4, 6, 7, 8, 10, 12, 13, 22, 30], "locat": [4, 5, 23], "deliv": [4, 5], "destin": 4, "2021": 4, "nutri1": 4, "nutri6": 4, "full": [4, 11, 33], "binari": [4, 10, 11], "depth": [4, 11], "d": [4, 11], "6": [4, 11, 12, 13, 14, 21, 22, 23], "7": [4, 6, 11, 12, 13, 33], "leaf": [4, 11], "nutrient": [4, 11], "protein": [4, 11], "carb": [4, 11], "fat": [4, 11], "vitamin": [4, 11], "miner": [4, 11], "continu": [4, 6, 7, 8, 
22, 26, 33], "cost_flood": 4, "deficit_wat": 4, "A": [4, 5, 10, 16, 21, 22, 26, 27, 29, 30, 31, 35], "execut": [4, 26], "correspond": [4, 26], "amount": [4, 26], "releas": [4, 26], "dam": [4, 26], "pianosi": [4, 26], "2013": 4, "item1": 4, "item2": 4, "item3": 4, "three": 4, "differ": 4, "type": [4, 12, 13], "item": [4, 10], "map": [4, 6, 7, 8, 10, 12, 13], "goal": [4, 10, 21, 22], "alegr": [4, 27, 29, 30], "2022": [4, 27, 29, 30, 33], "mountaincar": [4, 22], "reverse_penalti": 4, "forward_penalti": 4, "mountain": [4, 21, 22, 33], "car": [4, 33], "extra": 4, "forward": [4, 14, 17, 18, 21], "revers": [4, 5, 21], "2011": 4, "mountaincarcontinu": [4, 33], "fuel_consumption_penalti": 4, "fuel": [4, 12, 13, 19, 20, 22], "consumpt": [4, 12, 13], "lunar": [4, 33], "lander": [4, 33], "v2": [4, 19, 20, 33], "land": [4, 19, 20], "shaped_reward": 4, "main_engine_fuel": 4, "side_engine_fuel": 4, "lunarland": [4, 19, 20], "defin": [4, 5, 23, 34], "similarli": 4, "hung": 4, "imag": [4, 12, 13, 26], "ore1": 4, "ore2": 4, "two": [4, 9, 12, 13, 23, 24, 26], "ores": [4, 12, 13], "minim": [4, 12, 13], "abel": [4, 12, 13], "highwai": [4, 33], "fast": [4, 33], "speed": [4, 12, 13, 15, 33], "right_lan": 4, "collis": [4, 15], "": [4, 6, 7, 8, 10, 12, 13, 14, 17, 18, 19, 20, 21, 30, 31, 35], "high": [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 33], "avoid": [4, 10], "neighbour": 4, "vehicl": 4, "stai": 4, "rightest": 4, "lane": [4, 15], "x_po": 4, "death": 4, "coin": [4, 24], "warn": [4, 33], "supermariobrosenv": 4, "support": 4, "limit": 4, "reacher": 4, "v4": [4, 14, 17, 18, 23, 33], "target_1": 4, "target_2": 4, "target_3": 4, "target_4": 4, "hopper": [4, 33], "veloc": [4, 23], "height": 4, "energi": 4, "halfcheetah": [4, 33], "similar": 4, "xu": 4, "2020": 4, "3": [5, 6, 7, 8, 10, 12, 13, 15, 16, 17, 18, 19, 20, 21, 23, 24, 25, 26], "dict": 5, "bottles_carri": 5, "2": [5, 6, 7, 8, 9, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 
30, 31], "bottles_drop": 5, "multibinari": 5, "shape": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 34, 35], "0": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 30, 31, 35], "50": [5, 33], "low": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26], "inf": [5, 14, 15, 16, 17, 18, 19, 20, 23, 24, 26], "1": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 35], "import": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 30, 31, 33, 34], "mo_gymnasium": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 30, 31, 33, 34, 35], "environ": [5, 6, 7, 8, 9, 10, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 29, 30, 31, 32, 35], "implement": [5, 28, 34], "problem": [5, 6, 7, 8, 9, 26, 33], "unbreakablebottl": 5, "breakablebottl": 5, "section": 5, "4": [5, 6, 7, 8, 10, 12, 13, 19, 20, 23, 24, 25, 26], "paper": 5, "multiobject": 5, "reinforc": [5, 9, 10, 27, 29, 30, 31, 33], "learn": [5, 9, 10, 25, 27, 29, 30, 31, 33], "approach": 5, "impact": 5, "ai": 5, "safeti": 5, "move": [5, 24, 25, 33], "left": [5, 6, 7, 8, 10, 11, 12, 13, 25], "right": [5, 6, 7, 8, 10, 11, 12, 13, 15, 25], "pick": 5, "up": [5, 6, 7, 8, 10, 25, 26], "kei": 5, "current": [5, 6, 7, 8, 26], "number": [5, 26], "carri": 5, "ha": [5, 24, 33], "each": [5, 6, 7, 8, 11, 12, 13, 21, 22, 23, 33], "boolean": [5, 35], "flag": [5, 17, 18, 24, 25, 34], "indic": [5, 10, 25, 33], "dimens": [5, 33], "step": [5, 6, 7, 8, 9, 21, 22, 24, 26, 29, 30, 31, 33, 34], "bottle_reward": 5, "multipl": [5, 25, 28, 33, 34, 35], "small": [5, 33], "probabl": [5, 9], "drop": [5, 33], "appli": [5, 23, 35], "ground": 5, "when": [5, 6, 7, 8, 9, 10, 11, 12, 13, 24, 25, 33, 34], "size": [5, 11, 34], "prob_drop": 5, "unbreakable_bottl": 5, "true": [5, 6, 7, 8, 12, 13, 26, 35], "which": [5, 6, 7, 8, 9, 10, 30, 33, 34, 35], 
"again": 5, "so": 5, "outcom": 5, "otherwis": [5, 9, 24], "cannot": 5, "origin": [5, 30, 31, 33], "robert": 5, "klassert": 5, "home": [5, 25], "asset": [5, 25, 33], "limezu": [5, 25], "itch": [5, 6, 7, 8, 25], "io": [5, 6, 7, 8, 25], "serenevillagerevamp": [5, 25], "ninjikin": [5, 25], "pixel": [5, 6, 7, 8, 33], "art": [5, 6, 7, 8, 33], "creat": [5, 6, 7, 8, 30, 31, 33], "assist": [5, 6, 7, 8], "dall": [5, 6, 7, 8], "11": [6, 7, 17, 18, 33], "23": [6, 33], "control": [6, 7, 8, 9, 14, 17, 18], "2d": [6, 7, 8, 10, 12, 13, 22, 33, 35], "grid": [6, 7, 8], "world": [6, 7, 8], "box": [6, 7, 8, 14, 17, 18, 20, 22, 26], "x": [6, 7, 8, 17, 18, 24, 25], "y": [6, 7, 8, 25], "coordin": [6, 7, 8, 25, 30], "where": [6, 7, 8, 9, 11, 21, 22, 33], "down": [6, 7, 8, 10, 25], "dimension": [6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 23, 24, 25], "posit": [6, 7, 8, 10, 12, 13, 23, 24, 25], "alwai": [6, 7, 8], "same": [6, 7, 8], "dst_map": [6, 7, 8], "default": [6, 7, 8, 12, 13, 26], "convex": [6, 7, 8], "us": [6, 7, 8, 12, 13, 26, 27, 28, 29, 30, 31, 32, 33, 34], "deepseatreasur": [6, 7, 8], "concave_map": [6, 7, 8], "mirrored_map": [6, 7, 8], "float_stat": [6, 7, 8], "adapt": [6, 7, 8, 10], "background": [6, 7, 8, 26], "ansimuz": [6, 7, 8], "underwat": [6, 7, 8], "fantasi": [6, 7, 8], "124": [7, 8], "20": [8, 33], "simpl": [9, 30, 31], "fisherman": 9, "either": 9, "expect": 9, "util": [9, 33], "return": [9, 12, 13, 25, 30, 33, 34, 35], "woodproba": 9, "fishproba": 9, "episod": [9, 26, 35], "end": [9, 12, 13], "after": [9, 33, 35], "max_t": 9, "200": 9, "catch": 9, "14": [10, 19, 20, 33], "13": [10, 33], "introduc": [10, 33], "an": [10, 24, 25, 26, 30, 31, 33, 34, 35], "those": 10, "neg": [10, 22, 23], "travel": 10, "fix": 10, "split": 10, "separ": 10, "wall": 10, "passag": 10, "barreto": 10, "andr\u00e9": 10, "nip": 10, "2017": 10, "plu": 10, "vector": [10, 11, 12, 13, 21, 22, 24, 30, 33, 34, 35], "were": 10, "follow": [10, 12, 13, 30, 31, 33, 34], "compon": [10, 33, 35], 
"blue": 10, "squar": 10, "els": [10, 24, 25], "green": 10, "triangl": 10, "red": 10, "circl": 10, "lower": 10, "g": [10, 33], "maze": 10, "arrai": [10, 30, 31, 33, 34, 35], "see": [10, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 24, 30, 31, 33, 34], "exampl": [10, 28, 30, 31, 33], "63": 11, "chose": 11, "node": 11, "therefor": 11, "receiv": [11, 33], "root": [11, 33], "cart": [12, 13], "sin": [12, 13, 23], "co": [12, 13, 23], "orient": [12, 13], "porcentag": [12, 13], "capac": [12, 13], "fill": [12, 13], "image_observ": [12, 13], "3d": [12, 13, 21], "mine": [12, 13], "acceler": [12, 13], "brake": [12, 13], "none": [12, 13, 26, 33, 35], "quantiti": [12, 13], "first": [12, 13, 26, 29, 33, 35], "minerium": [12, 13], "retriev": [12, 13], "spars": [12, 13], "second": [12, 13], "consum": [12, 13], "dens": [12, 13], "upper": [12, 13], "corner": [12, 13], "render_mod": [12, 13, 26], "render": [12, 13, 26, 34], "mode": [12, 13, 26], "rgb_arrai": [12, 13, 26], "human": [12, 13, 26], "rgb": [12, 13], "frame_skip": [12, 13], "mani": [12, 13], "repeat": [12, 13], "incremental_frame_skip": [12, 13], "whether": [12, 13, 26], "increment": [12, 13], "config": [12, 13], "path": [12, 13], "json": [12, 13], "configur": [12, 13], "file": [12, 13, 33, 34], "foundat": [12, 13, 33], "blob": [12, 13], "mine_config": [12, 13], "axel": [12, 13], "float32": [14, 17, 18, 20, 22, 26], "observ": [14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 33, 34], "17": 14, "halfcheetahenv": 14, "run": [14, 34, 35], "cost": [14, 17, 18, 19, 20, 26], "highwayenv": 15, "reward": [16, 30, 31, 33, 34, 35], "highwayfastenv": 16, "hopperenv": [17, 18], "axi": [17, 18], "jump": [17, 18], "z": [17, 18], "cost_object": [17, 18], "set": [17, 18, 30, 33, 34], "fals": [17, 18], "ad": [17, 18, 34], "other": [17, 18, 30, 31, 33, 35], "8": [19, 20, 26, 30, 31, 33], "100": [19, 20, 24, 35], "crash": [19, 20], "successfulli": [19, 20], "engin": [19, 20], "side": [19, 20], "07": [21, 22], "top": [21, 22], "norm": 22, "9": 23, "angl": 
23, "central": 23, "elbow": 23, "joint": [23, 26], "angular": 23, "possibl": 23, "zero": [23, 33], "torqu": 23, "distanc": [23, 34], "tip": 23, "arm": 23, "target": 23, "comput": [23, 33, 34], "r_i": 23, "finger_tip_coord": 23, "target_i": 23, "256": 24, "240": 24, "255": 24, "25": [24, 33], "supermariobro": 24, "gym": [24, 27, 29, 30, 31, 33], "super": 24, "mario": [24, 33], "bro": 24, "far": 24, "much": 24, "pass": [24, 33], "between": [24, 26, 30, 33, 35], "di": 24, "point": 24, "barrett": 25, "leon": 25, "srini": 25, "optim": [25, 29, 33], "polici": [25, 29], "criteria": 25, "proceed": [25, 27, 30], "25th": 25, "intern": [25, 26], "confer": [25, 26, 27, 30], "machin": 25, "41": [25, 33], "47": [25, 33], "1145": 25, "1390156": 25, "1390162": 25, "consist": 25, "element": [25, 33], "diamond": 25, "castelletti": 26, "m": 26, "restelli": 26, "fit": 26, "q": 26, "iter": [26, 35], "markov": 26, "decis": 26, "2012": 26, "neural": 26, "network": 26, "ijcnn": 26, "brisban": 26, "qld": 26, "australia": 26, "pp": 26, "doi": 26, "1109": 26, "6252759": 26, "float": [26, 34, 35], "level": [26, 33], "normalized_act": 26, "percentag": 26, "There": 26, "due": 26, "excess": 26, "wrt": 26, "flood": 26, "threshold": 26, "upstream": 26, "deficit": 26, "suppli": 26, "demand": 26, "hydroelectr": 26, "downstream": 26, "By": 26, "onli": 26, "initi": [26, 34], "random": 26, "160": 26, "ansi": 26, "time_limit": 26, "maximum": 26, "until": 26, "truncat": [26, 30, 31, 33, 34], "nO": 26, "penal": 26, "select": 26, "bound": [26, 33], "normal": [26, 33, 35], "initial_st": 26, "port": 26, "simon": 26, "parisi": 26, "sky": 26, "paulina": 26, "riva": 26, "opengameart": 26, "org": [26, 30], "content": 26, "repositori": [27, 28, 30, 34], "research": [27, 29, 30], "pleas": [27, 29, 30], "cite": 27, "inproceed": [27, 30], "2022bnaic": [27, 30], "author": [27, 30], "luca": [27, 30], "n": [27, 30], "florian": [27, 30], "felten": [27, 29, 30], "el": [27, 30], "ghazali": [27, 30], "talbi": [27, 30], 
"gr": [27, 30], "goir": [27, 30], "danoi": [27, 30], "ann": [27, 30], "now": [27, 30, 33], "ana": [27, 30], "l": [27, 30, 35], "c": [27, 30], "bazzan": [27, 30], "bruno": [27, 30], "da": [27, 30], "silva": [27, 30], "titl": [27, 30], "librari": [27, 29, 30, 33], "booktitl": [27, 30], "34th": [27, 30], "benelux": [27, 30], "artifici": [27, 30], "intellig": [27, 30], "bnaic": [27, 29, 30], "benelearn": [27, 30], "year": [27, 30], "algorithm": [28, 30], "under": [28, 33], "api": [28, 33, 34], "It": [28, 30, 31, 33, 34], "wrapper": [28, 30, 31, 33], "formerli": 29, "publish": 29, "submiss": 29, "open": [29, 30], "pull": [29, 34], "request": 29, "add": [29, 33], "miss": [29, 33], "entri": 29, "sampl": 29, "effici": 29, "via": 29, "gener": [29, 33, 34], "improv": 29, "priorit": 29, "aama": 29, "2023": [29, 33], "hyperparamet": 29, "modem": 29, "workshop": 29, "leverag": 29, "approxim": 29, "model": [29, 30, 31], "neurip": 29, "toolkit": 29, "reliabl": 29, "benchmark": 29, "standard": [30, 33], "suit": 30, "python": [30, 31, 33], "develop": 30, "compar": 30, "commun": [30, 33], "compliant": 30, "essenti": [30, 34], "numpi": [30, 31, 33, 34, 35], "document": [30, 34], "websit": [30, 33], "public": 30, "work": 30, "gg": 30, "bnj6kubtg6": 30, "As": [30, 31], "class": [30, 31, 35], "instanc": [30, 31, 33], "interact": [30, 31], "veri": [30, 31, 34], "mo_gym": [30, 31, 33], "np": [30, 31, 33], "info": [30, 31, 33, 34, 35], "reset": [30, 31, 33, 34], "vector_reward": [30, 31, 33], "next_ob": [30, 31, 33], "termin": [30, 31, 33, 34], "your_ag": [30, 31, 33], "act": [30, 31, 33], "option": [30, 31, 33, 34], "scalar": [30, 31, 33, 35], "function": [30, 31, 33], "linearreward": [30, 31, 33], "weight": [30, 31, 33, 35], "detail": [30, 31], "mdp": [30, 31, 33], "momdp": [30, 31, 33], "definit": [30, 31], "practic": [30, 31], "guid": [30, 31], "plan": [30, 31], "check": [30, 31], "colab": [30, 31], "notebook": [30, 31], "doe": [30, 32, 35], "depend": [30, 32], "famili": [30, 32], 
"problemat": [30, 32], "certain": [30, 32], "system": [30, 32], "one": [30, 32], "08": [33, 35], "24": 33, "pypi": 33, "lucasalegr": 33, "75": 33, "momaxandskipobserv": 33, "76": 33, "modifi": 33, "part": [33, 34], "info_dict": 33, "ianleongudri": 33, "69": 33, "order": 33, "wrap": [33, 35], "morecordepisodestatist": 33, "ffelten": 33, "70": 33, "73": 33, "tuto": 33, "custom": 33, "creation": 33, "72": 33, "test": 33, "worker": 33, "67": 33, "pf": 33, "cc": 33, "determinist": 33, "74": 33, "changelog": 33, "06": 33, "12": 33, "thrill": 33, "design": 33, "expand": 33, "capabl": 33, "rl": 33, "scenario": 33, "need": 33, "mai": 33, "conflict": 33, "repres": 33, "distinct": 33, "In": 33, "context": 33, "trade": 33, "off": 33, "combin": 33, "linearli": 33, "predefin": 33, "shown": 33, "snippet": 33, "ctrl_cost": 33, "self": [33, 34], "control_cost": 33, "forward_reward": 33, "_forward_reward_weight": 33, "x_veloc": 33, "With": 33, "user": 33, "flexibl": 33, "determin": 33, "compromis": 33, "thei": [33, 35], "desir": 33, "prefer": 33, "consequ": 33, "do": 33, "thu": 33, "extend": 33, "vectori": 33, "illustr": 33, "domain": 33, "balanc": 33, "save": 33, "batteri": 33, "mark": 33, "within": [33, 34], "stabl": 33, "achiev": 33, "qualiti": 33, "allow": 33, "fall": 33, "back": 33, "singl": 33, "rang": 33, "inher": 33, "exhaust": 33, "list": 33, "avail": [33, 35], "addition": 33, "tailor": 33, "monormalizereward": 33, "linearwrapp": 33, "transform": 33, "new": 33, "bump": 33, "51": 33, "52": 33, "accept": 33, "55": 33, "updat": 33, "pre": 33, "commit": 33, "spell": 33, "mistak": 33, "56": 33, "method": [33, 34], "57": 33, "59": 33, "60": 33, "61": 33, "65": 33, "caus": 33, "randint": 33, "deprec": 33, "53": 33, "edit": 33, "button": 33, "58": 33, "62": 33, "logo": 33, "64": 33, "03": 33, "pareto_front": [33, 34], "get": 33, "dst": 33, "45": 33, "43": 33, "baselin": 33, "readm": 33, "40": 33, "link": 33, "mgoulao": 33, "42": 33, "enabl": 33, "46": 33, "02": 33, "ezpickl": 33, 
"34": 33, "37": 33, "convers": 33, "33": 33, "eval_mo": 33, "observationspac": 33, "dtype": 33, "35": 33, "gif": 33, "minor": 33, "directli": [33, 35], "26": 33, "pydoc": 33, "31": 33, "hotfix": 33, "27": 33, "vecenv": 33, "30": 33, "01": 33, "benefit": 33, "structur": 33, "should": [33, 34], "higher": 33, "integr": 33, "tool": 33, "break": 33, "been": 33, "renam": 33, "previous": 33, "call": 33, "reli": 33, "instead": 33, "16": 33, "access": 33, "deep_sea_treasur": 33, "gt": 33, "18": 33, "life": 33, "ci": 33, "setup": 33, "19": 33, "init": 33, "Of": 33, "conduct": 33, "21": 33, "09": 33, "revamp": 33, "lint": 33, "bib": 33, "citat": 33, "few": [33, 35], "bugfix": 33, "tutori": 34, "goe": 34, "through": 34, "sinc": [34, 35], "close": 34, "ti": 34, "refer": 34, "its": 34, "inherit": 34, "__init__": 34, "constructor": 34, "action_spac": 34, "observation_spac": 34, "attribut": 34, "moreov": 34, "deal": 34, "reward_spac": 34, "reward_dim": 34, "integ": 34, "seed": 34, "kwarg": 34, "perform": 34, "next": 34, "gamma": [34, 35], "discount": [34, 35], "pareto": 34, "front": 34, "known": 34, "oject": 34, "metric": 34, "invert": 34, "igd": 34, "registri": 34, "done": 34, "line": 34, "py": 34, "directori": 34, "registr": 34, "id": 34, "my_env_v0": 34, "entry_point": 34, "my_env_dir": 34, "my_env_fil": 34, "myenv": 34, "pytest": 34, "inspir": 35, "modul": 35, "ndarrai": 35, "dot": 35, "product": 35, "idx": 35, "int": 35, "99": 35, "epsilon": 35, "1e": 35, "index": 35, "touch": 35, "min_r": 35, "max_r": 35, "clip": 35, "min": 35, "max": 35, "env_fn": 35, "callabl": 35, "copi": 35, "bool": 35, "serial": 35, "deque_s": 35, "keep": 35, "track": 35, "cumul": 35, "length": 35, "complet": 35, "look": 35, "dr": 35, "contrari": 35, "t": 35, "elaps": 35, "begin": 35, "output": 35, "form": 35, "care": 35, "befor": 35, "morewordstatist": 35, "final_observ": 35, "num": 35, "_final_observ": 35, "final_info": 35, "_final_info": 35, "num_env": 35, "dim_reward": 35, "_episod": 35}, "objects": 
{"mo_gymnasium": [[35, 0, 1, "", "LinearReward"], [35, 0, 1, "", "MOClipReward"], [35, 0, 1, "", "MONormalizeReward"], [35, 0, 1, "", "MORecordEpisodeStatistics"], [35, 0, 1, "", "MOSyncVectorEnv"]]}, "objtypes": {"0": "py:class"}, "objnames": {"0": ["py", "class", "Python class"]}, "titleterms": {"404": 0, "page": [0, 33], "Not": 0, "found": 0, "The": 0, "request": 0, "could": 0, "mo": [1, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 33], "gymnasium": [1, 33], "document": [1, 33], "build": 1, "commun": 3, "acknowledg": 3, "avail": 4, "environ": [4, 33, 34], "breakabl": [5, 33], "bottl": [5, 33], "descript": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 23, 24, 25, 26], "action": [5, 6, 7, 8, 9, 10, 11, 12, 13, 23, 25, 26], "space": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26], "observ": [5, 6, 7, 8, 9, 10, 11, 12, 13, 23, 25, 26], "reward": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26], "start": [5, 6, 7, 8, 9, 10, 11, 12, 13, 25, 26], "state": [5, 6, 7, 8, 9, 10, 11, 12, 13, 25, 26], "episod": [5, 6, 7, 8, 10, 11, 12, 13, 24, 25], "termin": [5, 6, 7, 8, 9, 10, 11, 12, 13, 24, 25], "argument": [5, 6, 7, 8, 9, 10, 12, 13, 26], "credit": [5, 6, 7, 8, 9, 10, 12, 13, 25, 26], "deep": [6, 7, 8], "sea": [6, 7, 8], "treasur": [6, 7, 8], "concav": 7, "mirror": 8, "fishwood": 9, "four": 10, "room": 10, "refer": 10, "fruit": 11, "tree": 11, "minecart": [12, 13], "determinist": 13, "halfcheetah": 14, "highwai": [15, 16], "fast": 16, "hopper": [17, 18], "2d": 18, "lunar": [19, 20], "lander": [19, 20], "continu": 20, "mountaincar": 21, "mountaincarcontinu": 22, "reacher": [23, 33], "supermario": 24, "resourc": 25, "gather": 25, "water": 26, "reservoir": 26, "citat": 27, "morl": 28, "baselin": 28, "list": 29, "public": 29, "api": [30, 31], "instal": [30, 32], "cite": 30, "releas": 33, "note": 33, "v1": 33, "0": 33, "1": [33, 34], "support": 33, "29": 33, "pygam": 33, "render": 33, "more": 33, "becom": 
33, "matur": 33, "v0": 33, "3": [33, 34], "4": [33, 34], "known": 33, "pareto": 33, "front": 33, "improv": 33, "polici": 33, "evalu": 33, "bug": 33, "fix": 33, "better": 33, "2": [33, 34], "webpag": 33, "mujoco": 33, "migrat": 33, "creat": 34, "custom": 34, "new": 34, "class": 34, "regist": 34, "test": 34, "instanti": 34, "your": 34, "wrapper": 35, "linearreward": 35, "monormalizereward": 35, "moclipreward": 35, "mosyncvectorenv": 35, "morecordepisodestatist": 35}, "envversion": {"sphinx.domains.c": 3, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 9, "sphinx.domains.index": 1, "sphinx.domains.javascript": 3, "sphinx.domains.math": 2, "sphinx.domains.python": 4, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx": 60}, "alltitles": {"404 - Page Not Found": [[0, "page-not-found"]], "The requested page could not be found.": [[0, "the-requested-page-could-not-be-found"]], "MO-Gymnasium documentation": [[1, "mo-gymnasium-documentation"]], "Build the Documentation": [[1, "build-the-documentation"]], "Community": [[3, "community"]], "Acknowledgements": [[3, "acknowledgements"]], "Available environments": [[4, "available-environments"]], "Breakable-Bottles": [[5, "breakable-bottles"]], "Description": [[5, "description"], [6, "description"], [7, "description"], [8, "description"], [9, "description"], [10, "description"], [11, "description"], [12, "description"], [13, "description"], [14, "description"], [15, "description"], [17, "description"], [18, "description"], [19, "description"], [20, "description"], [23, "description"], [24, "description"], [25, "description"], [26, "description"]], "Action Space": [[5, "action-space"], [6, "action-space"], [7, "action-space"], [8, "action-space"], [9, "action-space"], [10, "action-space"], [11, "action-space"], [12, "action-space"], [13, "action-space"], [23, "action-space"], [25, "action-space"], [26, "action-space"]], "Observation Space": [[5, "observation-space"], [6, 
"observation-space"], [7, "observation-space"], [8, "observation-space"], [9, "observation-space"], [10, "observation-space"], [11, "observation-space"], [12, "observation-space"], [13, "observation-space"], [23, "observation-space"], [25, "observation-space"], [26, "observation-space"]], "Reward Space": [[5, "reward-space"], [6, "reward-space"], [7, "reward-space"], [8, "reward-space"], [9, "reward-space"], [10, "reward-space"], [11, "reward-space"], [12, "reward-space"], [13, "reward-space"], [14, "reward-space"], [15, "reward-space"], [17, "reward-space"], [18, "reward-space"], [19, "reward-space"], [20, "reward-space"], [23, "reward-space"], [24, "reward-space"], [25, "reward-space"], [26, "reward-space"]], "Starting State": [[5, "starting-state"], [6, "starting-state"], [7, "starting-state"], [8, "starting-state"], [9, "starting-state"], [10, "starting-state"], [11, "starting-state"], [12, "starting-state"], [13, "starting-state"], [25, "starting-state"], [26, "starting-state"]], "Episode Termination": [[5, "episode-termination"], [6, "episode-termination"], [7, "episode-termination"], [8, "episode-termination"], [10, "episode-termination"], [11, "episode-termination"], [12, "episode-termination"], [13, "episode-termination"], [24, "episode-termination"], [25, "episode-termination"]], "Arguments": [[5, "arguments"], [6, "arguments"], [7, "arguments"], [8, "arguments"], [9, "arguments"], [10, "arguments"], [12, "arguments"], [13, "arguments"], [26, "arguments"]], "Credits": [[5, "credits"], [6, "credits"], [7, "credits"], [8, "credits"], [9, "credits"], [10, "credits"], [12, "credits"], [13, "credits"], [25, "credits"], [26, "credits"]], "Deep-Sea-Treasure": [[6, "deep-sea-treasure"]], "Deep-Sea-Treasure-Concave": [[7, "deep-sea-treasure-concave"]], "Deep-Sea-Treasure-Mirrored": [[8, "deep-sea-treasure-mirrored"]], "Fishwood": [[9, "fishwood"]], "Termination": [[9, "termination"]], "Four-Room": [[10, "four-room"]], "References": [[10, "references"]], 
"Fruit-Tree": [[11, "fruit-tree"]], "Minecart": [[12, "minecart"]], "Minecart-Deterministic": [[13, "minecart-deterministic"]], "MO-Halfcheetah": [[14, "mo-halfcheetah"]], "MO-Highway": [[15, "mo-highway"]], "MO-Highway-Fast": [[16, "mo-highway-fast"]], "MO-Hopper": [[17, "mo-hopper"]], "MO-Hopper-2D": [[18, "mo-hopper-2d"]], "MO-Lunar-Lander": [[19, "mo-lunar-lander"]], "MO-Lunar-Lander-Continuous": [[20, "mo-lunar-lander-continuous"]], "MO-Mountaincar": [[21, "mo-mountaincar"]], "Reward space:": [[21, "reward-space"], [22, "reward-space"]], "MO-Mountaincarcontinuous": [[22, "mo-mountaincarcontinuous"]], "MO-Reacher": [[23, "mo-reacher"]], "MO-Supermario": [[24, "mo-supermario"]], "Resource-Gathering": [[25, "resource-gathering"]], "Water-Reservoir": [[26, "water-reservoir"]], "Citation": [[27, "citation"]], "MORL Baselines": [[28, "morl-baselines"]], "List of Publications": [[29, "list-of-publications"]], "API": [[30, "api"], [31, "api"]], "Install": [[30, "install"], [32, "install"]], "Citing": [[30, "citing"]], "Release Notes": [[33, "release-notes"]], "v1.0.1: MO-Gymnasium 1.0.1 Release: Support Gymnasium 0.29, breakable-bottles pygame render, and more": [[33, "release-v1-0-1"]], "v1.0.0: MO-Gymnasium becomes mature": [[33, "release-v1-0-0"]], "v0.3.4: MO-Gymnasium 0.3.4 Release: Known Pareto Front, improved renders and documentation": [[33, "release-v0-3-4"]], "v0.3.3: MO-Gymnasium 0.3.3 Release: Policy Evaluation bug fix, better documentation page": [[33, "release-v0-3-3"]], "MO-Gymnasium 0.3.2 Release: Bug fixes, improved webpage": [[33, "release-0-3-2"]], "MO-Gymnasium 0.3.1 Release: Improved documentation and MuJoco MO-Reacher environment": [[33, "release-0-3-1"]], "MO-Gymnasium 0.3.0 Release: Migrate to Gymnasium": [[33, "release-0-3-0"]], "0.2.1": [[33, "release-0-2-1"]], "0.2.0": [[33, "release-0-2-0"]], "0.1.2": [[33, "release-0-1-2"]], "0.1.1": [[33, "release-0-1-1"]], "Creating a custom environment": [[34, "creating-a-custom-environment"]], "1. 
Create a new environment class": [[34, "create-a-new-environment-class"]], "2. Register the environment": [[34, "register-the-environment"]], "3. Test the environment": [[34, "test-the-environment"]], "4. Instantiate your environment": [[34, "instantiate-your-environment"]], "Wrappers": [[35, "wrappers"]], "LinearReward": [[35, "linearreward"]], "MONormalizeReward": [[35, "monormalizereward"]], "MOClipReward": [[35, "moclipreward"]], "MOSyncVectorEnv": [[35, "mosyncvectorenv"]], "MORecordEpisodeStatistics": [[35, "morecordepisodestatistics"]]}, "indexentries": {"linearreward (class in mo_gymnasium)": [[35, "mo_gymnasium.LinearReward"]], "moclipreward (class in mo_gymnasium)": [[35, "mo_gymnasium.MOClipReward"]], "monormalizereward (class in mo_gymnasium)": [[35, "mo_gymnasium.MONormalizeReward"]], "morecordepisodestatistics (class in mo_gymnasium)": [[35, "mo_gymnasium.MORecordEpisodeStatistics"]], "mosyncvectorenv (class in mo_gymnasium)": [[35, "mo_gymnasium.MOSyncVectorEnv"]]}})
\ No newline at end of file
+Search.setIndex({"docnames": ["404", "README", "citing/citing", "community/community", "environments/all-environments", "environments/breakable-bottles", "environments/deep-sea-treasure", "environments/deep-sea-treasure-concave", "environments/deep-sea-treasure-mirrored", "environments/fishwood", "environments/four-room", "environments/fruit-tree", "environments/minecart", "environments/minecart-deterministic", "environments/mo-halfcheetah", "environments/mo-highway", "environments/mo-highway-fast", "environments/mo-hopper", "environments/mo-hopper-2d", "environments/mo-lunar-lander", "environments/mo-lunar-lander-continuous", "environments/mo-mountaincar", "environments/mo-mountaincarcontinuous", "environments/mo-reacher", "environments/mo-supermario", "environments/resource-gathering", "environments/water-reservoir", "examples/citation", "examples/morl_baselines", "examples/publications", "index", "introduction/api", "introduction/install", "release_notes", "tutorials/custom_env", "wrappers/wrappers"], "filenames": ["404.md", "README.md", "citing/citing.md", "community/community.md", "environments/all-environments.md", "environments/breakable-bottles.md", "environments/deep-sea-treasure.md", "environments/deep-sea-treasure-concave.md", "environments/deep-sea-treasure-mirrored.md", "environments/fishwood.md", "environments/four-room.md", "environments/fruit-tree.md", "environments/minecart.md", "environments/minecart-deterministic.md", "environments/mo-halfcheetah.md", "environments/mo-highway.md", "environments/mo-highway-fast.md", "environments/mo-hopper.md", "environments/mo-hopper-2d.md", "environments/mo-lunar-lander.md", "environments/mo-lunar-lander-continuous.md", "environments/mo-mountaincar.md", "environments/mo-mountaincarcontinuous.md", "environments/mo-reacher.md", "environments/mo-supermario.md", "environments/resource-gathering.md", "environments/water-reservoir.md", "examples/citation.md", "examples/morl_baselines.md", "examples/publications.md", 
"index.md", "introduction/api.md", "introduction/install.md", "release_notes.md", "tutorials/custom_env.md", "wrappers/wrappers.md"], "titles": ["404 - Page Not Found", "MO-Gymnasium documentation", "<no title>", "Community", "Available environments", "Breakable-Bottles", "Deep-Sea-Treasure", "Deep-Sea-Treasure-Concave", "Deep-Sea-Treasure-Mirrored", "Fishwood", "Four-Room", "Fruit-Tree", "Minecart", "Minecart-Deterministic", "MO-Halfcheetah", "MO-Highway", "MO-Highway-Fast", "MO-Hopper", "MO-Hopper-2D", "MO-Lunar-Lander", "MO-Lunar-Lander-Continuous", "MO-Mountaincar", "MO-Mountaincarcontinuous", "MO-Reacher", "MO-Supermario", "Resource-Gathering", "Water-Reservoir", "Citation", "MORL Baselines", "List of Publications", "API", "API", "Install", "Release Notes", "Creating a custom environment", "Wrappers"], "terms": {"thi": [1, 5, 27, 30, 31, 32, 33, 34, 35], "folder": 1, "contain": [1, 4, 5, 10, 11, 12, 13, 21, 22, 23, 28], "For": [1, 23, 30, 31, 33, 35], "more": [1, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 24, 30, 31], "inform": [1, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 24], "about": 1, "how": [1, 12, 13, 24, 28], "contribut": [1, 3, 5, 33], "go": [1, 9, 11, 17, 18, 21], "our": [1, 33, 34], "md": [1, 33], "instal": 1, "requir": 1, "packag": [1, 33], "your": [1, 27, 30], "fork": 1, "pip": [1, 30, 32], "r": [1, 35], "doc": [1, 33], "txt": 1, "e": [1, 5, 6, 7, 8, 22, 27, 30, 33], "To": [1, 6, 7, 8, 30, 32], "onc": 1, "cd": 1, "make": [1, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 30, 31, 33, 34, 35], "dirhtml": 1, "rebuild": 1, "automat": [1, 33, 34], "everi": [1, 4, 11], "time": [1, 4, 5, 6, 7, 8, 12, 13, 21, 22, 24, 35], "chang": [1, 6, 7, 8, 33], "i": [1, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 28, 30, 31, 33, 34, 35], "made": [1, 33], "sphinx": 1, "autobuild": 1, "b": 1, "_build": 1, "If": [3, 12, 13, 17, 18, 26, 27, 30, 34], "you": [3, 27, 30, 31, 32, 33, 34], "want": 3, 
"help": 3, "u": 3, "out": [3, 26], "reach": [3, 4, 6, 7, 8, 10, 11, 21, 22, 24, 33], "simpli": 3, "ask": 3, "question": 3, "can": [3, 5, 9, 11, 12, 13, 26, 30, 31, 32, 33], "join": [3, 30], "farama": [3, 12, 13, 30, 33], "discord": [3, 30], "server": [3, 30], "here": [3, 10, 30, 31, 33], "asid": 3, "from": [3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 25, 26, 33, 34, 35], "main": [3, 12, 13, 19, 20], "contributor": [3, 33], "some": [3, 30, 32, 34], "peopl": 3, "have": [3, 4, 5, 30, 33], "also": [3, 30, 31, 33], "project": 3, "variou": [3, 28], "wai": [3, 10], "we": [3, 30, 33, 34], "would": 3, "like": [3, 30, 32, 35], "thank": 3, "them": [3, 5, 30, 31], "all": [3, 25, 30, 32, 33, 35], "The": [3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 30, 33, 34], "minecart": [3, 4, 30, 31, 33], "v0": [3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, 21, 22, 23, 24, 25, 26, 30, 31], "env": [3, 4, 12, 13, 14, 15, 17, 18, 19, 20, 21, 30, 31, 33, 34, 35], "refactor": [3, 12, 13], "http": [3, 5, 6, 7, 8, 12, 13, 25, 26, 30], "github": [3, 12, 13, 33], "com": [3, 12, 13], "axelabel": 3, "dynmorl": 3, "deep": [3, 4, 33], "sea": [3, 4, 33], "treasur": [3, 4, 5, 25, 33], "fruit": [3, 4, 33], "tree": [3, 4, 26, 33], "mo": [3, 4, 12, 13, 27, 28, 29, 30, 31, 32, 34, 35], "supermario": [3, 4], "ar": [3, 4, 5, 12, 13, 25, 26, 33, 34, 35], "base": [3, 4, 5, 12, 13, 23, 26, 33], "runzheyang": 3, "morl": [3, 4, 6, 7, 8, 9, 30, 31, 33], "four": [3, 4, 23], "room": [3, 4], "mike": [3, 10], "gimelfarb": [3, 10], "successor": [3, 10], "featur": [3, 10, 33], "transfer": [3, 10], "fishwood": [3, 4, 33], "code": [3, 6, 7, 8, 9, 10, 12, 13, 26, 33], "wa": [3, 5, 6, 7, 8, 12, 13, 29, 33], "provid": [3, 9, 30, 33], "deni": [3, 9], "steckelmach": [3, 9], "conor": 3, "f": [3, 26], "hay": 3, "water": [3, 4, 11, 33], "reservoir": [3, 4, 33], "mathieu": [3, 26], "reymond": [3, 26], "gymnasium": [4, 12, 13, 14, 17, 18, 19, 20, 21, 28, 29, 30, 31, 32, 34, 35], "includ": [4, 30, 32], 
"taken": 4, "literatur": [4, 33], "well": [4, 30, 33, 34], "multi": [4, 9, 14, 15, 16, 17, 18, 19, 20, 21, 24, 26, 27, 29, 30, 31, 33, 34], "object": [4, 9, 14, 15, 16, 17, 18, 19, 20, 21, 24, 26, 27, 29, 30, 31, 33, 34], "version": [4, 10, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 33], "classic": [4, 6, 7, 8, 34], "mujoco": [4, 23, 30, 32], "ob": [4, 30, 31, 33], "action": [4, 14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 33, 34], "space": [4, 16, 33, 34], "descript": 4, "discret": [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, 19, 21, 23, 24, 25, 33], "time_penalti": [4, 5], "agent": [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 25, 26, 33], "submarin": [4, 6, 7, 8], "must": [4, 12, 13, 34], "collect": [4, 9, 10, 11, 12, 13, 24, 25, 33], "while": [4, 5, 10], "take": 4, "account": 4, "penalti": [4, 5, 6, 7, 8, 21, 22, 24], "valu": [4, 6, 7, 8, 11], "yang": [4, 6, 7, 8, 11], "et": [4, 6, 7, 8, 10, 11, 12, 13, 29], "al": [4, 6, 7, 8, 10, 11, 12, 13, 29], "2019": [4, 6, 7, 8, 11, 12, 13], "concav": [4, 33], "vamplew": 4, "2010": 4, "mirror": 4, "harder": 4, "dst": [4, 33], "felten": [4, 27, 29, 30], "2022": [4, 27, 29, 30, 33], "resourc": [4, 33], "gather": [4, 33], "enemi": [4, 5, 24, 25], "gold": [4, 5, 25], "gem": [4, 5, 25], "10": [4, 6, 7, 8, 11, 25, 26], "chanc": 4, "kill": [4, 24, 25], "barret": 4, "narayanan": [4, 25], "2008": [4, 25], "fish_amount": 4, "wood_amount": 4, "esr": [4, 33], "fish": [4, 9], "wood": [4, 9], "light": 4, "fire": 4, "eat": 4, "roijer": 4, "2018": 4, "breakabl": 4, "bottl": 4, "dictionari": [4, 5], "bottles_deliv": [4, 5], "potenti": [4, 5, 33], "gridworld": [4, 10], "5": [4, 5, 11, 12, 13, 15, 16, 19, 20, 24, 25, 33], "cell": 4, "sourc": [4, 6, 7, 8, 10, 12, 13, 22, 30], "locat": [4, 5, 23], "deliv": [4, 5], "destin": 4, "2021": 4, "nutri1": 4, "nutri6": 4, "full": [4, 11, 33], "binari": [4, 10, 11], "depth": [4, 11], "d": [4, 11], "6": [4, 11, 12, 13, 14, 21, 22, 23], "7": [4, 6, 11, 12, 13, 33], "leaf": [4, 11], "nutrient": [4, 11], "protein": [4, 
11], "carb": [4, 11], "fat": [4, 11], "vitamin": [4, 11], "miner": [4, 11], "continu": [4, 6, 7, 8, 22, 26, 33], "cost_flood": 4, "deficit_wat": 4, "A": [4, 5, 10, 16, 21, 22, 26, 27, 29, 30, 31, 35], "execut": [4, 26], "correspond": [4, 26], "amount": [4, 26], "releas": [4, 26], "dam": [4, 26], "pianosi": [4, 26], "2013": 4, "item1": 4, "item2": 4, "item3": 4, "three": 4, "differ": 4, "type": [4, 12, 13], "item": [4, 10], "map": [4, 6, 7, 8, 10, 12, 13], "goal": [4, 10, 21, 22], "alegr": [4, 27, 29, 30], "mountaincar": [4, 22], "reverse_penalti": 4, "forward_penalti": 4, "mountain": [4, 21, 22, 33], "car": [4, 33], "extra": 4, "forward": [4, 14, 17, 18, 21], "revers": [4, 5, 21], "2011": 4, "mountaincarcontinu": [4, 33], "fuel_consumption_penalti": 4, "fuel": [4, 12, 13, 19, 20, 22], "consumpt": [4, 12, 13], "lunar": [4, 33], "lander": [4, 33], "v2": [4, 19, 20, 33], "land": [4, 19, 20], "shaped_reward": 4, "main_engine_fuel": 4, "side_engine_fuel": 4, "lunarland": [4, 19, 20], "defin": [4, 5, 23, 34], "similarli": 4, "hung": 4, "imag": [4, 12, 13, 26], "ore1": 4, "ore2": 4, "two": [4, 9, 12, 13, 23, 24, 26], "ores": [4, 12, 13], "minim": [4, 12, 13], "abel": [4, 12, 13], "highwai": [4, 33], "fast": [4, 33], "speed": [4, 12, 13, 15, 33], "right_lan": 4, "collis": [4, 15], "": [4, 6, 7, 8, 10, 12, 13, 14, 17, 18, 19, 20, 21, 30, 31, 35], "high": [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 33], "avoid": [4, 10], "neighbour": 4, "vehicl": 4, "stai": 4, "rightest": 4, "lane": [4, 15], "x_po": 4, "death": 4, "coin": [4, 24], "warn": [4, 33], "supermariobrosenv": 4, "support": 4, "limit": 4, "reacher": 4, "v4": [4, 14, 17, 18, 23, 33], "target_1": 4, "target_2": 4, "target_3": 4, "target_4": 4, "hopper": [4, 33], "veloc": [4, 23], "height": 4, "energi": 4, "halfcheetah": [4, 33], "similar": 4, "xu": 4, "2020": 4, "3": [5, 6, 7, 8, 10, 12, 13, 15, 16, 17, 18, 19, 20, 21, 23, 24, 25, 26], "dict": 5, "bottles_carri": 5, "2": [5, 
6, 7, 8, 9, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 30, 31], "bottles_drop": 5, "multibinari": 5, "shape": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 34, 35], "0": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 30, 31, 35], "50": [5, 33], "low": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26], "inf": [5, 14, 15, 16, 17, 18, 19, 20, 23, 24, 26], "1": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 35], "import": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 30, 31, 33, 34], "mo_gymnasium": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 30, 31, 33, 34, 35], "environ": [5, 6, 7, 8, 9, 10, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 29, 30, 31, 32, 35], "implement": [5, 28, 34], "problem": [5, 6, 7, 8, 9, 26, 33], "unbreakablebottl": 5, "breakablebottl": 5, "section": 5, "4": [5, 6, 7, 8, 10, 12, 13, 19, 20, 23, 24, 25, 26], "paper": 5, "multiobject": 5, "reinforc": [5, 9, 10, 27, 29, 30, 31, 33], "learn": [5, 9, 10, 25, 27, 29, 30, 31, 33], "approach": 5, "impact": 5, "ai": 5, "safeti": 5, "move": [5, 24, 25, 33], "left": [5, 6, 7, 8, 10, 11, 12, 13, 25], "right": [5, 6, 7, 8, 10, 11, 12, 13, 15, 25], "pick": 5, "up": [5, 6, 7, 8, 10, 25, 26], "kei": 5, "current": [5, 6, 7, 8, 26], "number": [5, 26], "carri": 5, "ha": [5, 24, 33], "each": [5, 6, 7, 8, 11, 12, 13, 21, 22, 23, 33], "boolean": [5, 35], "flag": [5, 17, 18, 24, 25, 34], "indic": [5, 10, 25, 33], "dimens": [5, 33], "step": [5, 6, 7, 8, 9, 21, 22, 24, 26, 29, 30, 31, 33, 34], "bottle_reward": 5, "multipl": [5, 25, 28, 33, 34, 35], "small": [5, 33], "probabl": [5, 9], "drop": [5, 33], "appli": [5, 23, 35], "ground": 5, "when": [5, 6, 7, 8, 9, 10, 11, 12, 13, 24, 25, 33, 34], "size": [5, 11, 34], "prob_drop": 5, "unbreakable_bottl": 5, "true": [5, 6, 7, 
8, 12, 13, 26, 35], "which": [5, 6, 7, 8, 9, 10, 30, 33, 34, 35], "again": 5, "so": 5, "outcom": 5, "otherwis": [5, 9, 24], "cannot": 5, "origin": [5, 30, 31, 33], "robert": 5, "klassert": 5, "home": [5, 25], "asset": [5, 25, 33], "limezu": [5, 25], "itch": [5, 6, 7, 8, 25], "io": [5, 6, 7, 8, 25], "serenevillagerevamp": [5, 25], "ninjikin": [5, 25], "pixel": [5, 6, 7, 8, 33], "art": [5, 6, 7, 8, 33], "creat": [5, 6, 7, 8, 30, 31, 33], "assist": [5, 6, 7, 8], "dall": [5, 6, 7, 8], "11": [6, 7, 17, 18, 33], "23": [6, 33], "control": [6, 7, 8, 9, 14, 17, 18], "2d": [6, 7, 8, 10, 12, 13, 22, 33, 35], "grid": [6, 7, 8], "world": [6, 7, 8], "box": [6, 7, 8, 14, 17, 18, 20, 22, 26], "x": [6, 7, 8, 17, 18, 24, 25], "y": [6, 7, 8, 25], "coordin": [6, 7, 8, 25, 30], "where": [6, 7, 8, 9, 11, 21, 22, 33], "down": [6, 7, 8, 10, 25], "dimension": [6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 23, 24, 25], "posit": [6, 7, 8, 10, 12, 13, 23, 24, 25], "alwai": [6, 7, 8], "same": [6, 7, 8], "dst_map": [6, 7, 8], "default": [6, 7, 8, 12, 13, 26], "convex": [6, 7, 8], "us": [6, 7, 8, 12, 13, 26, 27, 28, 29, 30, 31, 32, 33, 34], "deepseatreasur": [6, 7, 8], "concave_map": [6, 7, 8], "mirrored_map": [6, 7, 8], "float_stat": [6, 7, 8], "adapt": [6, 7, 8, 10], "background": [6, 7, 8, 26], "ansimuz": [6, 7, 8], "underwat": [6, 7, 8], "fantasi": [6, 7, 8], "124": [7, 8], "20": [8, 33], "simpl": [9, 30, 31], "fisherman": 9, "either": 9, "expect": 9, "util": [9, 33], "return": [9, 12, 13, 25, 30, 33, 34, 35], "woodproba": 9, "fishproba": 9, "episod": [9, 26, 35], "end": [9, 12, 13], "after": [9, 33, 35], "max_t": 9, "200": 9, "catch": 9, "14": [10, 19, 20, 33], "13": [10, 33], "introduc": [10, 33], "an": [10, 24, 25, 26, 30, 31, 33, 34, 35], "those": 10, "neg": [10, 22, 23], "travel": 10, "fix": 10, "split": 10, "separ": 10, "wall": 10, "passag": 10, "barreto": 10, "andr\u00e9": 10, "nip": 10, "2017": 10, "plu": 10, "vector": [10, 11, 12, 13, 21, 22, 24, 30, 33, 34, 35], "were": 10, 
"follow": [10, 12, 13, 30, 31, 33, 34], "compon": [10, 33, 35], "blue": 10, "squar": 10, "els": [10, 24, 25], "green": 10, "triangl": 10, "red": 10, "circl": 10, "lower": 10, "g": [10, 33], "maze": 10, "arrai": [10, 30, 31, 33, 34, 35], "see": [10, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 24, 30, 31, 33, 34], "exampl": [10, 28, 30, 31, 33], "63": 11, "chose": 11, "node": 11, "therefor": 11, "receiv": [11, 33], "root": [11, 33], "cart": [12, 13], "sin": [12, 13, 23], "co": [12, 13, 23], "orient": [12, 13], "porcentag": [12, 13], "capac": [12, 13], "fill": [12, 13], "image_observ": [12, 13], "3d": [12, 13, 21], "mine": [12, 13], "acceler": [12, 13], "brake": [12, 13], "none": [12, 13, 26, 33, 35], "quantiti": [12, 13], "first": [12, 13, 26, 29, 33, 35], "minerium": [12, 13], "retriev": [12, 13], "spars": [12, 13], "second": [12, 13], "consum": [12, 13], "dens": [12, 13], "upper": [12, 13], "corner": [12, 13], "render_mod": [12, 13, 26], "render": [12, 13, 26, 34], "mode": [12, 13, 26], "rgb_arrai": [12, 13, 26], "human": [12, 13, 26], "rgb": [12, 13], "frame_skip": [12, 13], "mani": [12, 13], "repeat": [12, 13], "incremental_frame_skip": [12, 13], "whether": [12, 13, 26], "increment": [12, 13], "config": [12, 13], "path": [12, 13], "json": [12, 13], "configur": [12, 13], "file": [12, 13, 33, 34], "foundat": [12, 13, 33], "blob": [12, 13], "mine_config": [12, 13], "axel": [12, 13], "float32": [14, 17, 18, 20, 22, 26], "observ": [14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 33, 34], "17": 14, "halfcheetahenv": 14, "run": [14, 34, 35], "cost": [14, 17, 18, 19, 20, 26], "highwayenv": 15, "reward": [16, 30, 31, 33, 34, 35], "highwayfastenv": 16, "hopperenv": [17, 18], "axi": [17, 18], "jump": [17, 18], "z": [17, 18], "cost_object": [17, 18], "set": [17, 18, 30, 33, 34], "fals": [17, 18], "ad": [17, 18, 34], "other": [17, 18, 30, 31, 33, 35], "8": [19, 20, 26, 30, 31, 33], "100": [19, 20, 24, 35], "crash": [19, 20], "successfulli": [19, 20], "engin": [19, 20], "side": [19, 20], 
"07": [21, 22], "top": [21, 22], "norm": 22, "9": 23, "angl": 23, "central": 23, "elbow": 23, "joint": [23, 26], "angular": 23, "possibl": 23, "zero": [23, 33], "torqu": 23, "distanc": [23, 34], "tip": 23, "arm": 23, "target": 23, "comput": [23, 33, 34], "r_i": 23, "finger_tip_coord": 23, "target_i": 23, "256": 24, "240": 24, "255": 24, "25": [24, 33], "supermariobro": 24, "gym": [24, 27, 29, 30, 31, 33], "super": 24, "mario": [24, 33], "bro": 24, "far": 24, "much": 24, "pass": [24, 33], "between": [24, 26, 30, 33, 35], "di": 24, "point": 24, "barrett": 25, "leon": 25, "srini": 25, "optim": [25, 29, 33], "polici": [25, 29], "criteria": 25, "proceed": [25, 27, 30], "25th": 25, "intern": [25, 26], "confer": [25, 26, 27, 30], "machin": 25, "41": [25, 33], "47": [25, 33], "1145": 25, "1390156": 25, "1390162": 25, "consist": 25, "element": [25, 33], "diamond": 25, "castelletti": 26, "m": 26, "restelli": 26, "fit": 26, "q": 26, "iter": [26, 35], "markov": 26, "decis": 26, "2012": 26, "neural": 26, "network": 26, "ijcnn": 26, "brisban": 26, "qld": 26, "australia": 26, "pp": 26, "doi": 26, "1109": 26, "6252759": 26, "float": [26, 34, 35], "level": [26, 33], "normalized_act": 26, "percentag": 26, "There": 26, "due": 26, "excess": 26, "wrt": 26, "flood": 26, "threshold": 26, "upstream": 26, "deficit": 26, "suppli": 26, "demand": 26, "hydroelectr": 26, "downstream": 26, "By": 26, "onli": 26, "initi": [26, 34], "random": 26, "160": 26, "ansi": 26, "time_limit": 26, "maximum": 26, "until": 26, "truncat": [26, 30, 31, 33, 34], "nO": 26, "penal": 26, "select": 26, "bound": [26, 33], "normal": [26, 33, 35], "initial_st": 26, "port": 26, "simon": 26, "parisi": 26, "sky": 26, "paulina": 26, "riva": 26, "opengameart": 26, "org": [26, 30], "content": 26, "repositori": [27, 28, 30, 34], "research": [27, 29, 30], "pleas": [27, 29, 30], "cite": 27, "inproceed": [27, 30], "2022bnaic": [27, 30], "author": [27, 30], "luca": [27, 30], "n": [27, 30], "florian": [27, 30], "el": [27, 30], 
"ghazali": [27, 30], "talbi": [27, 30], "gr": [27, 30], "goir": [27, 30], "danoi": [27, 30], "ann": [27, 30], "now": [27, 30, 33], "ana": [27, 30], "l": [27, 30, 35], "c": [27, 30], "bazzan": [27, 30], "bruno": [27, 30], "da": [27, 30], "silva": [27, 30], "titl": [27, 30], "librari": [27, 29, 30, 33], "booktitl": [27, 30], "34th": [27, 30], "benelux": [27, 30], "artifici": [27, 30], "intellig": [27, 30], "bnaic": [27, 29, 30], "benelearn": [27, 30], "year": [27, 30], "algorithm": [28, 30], "under": [28, 33], "api": [28, 33, 34], "It": [28, 30, 31, 33, 34], "wrapper": [28, 30, 31, 33], "formerli": 29, "publish": 29, "submiss": 29, "open": [29, 30], "pull": [29, 34], "request": 29, "add": [29, 33], "miss": [29, 33], "entri": 29, "sampl": 29, "effici": 29, "via": 29, "gener": [29, 33, 34], "improv": 29, "priorit": 29, "aama": 29, "2023": [29, 33], "hyperparamet": 29, "modem": 29, "workshop": 29, "leverag": 29, "approxim": 29, "model": [29, 30, 31], "neurip": 29, "toolkit": 29, "reliabl": 29, "benchmark": 29, "standard": [30, 33], "suit": 30, "python": [30, 31, 33], "develop": 30, "compar": 30, "commun": [30, 33], "compliant": 30, "essenti": [30, 34], "numpi": [30, 31, 33, 34, 35], "document": [30, 34], "websit": [30, 33], "public": 30, "work": 30, "gg": 30, "bnj6kubtg6": 30, "As": [30, 31], "class": [30, 31, 35], "instanc": [30, 31, 33], "interact": [30, 31], "veri": [30, 31, 34], "mo_gym": [30, 31, 33], "np": [30, 31, 33], "info": [30, 31, 33, 34, 35], "reset": [30, 31, 33, 34], "vector_reward": [30, 31, 33], "next_ob": [30, 31, 33], "termin": [30, 31, 33, 34], "your_ag": [30, 31, 33], "act": [30, 31, 33], "option": [30, 31, 33, 34], "scalar": [30, 31, 33, 35], "function": [30, 31, 33], "linearreward": [30, 31, 33], "weight": [30, 31, 33, 35], "detail": [30, 31], "mdp": [30, 31, 33], "momdp": [30, 31, 33], "definit": [30, 31], "practic": [30, 31], "guid": [30, 31], "plan": [30, 31], "check": [30, 31], "colab": [30, 31], "notebook": [30, 31], "doe": [30, 32, 35], 
"depend": [30, 32], "famili": [30, 32], "problemat": [30, 32], "certain": [30, 32], "system": [30, 32], "one": [30, 32], "08": [33, 35], "24": 33, "pypi": 33, "lucasalegr": 33, "75": 33, "momaxandskipobserv": 33, "76": 33, "modifi": 33, "part": [33, 34], "info_dict": 33, "ianleongudri": 33, "69": 33, "order": 33, "wrap": [33, 35], "morecordepisodestatist": 33, "ffelten": 33, "70": 33, "73": 33, "tuto": 33, "custom": 33, "creation": 33, "72": 33, "test": 33, "worker": 33, "67": 33, "pf": 33, "cc": 33, "determinist": 33, "74": 33, "changelog": 33, "06": 33, "12": 33, "thrill": 33, "design": 33, "expand": 33, "capabl": 33, "rl": 33, "scenario": 33, "need": 33, "mai": 33, "conflict": 33, "repres": 33, "distinct": 33, "In": 33, "context": 33, "trade": 33, "off": 33, "combin": 33, "linearli": 33, "predefin": 33, "shown": 33, "snippet": 33, "ctrl_cost": 33, "self": [33, 34], "control_cost": 33, "forward_reward": 33, "_forward_reward_weight": 33, "x_veloc": 33, "With": 33, "user": 33, "flexibl": 33, "determin": 33, "compromis": 33, "thei": [33, 35], "desir": 33, "prefer": 33, "consequ": 33, "do": 33, "thu": 33, "extend": 33, "vectori": 33, "illustr": 33, "domain": 33, "balanc": 33, "save": 33, "batteri": 33, "mark": 33, "within": [33, 34], "stabl": 33, "achiev": 33, "qualiti": 33, "allow": 33, "fall": 33, "back": 33, "singl": 33, "rang": 33, "inher": 33, "exhaust": 33, "list": 33, "avail": [33, 35], "addition": 33, "tailor": 33, "monormalizereward": 33, "linearwrapp": 33, "transform": 33, "new": 33, "bump": 33, "51": 33, "52": 33, "accept": 33, "55": 33, "updat": 33, "pre": 33, "commit": 33, "spell": 33, "mistak": 33, "56": 33, "method": [33, 34], "57": 33, "59": 33, "60": 33, "61": 33, "65": 33, "caus": 33, "randint": 33, "deprec": 33, "53": 33, "edit": 33, "button": 33, "58": 33, "62": 33, "logo": 33, "64": 33, "03": 33, "pareto_front": [33, 34], "get": 33, "45": 33, "43": 33, "baselin": 33, "readm": 33, "40": 33, "link": 33, "mgoulao": 33, "42": 33, "enabl": 33, "46": 
33, "02": 33, "ezpickl": 33, "34": 33, "37": 33, "convers": 33, "33": 33, "eval_mo": 33, "observationspac": 33, "dtype": 33, "35": 33, "gif": 33, "minor": 33, "directli": [33, 35], "26": 33, "pydoc": 33, "31": 33, "hotfix": 33, "27": 33, "vecenv": 33, "30": 33, "01": 33, "benefit": 33, "structur": 33, "should": [33, 34], "higher": 33, "integr": 33, "tool": 33, "break": 33, "been": 33, "renam": 33, "previous": 33, "call": 33, "reli": 33, "instead": 33, "16": 33, "access": 33, "deep_sea_treasur": 33, "gt": 33, "18": 33, "life": 33, "ci": 33, "setup": 33, "19": 33, "init": 33, "Of": 33, "conduct": 33, "21": 33, "09": 33, "revamp": 33, "lint": 33, "bib": 33, "citat": 33, "few": [33, 35], "bugfix": 33, "tutori": 34, "goe": 34, "through": 34, "sinc": [34, 35], "close": 34, "ti": 34, "refer": 34, "its": 34, "inherit": 34, "__init__": 34, "constructor": 34, "action_spac": 34, "observation_spac": 34, "attribut": 34, "moreov": 34, "deal": 34, "reward_spac": 34, "reward_dim": 34, "integ": 34, "seed": 34, "kwarg": 34, "perform": 34, "next": 34, "gamma": [34, 35], "discount": [34, 35], "pareto": 34, "front": 34, "known": 34, "oject": 34, "metric": 34, "invert": 34, "igd": 34, "registri": 34, "done": 34, "line": 34, "py": 34, "directori": 34, "registr": 34, "id": 34, "my_env_v0": 34, "entry_point": 34, "my_env_dir": 34, "my_env_fil": 34, "myenv": 34, "pytest": 34, "inspir": 35, "modul": 35, "ndarrai": 35, "dot": 35, "product": 35, "idx": 35, "int": 35, "99": 35, "epsilon": 35, "1e": 35, "index": 35, "touch": 35, "min_r": 35, "max_r": 35, "clip": 35, "min": 35, "max": 35, "env_fn": 35, "callabl": 35, "copi": 35, "bool": 35, "serial": 35, "deque_s": 35, "keep": 35, "track": 35, "cumul": 35, "length": 35, "complet": 35, "look": 35, "dr": 35, "contrari": 35, "t": 35, "elaps": 35, "begin": 35, "output": 35, "form": 35, "care": 35, "befor": 35, "morewordstatist": 35, "final_observ": 35, "num": 35, "_final_observ": 35, "final_info": 35, "_final_info": 35, "num_env": 35, "dim_reward": 
35, "_episod": 35}, "objects": {"mo_gymnasium": [[35, 0, 1, "", "LinearReward"], [35, 0, 1, "", "MOClipReward"], [35, 0, 1, "", "MONormalizeReward"], [35, 0, 1, "", "MORecordEpisodeStatistics"], [35, 0, 1, "", "MOSyncVectorEnv"]]}, "objtypes": {"0": "py:class"}, "objnames": {"0": ["py", "class", "Python class"]}, "titleterms": {"404": 0, "page": [0, 33], "Not": 0, "found": 0, "The": 0, "request": 0, "could": 0, "mo": [1, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 33], "gymnasium": [1, 33], "document": [1, 33], "build": 1, "commun": 3, "acknowledg": 3, "avail": 4, "environ": [4, 33, 34], "breakabl": [5, 33], "bottl": [5, 33], "descript": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 23, 24, 25, 26], "action": [5, 6, 7, 8, 9, 10, 11, 12, 13, 23, 25, 26], "space": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26], "observ": [5, 6, 7, 8, 9, 10, 11, 12, 13, 23, 25, 26], "reward": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26], "start": [5, 6, 7, 8, 9, 10, 11, 12, 13, 25, 26], "state": [5, 6, 7, 8, 9, 10, 11, 12, 13, 25, 26], "episod": [5, 6, 7, 8, 10, 11, 12, 13, 24, 25], "termin": [5, 6, 7, 8, 9, 10, 11, 12, 13, 24, 25], "argument": [5, 6, 7, 8, 9, 10, 12, 13, 26], "credit": [5, 6, 7, 8, 9, 10, 12, 13, 25, 26], "deep": [6, 7, 8], "sea": [6, 7, 8], "treasur": [6, 7, 8], "concav": 7, "mirror": 8, "fishwood": 9, "four": 10, "room": 10, "refer": 10, "fruit": 11, "tree": 11, "minecart": [12, 13], "determinist": 13, "halfcheetah": 14, "highwai": [15, 16], "fast": 16, "hopper": [17, 18], "2d": 18, "lunar": [19, 20], "lander": [19, 20], "continu": 20, "mountaincar": 21, "mountaincarcontinu": 22, "reacher": [23, 33], "supermario": 24, "resourc": 25, "gather": 25, "water": 26, "reservoir": 26, "citat": 27, "morl": 28, "baselin": 28, "list": 29, "public": 29, "api": [30, 31], "instal": [30, 32], "cite": 30, "releas": 33, "note": 33, "v1": 33, "0": 33, "1": [33, 34], "support": 33, "29": 33, "pygam": 33, 
"render": 33, "more": 33, "becom": 33, "matur": 33, "v0": 33, "3": [33, 34], "4": [33, 34], "known": 33, "pareto": 33, "front": 33, "improv": 33, "polici": 33, "evalu": 33, "bug": 33, "fix": 33, "better": 33, "2": [33, 34], "webpag": 33, "mujoco": 33, "migrat": 33, "creat": 34, "custom": 34, "new": 34, "class": 34, "regist": 34, "test": 34, "instanti": 34, "your": 34, "wrapper": 35, "linearreward": 35, "monormalizereward": 35, "moclipreward": 35, "mosyncvectorenv": 35, "morecordepisodestatist": 35}, "envversion": {"sphinx.domains.c": 3, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 9, "sphinx.domains.index": 1, "sphinx.domains.javascript": 3, "sphinx.domains.math": 2, "sphinx.domains.python": 4, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx": 60}, "alltitles": {"404 - Page Not Found": [[0, "page-not-found"]], "The requested page could not be found.": [[0, "the-requested-page-could-not-be-found"]], "MO-Gymnasium documentation": [[1, "mo-gymnasium-documentation"]], "Build the Documentation": [[1, "build-the-documentation"]], "Community": [[3, "community"]], "Acknowledgements": [[3, "acknowledgements"]], "Available environments": [[4, "available-environments"]], "Breakable-Bottles": [[5, "breakable-bottles"]], "Description": [[5, "description"], [6, "description"], [7, "description"], [8, "description"], [9, "description"], [10, "description"], [11, "description"], [12, "description"], [13, "description"], [14, "description"], [15, "description"], [17, "description"], [18, "description"], [19, "description"], [20, "description"], [23, "description"], [24, "description"], [25, "description"], [26, "description"]], "Action Space": [[5, "action-space"], [6, "action-space"], [7, "action-space"], [8, "action-space"], [9, "action-space"], [10, "action-space"], [11, "action-space"], [12, "action-space"], [13, "action-space"], [23, "action-space"], [25, "action-space"], [26, "action-space"]], "Observation Space": [[5, 
"observation-space"], [6, "observation-space"], [7, "observation-space"], [8, "observation-space"], [9, "observation-space"], [10, "observation-space"], [11, "observation-space"], [12, "observation-space"], [13, "observation-space"], [23, "observation-space"], [25, "observation-space"], [26, "observation-space"]], "Reward Space": [[5, "reward-space"], [6, "reward-space"], [7, "reward-space"], [8, "reward-space"], [9, "reward-space"], [10, "reward-space"], [11, "reward-space"], [12, "reward-space"], [13, "reward-space"], [14, "reward-space"], [15, "reward-space"], [17, "reward-space"], [18, "reward-space"], [19, "reward-space"], [20, "reward-space"], [23, "reward-space"], [24, "reward-space"], [25, "reward-space"], [26, "reward-space"]], "Starting State": [[5, "starting-state"], [6, "starting-state"], [7, "starting-state"], [8, "starting-state"], [9, "starting-state"], [10, "starting-state"], [11, "starting-state"], [12, "starting-state"], [13, "starting-state"], [25, "starting-state"], [26, "starting-state"]], "Episode Termination": [[5, "episode-termination"], [6, "episode-termination"], [7, "episode-termination"], [8, "episode-termination"], [10, "episode-termination"], [11, "episode-termination"], [12, "episode-termination"], [13, "episode-termination"], [24, "episode-termination"], [25, "episode-termination"]], "Arguments": [[5, "arguments"], [6, "arguments"], [7, "arguments"], [8, "arguments"], [9, "arguments"], [10, "arguments"], [12, "arguments"], [13, "arguments"], [26, "arguments"]], "Credits": [[5, "credits"], [6, "credits"], [7, "credits"], [8, "credits"], [9, "credits"], [10, "credits"], [12, "credits"], [13, "credits"], [25, "credits"], [26, "credits"]], "Deep-Sea-Treasure": [[6, "deep-sea-treasure"]], "Deep-Sea-Treasure-Concave": [[7, "deep-sea-treasure-concave"]], "Deep-Sea-Treasure-Mirrored": [[8, "deep-sea-treasure-mirrored"]], "Fishwood": [[9, "fishwood"]], "Termination": [[9, "termination"]], "Four-Room": [[10, "four-room"]], "References": [[10, 
"references"]], "Fruit-Tree": [[11, "fruit-tree"]], "Minecart": [[12, "minecart"]], "Minecart-Deterministic": [[13, "minecart-deterministic"]], "MO-Halfcheetah": [[14, "mo-halfcheetah"]], "MO-Highway": [[15, "mo-highway"]], "MO-Highway-Fast": [[16, "mo-highway-fast"]], "MO-Hopper": [[17, "mo-hopper"]], "MO-Hopper-2D": [[18, "mo-hopper-2d"]], "MO-Lunar-Lander": [[19, "mo-lunar-lander"]], "MO-Lunar-Lander-Continuous": [[20, "mo-lunar-lander-continuous"]], "MO-Mountaincar": [[21, "mo-mountaincar"]], "Reward space:": [[21, "reward-space"], [22, "reward-space"]], "MO-Mountaincarcontinuous": [[22, "mo-mountaincarcontinuous"]], "MO-Reacher": [[23, "mo-reacher"]], "MO-Supermario": [[24, "mo-supermario"]], "Resource-Gathering": [[25, "resource-gathering"]], "Water-Reservoir": [[26, "water-reservoir"]], "Citation": [[27, "citation"]], "MORL Baselines": [[28, "morl-baselines"]], "List of Publications": [[29, "list-of-publications"]], "API": [[30, "api"], [31, "api"]], "Install": [[30, "install"], [32, "install"]], "Citing": [[30, "citing"]], "Release Notes": [[33, "release-notes"]], "v1.0.1: MO-Gymnasium 1.0.1 Release: Support Gymnasium 0.29, breakable-bottles pygame render, and more": [[33, "release-v1-0-1"]], "v1.0.0: MO-Gymnasium becomes mature": [[33, "release-v1-0-0"]], "v0.3.4: MO-Gymnasium 0.3.4 Release: Known Pareto Front, improved renders and documentation": [[33, "release-v0-3-4"]], "v0.3.3: MO-Gymnasium 0.3.3 Release: Policy Evaluation bug fix, better documentation page": [[33, "release-v0-3-3"]], "MO-Gymnasium 0.3.2 Release: Bug fixes, improved webpage": [[33, "release-0-3-2"]], "MO-Gymnasium 0.3.1 Release: Improved documentation and MuJoco MO-Reacher environment": [[33, "release-0-3-1"]], "MO-Gymnasium 0.3.0 Release: Migrate to Gymnasium": [[33, "release-0-3-0"]], "0.2.1": [[33, "release-0-2-1"]], "0.2.0": [[33, "release-0-2-0"]], "0.1.2": [[33, "release-0-1-2"]], "0.1.1": [[33, "release-0-1-1"]], "Creating a custom environment": [[34, 
"creating-a-custom-environment"]], "1. Create a new environment class": [[34, "create-a-new-environment-class"]], "2. Register the environment": [[34, "register-the-environment"]], "3. Test the environment": [[34, "test-the-environment"]], "4. Instantiate your environment": [[34, "instantiate-your-environment"]], "Wrappers": [[35, "wrappers"]], "LinearReward": [[35, "linearreward"]], "MONormalizeReward": [[35, "monormalizereward"]], "MOClipReward": [[35, "moclipreward"]], "MOSyncVectorEnv": [[35, "mosyncvectorenv"]], "MORecordEpisodeStatistics": [[35, "morecordepisodestatistics"]]}, "indexentries": {"linearreward (class in mo_gymnasium)": [[35, "mo_gymnasium.LinearReward"]], "moclipreward (class in mo_gymnasium)": [[35, "mo_gymnasium.MOClipReward"]], "monormalizereward (class in mo_gymnasium)": [[35, "mo_gymnasium.MONormalizeReward"]], "morecordepisodestatistics (class in mo_gymnasium)": [[35, "mo_gymnasium.MORecordEpisodeStatistics"]], "mosyncvectorenv (class in mo_gymnasium)": [[35, "mo_gymnasium.MOSyncVectorEnv"]]}})
\ No newline at end of file