Implement a multi-objective pistonball environment #10

wilrop · 2023-11-06T16:10:35Z

I implemented a multi-objective version of Pistonball. This boils down to separating the three components of the original reward function (global, local, and time) and exposing them as a vector reward instead of a single scalar.
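To illustrate the idea, here is a minimal sketch of what a vector reward could look like next to a scalarized one. It is not the code from this PR: the function names `vector_reward` and `scalarize`, and the idea of recovering a scalar through a weighted sum, are assumptions made purely for illustration.

```python
import numpy as np


def vector_reward(global_reward, local_reward, time_penalty):
    """Stack the three reward components into one vector (hypothetical helper).

    The component order [global, local, time] is an assumption for this sketch.
    """
    return np.array([global_reward, local_reward, time_penalty], dtype=np.float32)


def scalarize(reward_vec, weights):
    """Collapse a vector reward back to a scalar with a weighted sum.

    The weights are illustrative; they are not taken from the environment.
    """
    return float(np.dot(reward_vec, weights))


r = vector_reward(1.0, 0.5, -0.1)
scalar = scalarize(r, [1.0, 1.0, 1.0])
```

Keeping the components separate leaves the choice of trade-off to the learning algorithm rather than fixing it inside the environment.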

ffelten

Looks good, only one minor comment

ffelten · 2023-11-08T15:23:18Z

momadm_benchmarks/envs/pistonball/pistonball.py

+        )
+        self.reward_dim = 3  # [global, local, time]
+        self.reward_spaces = {
+            f"piston_{i}": Box(low=-np.inf, high=np.inf, shape=(self.reward_dim,), dtype=np.float32)


Isn't it possible to have better bounds on this?

wilrop · 2023-11-24T13:41:19Z

I've added more informative reward bounds, with documentation on how I obtained them. It would be good if someone could briefly check whether this makes sense. I also ran random policies with 50 different seeds to verify that the rewards stayed within the specified bounds, and everything seemed okay.
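For anyone who wants to repeat that kind of check, a rough sketch of a bounds test on collected reward vectors follows. The helper `find_bound_violations`, the placeholder bounds, and the sample rewards are all hypothetical; they are not the bounds set in this PR, and the samples stand in for rewards gathered from random-policy rollouts.

```python
import numpy as np


def find_bound_violations(rewards, low, high):
    """Return the indices of reward vectors with any component outside [low, high]."""
    rewards = np.asarray(rewards, dtype=np.float32)
    low = np.asarray(low, dtype=np.float32)
    high = np.asarray(high, dtype=np.float32)
    # A vector violates the bounds if at least one component is out of range.
    outside = np.any((rewards < low) | (rewards > high), axis=1)
    return np.flatnonzero(outside).tolist()


# Placeholder bounds for the three components, for illustration only.
low = [-10.0, -5.0, -1.0]
high = [10.0, 5.0, 0.0]

# Stand-in for reward vectors collected during rollouts.
samples = [[0.5, 0.2, -0.1], [12.0, 0.0, -0.1], [0.0, 0.0, 0.0]]
violations = find_bound_violations(samples, low, high)
```

An empty list over all seeds would suggest the bounds are at least not too tight, although random policies alone cannot show that they are tight.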

wilrop added 5 commits October 19, 2023 16:22

Initial commit for mo-pistonball

50b825e

Updates to make mo-pistonball work

99f7206

Extract the multi-objective reward in each step

f068e19

Fix tests

c15f6bd

Merge into main

e6167a8

wilrop requested review from ffelten and umutucak and removed request for ffelten November 6, 2023 16:10

ffelten reviewed Nov 8, 2023


wilrop added 2 commits November 24, 2023 14:28

Set more informative reward bounds

f75f3ce

Merge pistonball into main branch

fa57b4b

Merge main into branch

05295c9

wilrop merged commit 0b83951 into main Dec 8, 2023

wilrop deleted the mo-pistonball branch February 1, 2024 15:10
