Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SISL/Multiwalker env MO Port #5

Merged
merged 23 commits into from
Oct 19, 2023
Merged

SISL/Multiwalker env MO Port #5

merged 23 commits into from
Oct 19, 2023

Conversation

umutucak
Copy link
Collaborator

I've vectorized all the rewards in the sisl/multiwalker env from PZ to be MO

Objectives

  1. package moves forward
  2. no agent falls
  3. package doesnt fall

@umutucak umutucak requested a review from ffelten October 10, 2023 10:09
@umutucak umutucak self-assigned this Oct 10, 2023
@umutucak umutucak changed the title SISL/Multiwaler env MO Port SISL/Multiwalker env MO Port Oct 10, 2023
@umutucak
Copy link
Collaborator Author

umutucak commented Oct 10, 2023

I am not sure why I am getting this final error. It is claiming that the object I am indexing is None type, but it is not. We can discuss it tomorrow?

@umutucak umutucak removed the request for review from ffelten October 10, 2023 16:28
SSH_AUTH_SOCK=/private/tmp/com.apple.launchd.dekcBjReVV/Listeners
LC_TERMINAL_VERSION=3.4.21
COLORFGBG=15;0
ITERM_PROFILE=Main
XPC_FLAGS=0x0
PWD=/Users/ucak/MOMAS/momadm-bechmarks
SHELL=/bin/zsh
__CFBundleIdentifier=com.googlecode.iterm2
LC_CTYPE=UTF-8
TERM_PROGRAM_VERSION=3.4.21
TERM_PROGRAM=iTerm.app
PATH=/Users/ucak/MOMAS/momadm-bechmarks/.venv/bin:/opt/homebrew/bin:/opt/homebrew/sbin:/usr/local/bin:/System/Cryptexes/App/usr/bin:/usr/bin:/bin:/usr/sbin:/sbin:/Library/Frameworks/Mono.framework/Versions/Current/Commands:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/local/bin:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/bin:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/appleinternal/bin
LC_TERMINAL=iTerm2
COLORTERM=truecolor
COMMAND_MODE=unix2003
TERM=xterm-256color
HOME=/Users/ucak
TMPDIR=/var/folders/s5/3px3g159399c2l1bm_n1bw4c0000gn/T/
USER=ucak
XPC_SERVICE_NAME=0
LOGNAME=ucak
ITERM_SESSION_ID=w0t0p0:8A09846D-45E4-45E6-9AFE-C591885D192A
__CF_USER_TEXT_ENCODING=0x0:0:0
SHLVL=1
OLDPWD=/Users/ucak/MOMAS
ZSH=/Users/ucak/.oh-my-zsh
HOMEBREW_PREFIX=/opt/homebrew
HOMEBREW_CELLAR=/opt/homebrew/Cellar
HOMEBREW_REPOSITORY=/opt/homebrew
MANPATH=/opt/homebrew/share/man::
INFOPATH=/opt/homebrew/share/info:
PAGER=less
LESS=-R
LSCOLORS=Gxfxcxdxbxegedabagacad
LS_COLORS=di=1;36:ln=35:so=32:pi=33:ex=31:bd=34;46:cd=34;43:su=30;41:sg=30;46:tw=30;42:ow=30;43
VIRTUAL_ENV=/Users/ucak/MOMAS/momadm-bechmarks/.venv
PS1=(.venv) %(?:%{�[01;32m%}➜ :%{�[01;31m%}➜ ) %{�[36m%}%c%{�[00m%} $(git_prompt_info)
VIRTUAL_ENV_PROMPT=(.venv)
_=/usr/bin/env, ,  functions
@umutucak umutucak requested a review from ffelten October 16, 2023 12:26
@umutucak
Copy link
Collaborator Author

The failed test is for deterministic environments, not applicable to Multiwalker. All other tests pass. 👍

Copy link
Collaborator

@ffelten ffelten left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I still need to go through the "core" code in subroutine but have to go now. Will do this later.

momadm_benchmarks/envs/multiwalker/multiwalker.py Outdated Show resolved Hide resolved
momadm_benchmarks/envs/multiwalker/multiwalker.py Outdated Show resolved Hide resolved
momadm_benchmarks/envs/multiwalker/momultiwalker_v0.py Outdated Show resolved Hide resolved
momadm_benchmarks/envs/multiwalker/multiwalker_base.py Outdated Show resolved Hide resolved
momadm_benchmarks/envs/multiwalker/multiwalker_base.py Outdated Show resolved Hide resolved
momadm_benchmarks/envs/multiwalker/multiwalker_base.py Outdated Show resolved Hide resolved
momadm_benchmarks/envs/multiwalker/multiwalker_base.py Outdated Show resolved Hide resolved
momadm_benchmarks/envs/multiwalker/multiwalker_base.py Outdated Show resolved Hide resolved
@umutucak umutucak requested a review from ffelten October 18, 2023 12:40
Copy link
Collaborator

@ffelten ffelten left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Almost there! Nice one

momadm_benchmarks/envs/multiwalker/multiwalker.py Outdated Show resolved Hide resolved
momadm_benchmarks/envs/multiwalker/multiwalker_base.py Outdated Show resolved Hide resolved
momadm_benchmarks/envs/multiwalker/multiwalker_base.py Outdated Show resolved Hide resolved
rewards[i][1] = self.fall_reward # not all, only the one that fell
if self.remove_on_fall:
walker._destroy()
if not self.terminate_on_fall:
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should this not be if self.terminate_on_fall? This is for giving everyone the termination penalty on top of the falling penalty if the setting is toggled.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I changed it to be like this, but if I'm wrong we can revert.

if self.remove_on_fall:
walker._destroy()
if not self.terminate_on_fall:
rewards[:][1] = self.terminate_reward
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should this be rewards[:, 2] for the 3rd objective (pkg not falling)? Which objective should be penalize for terminate_on_fall?

@umutucak umutucak requested a review from ffelten October 19, 2023 09:43
Copy link
Collaborator

@ffelten ffelten left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

After fixing the last comments we can merge

momadm_benchmarks/envs/multiwalker/multiwalker_base.py Outdated Show resolved Hide resolved
momadm_benchmarks/envs/multiwalker/multiwalker_base.py Outdated Show resolved Hide resolved
momadm_benchmarks/envs/multiwalker/multiwalker_base.py Outdated Show resolved Hide resolved
@umutucak umutucak merged commit 94ecaeb into main Oct 19, 2023
5 checks passed
@ffelten ffelten deleted the multiwalker branch October 19, 2023 15:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants