-
-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SISL/Multiwalker env MO Port #5
Conversation
I am not sure why I am getting this final error. It is claiming that the object I am indexing is |
SSH_AUTH_SOCK=/private/tmp/com.apple.launchd.dekcBjReVV/Listeners LC_TERMINAL_VERSION=3.4.21 COLORFGBG=15;0 ITERM_PROFILE=Main XPC_FLAGS=0x0 PWD=/Users/ucak/MOMAS/momadm-bechmarks SHELL=/bin/zsh __CFBundleIdentifier=com.googlecode.iterm2 LC_CTYPE=UTF-8 TERM_PROGRAM_VERSION=3.4.21 TERM_PROGRAM=iTerm.app PATH=/Users/ucak/MOMAS/momadm-bechmarks/.venv/bin:/opt/homebrew/bin:/opt/homebrew/sbin:/usr/local/bin:/System/Cryptexes/App/usr/bin:/usr/bin:/bin:/usr/sbin:/sbin:/Library/Frameworks/Mono.framework/Versions/Current/Commands:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/local/bin:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/bin:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/appleinternal/bin LC_TERMINAL=iTerm2 COLORTERM=truecolor COMMAND_MODE=unix2003 TERM=xterm-256color HOME=/Users/ucak TMPDIR=/var/folders/s5/3px3g159399c2l1bm_n1bw4c0000gn/T/ USER=ucak XPC_SERVICE_NAME=0 LOGNAME=ucak ITERM_SESSION_ID=w0t0p0:8A09846D-45E4-45E6-9AFE-C591885D192A __CF_USER_TEXT_ENCODING=0x0:0:0 SHLVL=1 OLDPWD=/Users/ucak/MOMAS ZSH=/Users/ucak/.oh-my-zsh HOMEBREW_PREFIX=/opt/homebrew HOMEBREW_CELLAR=/opt/homebrew/Cellar HOMEBREW_REPOSITORY=/opt/homebrew MANPATH=/opt/homebrew/share/man:: INFOPATH=/opt/homebrew/share/info: PAGER=less LESS=-R LSCOLORS=Gxfxcxdxbxegedabagacad LS_COLORS=di=1;36:ln=35:so=32:pi=33:ex=31:bd=34;46:cd=34;43:su=30;41:sg=30;46:tw=30;42:ow=30;43 VIRTUAL_ENV=/Users/ucak/MOMAS/momadm-bechmarks/.venv PS1=(.venv) %(?:%{�[01;32m%}➜ :%{�[01;31m%}➜ ) %{�[36m%}%c%{�[00m%} $(git_prompt_info) VIRTUAL_ENV_PROMPT=(.venv) _=/usr/bin/env, , functions
The failed test is for deterministic environments, not applicable to Multiwalker. All other tests pass. 👍 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I still need to go through the "core" code in subroutine but have to go now. Will do this later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Almost there! Nice one
rewards[i][1] = self.fall_reward # not all, only the one that fell | ||
if self.remove_on_fall: | ||
walker._destroy() | ||
if not self.terminate_on_fall: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should this not be if self.terminate_on_fall
? This is for giving everyone the termination penalty on top of the falling penalty if the setting is toggled.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I changed it to be like this, but if I'm wrong we can revert.
if self.remove_on_fall: | ||
walker._destroy() | ||
if not self.terminate_on_fall: | ||
rewards[:][1] = self.terminate_reward |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should this be rewards[:, 2]
for the 3rd objective (pkg not falling)? Which objective should be penalize for terminate_on_fall
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
After fixing the last comments we can merge
I've vectorized all the rewards in the sisl/multiwalker env from PZ to be MO
Objectives