
Second Version

@georgmartius georgmartius released this 11 Jan 09:48
· 40 commits to master since this release

The reward function is now very simple: it only reflects winning or losing.
Additional reward signals are stored in the info dict and can be used if desired.
The initialization conditions have changed with respect to Version 1.
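
For anyone who wants denser feedback than the win/lose reward, here is a minimal sketch of how the extra signals from the info dict could be folded back into the reward. The helper `shaped_reward`, the key names `touch_bonus` and `closeness_bonus`, and the weights are all hypothetical and only serve as illustration; check the info dict returned by the environment for the keys that are actually available.

```python
from typing import Mapping


def shaped_reward(
    base_reward: float,
    info: Mapping[str, float],
    weights: Mapping[str, float],
) -> float:
    """Return the base reward plus a weighted sum of auxiliary signals.

    Keys listed in `weights` but missing from `info` contribute nothing.
    """
    bonus = sum(w * float(info.get(key, 0.0)) for key, w in weights.items())
    return base_reward + bonus


# Hypothetical key names and weights, chosen for illustration only.
example_weights = {"touch_bonus": 1.0, "closeness_bonus": 0.5}
example_info = {"touch_bonus": 1.0, "closeness_bonus": -0.2}

# Mid-episode the win/lose reward is assumed to be 0 here.
print(shaped_reward(0.0, example_info, example_weights))
```

Keeping the weights outside the environment means the sparse win/lose reward stays untouched and the shaping can be changed or switched off without modifying the environment itself.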