You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi!
Thanks for playing around with the code.
An average score near zero means that the trained agent is basically just
as good at playing Slime Volleyball compared to the built-in AI opponent
(which is already really good).
An average score > zero means that the trained agent is (on average) better
than the built-in AI opponent.
Since the built-in AI opponent is actually *really* good at the game, it
would be virtually impossible to get a score that is substantially higher
than zero. I was able to get ~ 1.4 for some of the best agents I trained.
If you do anything interesting with this project, please do share with me
in the future.
Thanks.
Hi
First of all, thanks for this repo
I tried to run your code, the training
The maximum reward is 0, how is it possible?
Thanks
The text was updated successfully, but these errors were encountered: