You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm trying to get a good agent on the highway environment with high-dimension observations and I'm running into some trouble getting it to behave nicely. I'm traing via PPO from stable-baselines3, on 1.9.1, and using this config at the moment (in yml format):
Hi!
I'm trying to get a good agent on the highway environment with high-dimension observations and I'm running into some trouble getting it to behave nicely. I'm traing via PPO from stable-baselines3, on 1.9.1, and using this config at the moment (in yml format):
Lots of these have come from trying to fix the behavior I've been seeing:
offroad_terminal
set, it just spins very quickly in a circle.At that point, I realized I must be doing something fundamentally wrong.
I see here that the reward is scaled for forward-ness; is that with respect to the current road segment?
Is there something else I'm missing? I don't see other folks having this problem, so I'm assuming I've goofed something up somewhere.
The text was updated successfully, but these errors were encountered: