-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
question regarding to the settings and evaluating the training #4
Comments
Hi, I am not familiar with the boatNav environment, it was not implemented by me. For the rest, I don't quite get your questions. Algorithms-wise I typically use VRACER as default, just because it combines experience replay and policy gradients. The readme contains some default values for the hyper-parameters which depend on problem size. The option you should use how many evaluation sims to run is If the behavior is not as expected: does smarties say that restart was successful? What do you mean by lost? |
I don't use smarties.py instead I run directly with executable as "boatNav --restart /public3/home/sc51719/soft/smarties/apps/boadNav/evalue RACE.json --nEvalEpisodes 1" . the same thing? seem it restart properly here is the screen output in the restart
for boatNav case, the target is to move the boat from the start to the end by adjusting the torque and thrust, there are two situations, either arrived or not. Although there are several successful case(the boat arrives at the target) during training, d I get N Evaluation solutions but non of them arrives at the terminal in the evaluation. any suggestion |
Sorry for the wait. Again, I am afraid you should look into the boatNav case yourself because the person who developed it has left the lab. As far as running with ./boatNav instead with the smarties.py script. |
I rewrite the environment part and tested. I actually don't have any setting.json like VRACER.json in my run folder. seems it reads from the somewhere I don't know. whether we can open a private repo to discuss this ? |
Hi Novatig:
thanks for replying. the updated readme is helpful. Simultaneously I have read your paper of glider, seems the controling results are sensitive to the training methods and the setting. now I implement the a boat navigation case based on the boatNav. everything works fine with runTest for me. then I use RACE to train, the case is to control the boat to turning some angle and move to another location without limitation the final nose angle and termination speed in the reward. but I get quite little success sample during training. most of the time the boat got lost, does that influence the training? would you please make some suggestion? RACE? DQN? PPO? or change some setting?
furthermore, after training, I want to evaluate solution by restart, I use the command "boatNav --restart /public3/home/sc51719/soft/smarties/apps/boadNav/evalue RACE.json " but I got many trajectories. which i suppose to be one trajectory with successful reach the point i specified, since I suppose restart is going to load the trained network to evaluate the performance of one task. but there are even results of the boat got lost. what is meaning of that? sometimes there is event not any reach the harbor
can you provide me some suggestion?
The text was updated successfully, but these errors were encountered: