Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Detail of maze2d experiment #41

Open
Super1ce opened this issue Aug 22, 2023 · 4 comments
Open

Detail of maze2d experiment #41

Super1ce opened this issue Aug 22, 2023 · 4 comments

Comments

@Super1ce
Copy link

I reproduced the results of the original paper on locomotion, but encountered some difficulties on maze2d. I would like to ask:

  1. Is the score given in the original paper normalized?
  2. The diffuser step given in the appendix only mentions locomotion and block-stacking, is maze2d also 100?
  3. The paper says to use start and goal locations, is the goal location obtained through the environment’s get_target and reset_to_location? Is the start location referring to the current observation of the agent? Is the goal location placed at horizon - 1? After setting it up this way, the result I reproduced only got a non-normalized score of 87.
@jannerm
Copy link
Owner

jannerm commented Aug 29, 2023

Are you on the maze2d branch?

  1. Yes
  2. The number of diffusion steps are in the config file here
  3. The goal location is randomized here

@Super1ce
Copy link
Author

Sorry, I didn’t notice this branch. I will try to reproduce it again. Thank you.

@Super1ce
Copy link
Author

Super1ce commented Sep 6, 2023

Are you on the maze2d branch?

  1. Yes
  2. The number of diffusion steps are in the config file here
  3. The goal location is randomized here

I saw a comment saying that the replan method can also be used during evaluation, but when I used the replan method of locomotion to replace the open-loop here, the return was all 0. Is it necessary to make additional changes to take effect?

@jannerm
Copy link
Owner

jannerm commented Sep 7, 2023

Closed-loop planning should work here; it will just be slower. What do the plans and rollouts look like?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants