
Question on sorted pre-defined trajectory data and training with CE loss #31

Open
jcuic5 opened this issue Jan 23, 2025 · 0 comments

jcuic5 commented Jan 23, 2025

Hi, thanks for the work. From the code, I was wondering why the pre-generated trajectories are argsorted by their error against the GT trajectory, so that the closest one comes first, and then used directly as the data input. Furthermore, when calculating the loss, the labels are all zeros, i.e. they always point to the first (closest) trajectory. Isn't this a hack? The model just needs to learn that the first trajectory always gets the highest score. Thanks in advance for the explanation.

first_stage_trajs = first_stage_trajs[np.argsort(expert_error_1)]

label = torch.zeros(scores.shape[0], dtype=torch.long).to(scores.device)
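To make the concern concrete, here is a minimal pure-Python sketch (not the repository's code). It assumes a hypothetical per-candidate scorer `score` that sees only a trajectory's contents, never its index. Under that assumption, "sort so the closest is first, label 0" gives the same cross-entropy loss as "keep the original order, label = argmin of the error", so the sort would only be a labeling convenience. The shortcut described above applies only if the scores can depend on the candidate's position, which the last lines illustrate with made-up index-based scores. All names, the toy data, and the stand-in scorer are assumptions for illustration.

```python
import math
import random


def cross_entropy(scores, label):
    """Softmax cross-entropy for one sample: -log softmax(scores)[label]."""
    m = max(scores)
    log_sum_exp = m + math.log(math.fsum(math.exp(s - m) for s in scores))
    return log_sum_exp - scores[label]


def score(traj):
    """Hypothetical scorer: depends only on the trajectory, not on its index."""
    return -sum(abs(x) for x in traj)  # stand-in for a learned network


random.seed(0)
gt = [0.3, -0.2, 0.5]  # toy "ground-truth" trajectory
candidates = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(5)]
errors = [sum((c - g) ** 2 for c, g in zip(traj, gt)) for traj in candidates]

# Variant A: keep the original order, label = index of the closest candidate.
label_a = min(range(len(candidates)), key=errors.__getitem__)
loss_a = cross_entropy([score(t) for t in candidates], label_a)

# Variant B: sort by error so the closest candidate is first, label = 0.
order = sorted(range(len(candidates)), key=errors.__getitem__)
sorted_candidates = [candidates[i] for i in order]
loss_b = cross_entropy([score(t) for t in sorted_candidates], 0)

# Position-dependent scores (the shortcut): ignore content, always favor slot 0.
index_scores = [10.0 if i == 0 else 0.0 for i in range(len(candidates))]
loss_cheat = cross_entropy(index_scores, 0)
```

With a content-only scorer, `loss_a` and `loss_b` agree up to floating-point rounding, while `loss_cheat` is close to zero without looking at any trajectory. Whether the model here can see the candidate index (for example through positional embeddings or a position-specific output head) is the part that would need the maintainers' confirmation.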
