Fix wrong world vector range in training loss calculation #84

jonathansalzer · 2024-07-22T13:58:40Z

Throughout training and inference, a world vector range of (-2; 2) is used consistently. During loss calculation, however, the world vector range is not passed explicitly to the action tokenization function, so it defaults to (-1; 1). This mismatch causes scaling errors: values are effectively doubled, and any value outside (-1; 1) is clipped.

Passing the model's world vector range to the action tokenization function resolves this issue.
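
To illustrate the mismatch, here is a minimal sketch of uniform action binning. It is not the repository's tokenizer: the helper names `tokenize` and `detokenize`, the 256-bin vocabulary, and the truncating rounding are all assumptions made for this example.

```python
# Hypothetical helpers, not the repository's actual tokenizer.
# Assumes uniform binning into 256 tokens with truncation.

def tokenize(value, low, high, vocab_size=256):
    """Clip value to [low, high] and map it to a discrete bin index."""
    clipped = min(max(value, low), high)
    normalized = (clipped - low) / (high - low)  # in [0, 1]
    return int(normalized * (vocab_size - 1))


def detokenize(token, low, high, vocab_size=256):
    """Map a bin index back to a continuous value in [low, high]."""
    normalized = token / (vocab_size - 1)
    return normalized * (high - low) + low


MODEL_RANGE = (-2.0, 2.0)    # range used in training and inference
DEFAULT_RANGE = (-1.0, 1.0)  # range used when none is passed

action = 0.5

# Mismatched: the target is tokenized with the default range,
# but the model interprets tokens in its own range.
wrong_token = tokenize(action, *DEFAULT_RANGE)
wrong_value = detokenize(wrong_token, *MODEL_RANGE)  # roughly 2 * action

# Consistent: the same range is used on both sides.
right_token = tokenize(action, *MODEL_RANGE)
right_value = detokenize(right_token, *MODEL_RANGE)  # roughly action

# Clipping: distinct actions outside (-1; 1) collapse to the same token.
clipped_a = tokenize(1.5, *DEFAULT_RANGE)
clipped_b = tokenize(1.8, *DEFAULT_RANGE)
```

In this sketch, `wrong_value` comes out close to twice the original action, while `clipped_a` and `clipped_b` are identical, so the distinction between 1.5 and 1.8 is lost before the loss is computed.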

AoqunJin · 2025-01-01T03:15:58Z

colabs/Minimal_Training_Example.ipynb

I found this due to the poor performance of the trained model when tested. But it's solved now, took days to debug 😂😂

fix wrong world vector range in action tokenization

15e8ca4

AoqunJin reviewed Jan 1, 2025
