
Fix Qwen2VL mrope for transformers 4.47.0 #464

Merged 1 commit into linkedin:main on Dec 10, 2024
Conversation

@li-plus (Collaborator) commented Dec 10, 2024

Summary

Fix #461

Testing Done

  • Hardware Type: A800-SXM4-80GB
  • run make test to ensure correctness
  • run make checkstyle to ensure code style
  • run make test-convergence to ensure convergence

@ByronHsu (Collaborator) commented Dec 10, 2024

Dude this is fast lol. I'm curious why the tests can pass on AMD?

@li-plus (Collaborator, Author) commented Dec 10, 2024

No idea, but it seems I don't have permission to run tests on the NVIDIA GPU, so they failed:

Run modal run dev.modal.tests
╭─ Error ──────────────────────────────────────────────────────────────────────╮
│ Token missing. Could not authenticate client. If you have token credentials, │
│ see modal.com/docs/reference/modal.config for setup help. If you are a new   │
│ user, register an account at modal.com, then run `modal token new`.          │
╰──────────────────────────────────────────────────────────────────────────────╯

@austin362667 (Collaborator) commented

Amazing! Thanks for the super fast turnaround, @li-plus. The NVIDIA CI works as expected in my Modal account.

I can help with reverting the workaround of #463 after this PR.

Wondering whether this code change corresponds to https://github.com/huggingface/transformers/pull/34274/files#diff-09bc594f9680f1d042fd485106c68022d77b59831697a00b3b38f12a3e40f395R1698-R1715?

@li-plus (Collaborator, Author) commented Dec 10, 2024

@austin362667 Thanks! I haven't dug into the code diffs. From the model input perspective, before 4.47.0 the position_ids for each sample within a batch always start from 0 and increase, so I only accessed the first row of the cos/sin embeddings to save HBM bandwidth. After 4.47.0, however, the position_ids are left-padded with 1s and differ for each sample because sequence lengths vary, so I have to access the full rotary embeddings to get the correct position for each sample. This change is backward compatible with older transformers versions.
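
A minimal sketch of the difference, assuming toy shapes and a simplified 1-D rope (the actual Qwen2VL mrope positions are multi-dimensional, and all names here are illustrative, not the actual Liger or transformers identifiers):

```python
import torch

# Toy shapes for illustration only.
batch, seq_len, head_dim = 2, 6, 8

# Before transformers 4.47.0: every sample's positions start at 0,
# so the first row serves the whole batch.
pos_old = torch.arange(seq_len).expand(batch, seq_len)
# [[0, 1, 2, 3, 4, 5],
#  [0, 1, 2, 3, 4, 5]]

# After 4.47.0: shorter samples are left-padded with 1s, so rows differ.
lengths = [6, 4]
pos_new = torch.stack([
    torch.cat([torch.ones(seq_len - n, dtype=torch.long), torch.arange(n)])
    for n in lengths
])
# [[0, 1, 2, 3, 4, 5],
#  [1, 1, 0, 1, 2, 3]]

inv_freq = 1.0 / (10000.0 ** (torch.arange(0, head_dim, 2) / head_dim))

# Old shortcut: cos/sin built once from the shared first row.
cos_shared = torch.cos(pos_old[0, :, None] * inv_freq)  # (seq_len, head_dim // 2)

# Fix: build cos/sin from the full per-sample position_ids.
cos_full = torch.cos(pos_new[..., None] * inv_freq)     # (batch, seq_len, head_dim // 2)
```

With left padding, reusing only `pos_new[0]` would hand the second sample the wrong rotary angles, consistent with the convergence failure reported in #461.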

@ByronHsu (Collaborator) commented Dec 10, 2024

> I can help with reverting the workaround of #463 after this PR.

@austin362667 Please do that. Thanks! Also, @li-plus, I have added you as a maintainer, so you can push branches directly to the main repo to run CI. We have currently disabled CI from external forks for security reasons.

@ByronHsu ByronHsu merged commit 78e8a85 into linkedin:main Dec 10, 2024
3 of 5 checks passed
@li-plus li-plus deleted the fix-mrope branch December 11, 2024 02:16
ByronHsu pushed a commit that referenced this pull request Dec 11, 2024
## Summary

After fix #464, we can revert some changes in:

- #463
- #459

These were workarounds for #461.


## Testing Done

- Hardware Type: <BLANK>
- [ ] run `make test` to ensure correctness
- [X] run `make checkstyle` to ensure code style
- [X] run `make test-convergence` to ensure convergence

---------

Signed-off-by: Austin Liu <[email protected]>
Closes: Qwen VL Convergence Test Fails for Transformers >= 4.47.0 (#461)