Cuda GatherCompiler fails on low dimensionality (failing test) #71
Related to #70.
This showcases a failure in the CUDA `GatherCompiler` compilation pass.

To fetch the PR and run the tests:
The `test_gather_compiler_r1` test passes: https://github.com/jafioti/luminal/pull/71/files#diff-ead6d80249befe24da4ea23c236763f82cc1eccfccbb92035217e620e8bf5184R434-R459

The `test_gather_compiler_r0` test fails: https://github.com/jafioti/luminal/pull/71/files#diff-ead6d80249befe24da4ea23c236763f82cc1eccfccbb92035217e620e8bf5184R409-R432
The failure error is:
The relevant line of code is `luminal/crates/luminal_cuda/src/binary.rs`, line 379 in f61d53f.
I'm not sure how CUDA relates to those operations, but my first thought would be to take the last dimension's value instead of the third one (`emb_shape.shape()[2]`)?
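
To make the idea concrete, here is a minimal sketch in isolation. It is not luminal's code: the shape is modeled as a plain `&[usize]` slice (an assumption, since `emb_shape.shape()` may return a different type), and `dim_at` and `last_dim` are hypothetical helpers. It only illustrates why a hard-coded index of 2 is tied to shapes with at least three dimensions while taking the last element is not; whether this is the right fix for `GatherCompiler` is untested.

```rust
/// Hypothetical helper: read the dimension at a fixed index.
/// Returns `None` when the shape has too few dimensions,
/// where plain indexing (`shape[index]`) would panic instead.
fn dim_at(shape: &[usize], index: usize) -> Option<usize> {
    shape.get(index).copied()
}

/// Hypothetical helper: read the last dimension regardless of rank.
/// Returns `None` only for an empty shape.
fn last_dim(shape: &[usize]) -> Option<usize> {
    shape.last().copied()
}
```

With a three-dimensional shape such as `[1, 3, 4]`, `dim_at(shape, 2)` and `last_dim(shape)` return the same value. With a two-dimensional shape such as `[3, 4]`, only `last_dim` returns a dimension.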