Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feat/matmul/simple barrier #496

Closed
wants to merge 79 commits into from
Closed

Feat/matmul/simple barrier #496

wants to merge 79 commits into from

Conversation

louisfd
Copy link
Member

@louisfd louisfd commented Feb 26, 2025

  • Add cooperative barrier option, in which all the cube must perform all memcpy_async calls.
  • Add 5 loaders working with the strided layout. Some of them are experimental, will probably be deleted at some point
  • Make loaders generic over the copy mechanism (pipeline vs barrier)
  • Since i'm experimenting, i fork simple_matmul a bit. This will be cleaned up once it's more clear what we want to keep
  • Fix memcpy to use references

@louisfd louisfd requested a review from maxtremblay February 26, 2025 18:53
@louisfd
Copy link
Member Author

louisfd commented Feb 28, 2025

I'll make another PR containing this one

@louisfd louisfd closed this Feb 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant