[CK] [FP8] Add FP8 support to CK A8W8 GEMM #124

Open, wants to merge 18 commits into base: main


@tjtanaa tjtanaa commented Feb 18, 2025

Description

This PR refactors the CK A8W8 GEMM, which initially supported only the INT8 A8W8 kernel, so that it also supports FP8.
It also includes an updated tuning script that supports tuning the FP8 A8W8 kernel.

Caveat

The FP8 and INT8 kernels currently share the same tuning filename.
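
To illustrate why the shared filename matters, here is a minimal sketch. It assumes that tuned results are looked up by problem shape (M, N, K) only; that is an assumption, not something this PR states. All names below (`tune_shared`, `tune_per_dtype`, the kernel labels) are hypothetical and are not part of the tuning script.

```python
from typing import Dict, List, Tuple

Shape = Tuple[int, int, int]   # (M, N, K), assumed lookup key
Result = Tuple[str, Shape, str]  # (dtype, shape, kernel label)


def tune_shared(results: List[Result]) -> Dict[Shape, str]:
    """Hypothetical shared table: the dtype is not part of the key,
    so a later result for the same shape replaces the earlier one."""
    table: Dict[Shape, str] = {}
    for _dtype, shape, kernel in results:
        table[shape] = kernel
    return table


def tune_per_dtype(results: List[Result]) -> Dict[Tuple[str, Shape], str]:
    """Hypothetical alternative: the dtype is part of the key,
    so INT8 and FP8 results for the same shape coexist."""
    table: Dict[Tuple[str, Shape], str] = {}
    for dtype, shape, kernel in results:
        table[(dtype, shape)] = kernel
    return table


# Made-up results for one shape, tuned once per dtype.
results: List[Result] = [
    ("int8", (128, 4096, 4096), "int8_kernel_a"),
    ("fp8", (128, 4096, 4096), "fp8_kernel_b"),
]

shared = tune_shared(results)
separate = tune_per_dtype(results)
```

Under that assumption, tuning both dtypes into one table keeps only the most recent entry per shape, while keying by dtype keeps both.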

@tjtanaa tjtanaa marked this pull request as ready for review February 20, 2025 14:02
Collaborator

We already have a test_common.py; please move and merge this into test_common.


@vllmellm vllmellm Feb 24, 2025

@valarLip Merged the utilities into test_common.py in b5477f9.

Author

@valarLip It is ready for review. Could you take a look at this PR? Thank you.


3 participants