Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tune top 32 BS2 punet dispatches on MI325 QPX #124

Merged
merged 1 commit into from
Feb 23, 2025

Conversation

Max191
Copy link

@Max191 Max191 commented Feb 22, 2025

Tunes the top 32 convs/matmuls in punet BS2. The total gain from this tuning is 2.5%.

Copy link
Collaborator

@monorimet monorimet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, and gives about 2-4% in submission runs.

@monorimet monorimet merged commit bc45d29 into nod-ai:shared/sdxl_on_main Feb 23, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants