Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature]: Build APIs to make the L3 cache programmable for users (ie, application developers) #295

Open
JianbaoTao opened this issue Feb 20, 2025 · 1 comment

Comments

@JianbaoTao
Copy link

Suggestion Description

The AMD L3 cache (SRAM; aka Infinity Cache) has very attractive capacity (256MB for MI300X). I am very interested to know if we can achieve performance gain by putting model data in the L3 cache when running application on AMD GPUs. IIUC, ROCm is the right layer to build APIs to program the L3 cache. So, here are my questions.First, is that right? Second, if it is right, can you share some code pointers how I can play with the idea myself, please? Many thanks.

Operating System

No response

GPU

No response

ROCm Component

No response

@schung-amd
Copy link

Hi @JianbaoTao, thanks for your interest! That's an interesting idea, but as far as I know this cache is not visible to ROCm applications and this would have to be done at the driver or firmware level.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants