Error: Unexpected MMA layout version found #149
Comments
Same question here.
I'm hitting this problem on a Titan X as well.
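Same problem on a P40. It comes from triton: the matmul_248_kernel function fails when it reaches c = accumulator.to(tl.float16). The compute architecture may simply be too old; everyone reporting this kind of error seems to be on an NVIDIA card with compute capability below 7.0. Is there any way to work around it?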
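To make the "below 7.0" observation above concrete, here is a minimal sketch of a pre-flight check. It assumes only what this thread reports (cards below compute capability 7.0 hit the assertion); `supports_mma` and `current_gpu_capability` are hypothetical helper names, not part of MOSS or Triton. The capability values for the P100, P40 and V100 are NVIDIA's published ones.

```python
# Sketch: flag GPUs that are likely to hit the MMA assertion, based only on
# the thread's observation that cards below compute capability 7.0 fail.

def supports_mma(major: int, minor: int = 0) -> bool:
    """Hypothetical helper: True if the compute capability is 7.0 or newer."""
    return (major, minor) >= (7, 0)


def current_gpu_capability(device: int = 0):
    """Return (major, minor) for the given GPU, or None without PyTorch/CUDA."""
    try:
        import torch  # imported lazily so the sketch loads without PyTorch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    return torch.cuda.get_device_capability(device)


# Compute capabilities of the cards mentioned in this thread.
KNOWN_CARDS = {"P100": (6, 0), "P40": (6, 1), "V100": (7, 0)}

# Map each card to whether it passes the check.
CARD_STATUS = {name: supports_mma(*cap) for name, cap in KNOWN_CARDS.items()}
```

On a machine with PyTorch and a CUDA device, calling `supports_mma(*current_gpu_capability())` before loading the quantized model would show up front whether the card falls in the affected range.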
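According to yesterday's update, https://github.com/openai/triton/pull/1505/files, the P100 and P40 both have compute capability 6.x, so for now they can only use float32, but then there isn't enough GPU memory. Can this be fixed soon?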
I hit the same problem on a P100. Does MOSS not support the P100?
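Tested today: after switching to float32, the P100/P40 either run out of GPU memory or fail with Unexpected MMA layout version found. The Triton site says support for fp16 quantized models is incomplete, and older cards such as the P100/P40 all hit the error above. We'll have to wait for them to add more support for older GPUs. Separately, I confirmed that a V100 32GB can run the int4 quantized model. (https://github.com/OpenLMLab/MOSS/issues/%E5%8F%8CP100%E6%98%BE%E5%AD%98%E4%B8%8D%E5%A4%9F)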
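A rough way to see why float32 runs out of memory while int4 fits is a weights-only estimate. The sketch below assumes a model of about 16 billion parameters; that figure is an assumption for illustration and is not stated anywhere in this thread. It also ignores activations, the KV cache and quantization metadata, so real usage will be higher.

```python
# Weights-only memory estimate. Activations, KV cache and quantization
# metadata are ignored, so actual usage is higher than these numbers.

BYTES_PER_PARAM = {"float32": 4.0, "float16": 2.0, "int4": 0.5}


def weight_gib(num_params: float, dtype: str) -> float:
    """Memory for the weights alone, in GiB, at the given precision."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


ASSUMED_PARAMS = 16e9  # assumption: parameter count is not given in the thread

# Estimated weight footprint for each precision.
ESTIMATES = {dtype: weight_gib(ASSUMED_PARAMS, dtype) for dtype in BYTES_PER_PARAM}
```

Under that assumption the float32 weights alone need far more than the 32GB of the largest card mentioned here, while the int4 weights fit within a single 24G P40.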
Found a workaround: the int4 quantized version runs successfully on a single P40 (24G). The steps are as follows: edit MOSS\moss_cli_demo.py L31, changing [...] python moss_cli_demo.py
python: /project/lib/Analysis/Utility.cpp:136: bool mlir::supportMMA(mlir::Value, int): Assertion `(version == 1 || version == 2) && "Unexpected MMA layout version found"' failed.