You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Jul 20, 2024. It is now read-only.
你们在rdk x3上部署的模型,是怎样量化的?能够做到全部8bit量化吗(据我了解,llm很难做到全8bit量化,例如RMSNorm、RoPE这块基本上还是保持浮点计算)?
The text was updated successfully, but these errors were encountered: