🐛 Describe the bug
Results are normal with fp16, but there are accuracy problems with bf16.
The cause appears to be that swiglu uses tl.sigmoid(), which requires its input to be converted to fp32; accuracy is then lost when the result is converted back to bf16.
This part of the code:
@triton.jit
def silu(x):
    return x * tl.sigmoid(x)
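To give a rough sense of how much precision the final cast can cost, here is a minimal pure-Python sketch. It is not the Triton kernel and not a reproduction of this bug: it computes SiLU in Python double precision (standing in for the fp32 intermediate) and then rounds the result to bf16 and to fp16. The helper names `round_to_bf16`, `round_to_fp16`, and `max_relative_error` are hypothetical and exist only for this illustration.

```python
import math
import struct


def round_to_bf16(value: float) -> float:
    """Round a finite float to the nearest bfloat16 value (ties to even)."""
    # bf16 is the upper 16 bits of the fp32 bit pattern.
    bits = struct.unpack("<I", struct.pack("<f", value))[0]
    # Round to nearest even on the 16 low bits that are about to be dropped.
    bits += 0x7FFF + ((bits >> 16) & 1)
    bits &= 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def round_to_fp16(value: float) -> float:
    """Round a float to the nearest IEEE half-precision value."""
    return struct.unpack("<e", struct.pack("<e", value))[0]


def silu(x: float) -> float:
    """Reference SiLU, x * sigmoid(x), evaluated in double precision."""
    return x / (1.0 + math.exp(-x))


def max_relative_error(round_fn, inputs):
    """Largest relative error introduced by rounding silu(x) with round_fn."""
    worst = 0.0
    for x in inputs:
        exact = silu(x)
        worst = max(worst, abs(round_fn(exact) - exact) / abs(exact))
    return worst


# Assumed sample grid: nonzero multiples of 1/16 in [-4, 4].
inputs = [i / 16 for i in range(-64, 65) if i != 0]
max_rel_bf16 = max_relative_error(round_to_bf16, inputs)
max_rel_fp16 = max_relative_error(round_to_fp16, inputs)

print(f"max relative error after bf16 rounding: {max_rel_bf16:.3e}")
print(f"max relative error after fp16 rounding: {max_rel_fp16:.3e}")
```

bf16 keeps 8 bits of significand precision against 11 for fp16, so the rounding error of a single cast is roughly 8 times larger. Some loss relative to fp16 is therefore expected even when the sigmoid itself is evaluated in fp32.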
Reproduce
No response
Versions
v0.4.2