
SD3.5 - Workaround for Do not promote FP8 error #2220

Draft · wants to merge 1 commit into base: sd35

Conversation

saunderez

Workaround the FP8 Do Not Promote error by casting to FP32 first.
Check both tensors are on the same device before using.

Make sure both tensors are on the same device and cast them to FP32 to avoid the "do not promote FP8" error, allowing FP8 models to work.
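The workaround described above can be sketched as a small PyTorch helper (a minimal illustration of the idea, not the actual patch in this PR; the function name is made up):

```python
import torch

def add_with_fp32_cast(x_embed: torch.Tensor, pos_embed: torch.Tensor) -> torch.Tensor:
    """Add two embedding tensors that may be stored in FP8.

    PyTorch's type-promotion rules refuse to promote float8 dtypes,
    raising a "do not promote" error on mixed-dtype ops, so both
    operands are first moved to the same device and cast to FP32.
    """
    device = x_embed.device
    x32 = x_embed.to(device=device, dtype=torch.float32)
    p32 = pos_embed.to(device=device, dtype=torch.float32)
    return x32 + p32
```

The FP32 cast is the safe-but-slow choice; as discussed below, casting to FP16 instead recovers most of the speed.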
@saunderez
Author

It's slow but it works; as you can see, I'm using the FP8 version of SD3.5 Large.

[image]

Result:
[image]

@VeteranXT

There is an SD3.5 Medium version of it; I recommend using that.

@andy8992

andy8992 commented Nov 4, 2024

What's left to add 3.5 Medium support? This is needed.

@VeteranXT

Hype Train!

@andy8992

andy8992 commented Nov 7, 2024

I check this dang GitHub page every single day hoping there'll be news about this.

@saunderez
Author

When I posted this workaround I figured someone would fix it properly in a matter of days. I was mucking around with SageAttention, which magically fixed the slow generation, and I would've added that here too, but it's too hacky. Attention needs to be split out so it's all handled in one place. Back in the Auto1111 Dreambooth days I made a wrapper function for the model that worked well, but I'm not sure that's feasible with this implementation.
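The "attention handled in one place" idea could look something like a single dispatch function that every attention call routes through, so a faster kernel can be swapped in at one spot instead of patching many call sites. This is purely a hypothetical sketch; the backend names and registry here are not Forge's actual API:

```python
import torch
import torch.nn.functional as F

# Default backend; an alternative kernel (e.g. SageAttention) could be
# registered here once, rather than hacked into each call site.
_ATTENTION_BACKEND = "sdpa"

def attention(q, k, v, backend=None):
    """Route all attention through one dispatch point."""
    chosen = backend or _ATTENTION_BACKEND
    if chosen == "sdpa":
        # PyTorch's built-in fused scaled dot-product attention
        return F.scaled_dot_product_attention(q, k, v)
    # elif chosen == "sage":
    #     from sageattention import sageattn  # optional dependency
    #     return sageattn(q, k, v)
    raise ValueError(f"unknown attention backend: {chosen}")
```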

@likelovewant

likelovewant commented Nov 21, 2024

By changing float32 to float16, the speed catches up and it works:

    x_embed = self.x_embedder(x).to(torch.float16)
    pos_embed = self.cropped_pos_embed(hw).to(torch.float16).to("cuda")
    x = x_embed + pos_embed

@saunderez
Author

saunderez commented Nov 29, 2024

> By changing float32 to float16, the speed catches up and it works:
>
>     x_embed = self.x_embedder(x).to(torch.float16)
>     pos_embed = self.cropped_pos_embed(hw).to(torch.float16).to("cuda")
>     x = x_embed + pos_embed

Yeah, to be honest, I just made it cast to FP32 because that was most likely to just work. I just wanted to try out the model while I waited for official support, so when the first thing I tried worked, that was enough to satisfy my curiosity.

@VeteranXT

How did you make it work?

@likelovewant

> By changing float32 to float16, the speed catches up and it works:
>
>     x_embed = self.x_embedder(x).to(torch.float16)
>     pos_embed = self.cropped_pos_embed(hw).to(torch.float16).to("cuda")
>     x = x_embed + pos_embed
>
> Yeah, to be honest, I just made it cast to FP32 because that was most likely to just work. I just wanted to try out the model while I waited for official support, so when the first thing I tried worked, that was enough to satisfy my curiosity.

Understood. It's great that you found a quick workaround to get FP8 support, even if it's a temporary fix. Thanks for sharing your solution. Now let's keep our fingers crossed for the official Forge SD3.5 Medium release. I've run a few tests, but unfortunately without much luck getting it to work. Let's hope lllyasviel will be able to make it happen soon, despite his busy schedule. @saunderez

@likelovewant

> How did you make it work?

Simply manually change the code as shown in https://github.com/lllyasviel/stable-diffusion-webui-forge/pull/2220/files
@VeteranXT

@VeteranXT

I did that and I get "can't recognize model..."

@likelovewant

> I did that and I get "can't recognize model..."

First, git checkout sd35 and edit the files. Besides changing float32 to float16, download the necessary files for SD3.5, e.g. the extra clip_g and t5xxl_fp8_e4m3fn.safetensors text encoders and the sd3.5_large_fp8 models, then start the program, as shown in #2183 and
#2161 (comment)
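The steps above can be summarized as follows (a setup sketch; filenames and paths are illustrative, adjust to your install):

```shell
# From the stable-diffusion-webui-forge checkout:
git checkout sd35        # switch to the SD3.5 branch

# Apply the edits from this PR (optionally changing float32 to float16
# for speed, as discussed above).

# Download the required files, e.g.:
#   - clip_g.safetensors and t5xxl_fp8_e4m3fn.safetensors (text encoders)
#   - the sd3.5_large_fp8 checkpoint
# place them in the usual models/ folders, then start the program.
```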

@VeteranXT

@danilomaiaweb

It doesn't work for me. See my Forge version:

app: stable-diffusion-webui-forge.git
updated: 2024-12-10
hash: e073e4e
url: https://github.com/lllyasviel/stable-diffusion-webui-forge.git/tree/main

My branch is origin/main.
How do I get full SD3.5 integration on my version?

Do I have to change my branch to sd35 (main...sd35)?
Thanks in advance.

@likelovewant

likelovewant commented Dec 17, 2024

> It doesn't work for me. See my Forge version:
>
> app: stable-diffusion-webui-forge.git updated: 2024-12-10 hash: e073e4e url: https://github.com/lllyasviel/stable-diffusion-webui-forge.git/tree/main
>
> My branch is origin/main. How do I get full SD3.5 integration on my version?
>
> Do I have to change my branch to sd35 (main...sd35)? Thanks in advance.

Simply
git checkout sd35
and do the edit, then launch it. If there is any error and you cannot figure it out, it's better to paste the error log here; otherwise nobody knows what's going on.
@danilomaiaweb

@VeteranXT

Well, I'm using AMD Forge, a fork of this. I did edit the files, but all I get is "can't recognize model".

@likelovewant

> Well, I'm using AMD Forge, a fork of this. I did edit the files, but all I get is "can't recognize model".

Tested on an AMD GPU via ZLUDA; it works. Perhaps you need to update your huggingface_guess, see #2161 (comment).
@VeteranXT


5 participants