
Possibility of reducing training VRAM usage #5

Open
codexq123 opened this issue Jan 9, 2025 · 1 comment

Comments

@codexq123

Would it be possible to use FP8 training to bring the VRAM usage down to 24 GB?

@Passenger12138 (Owner)

If your hardware supports the FP8 format, you can certainly use FP8 for training and adopt mixed precision, where the large model operates in BF16 and the LoRA module in FP8, to reduce memory usage. However, it's important to note that the LoRA module itself has relatively few parameters, so using FP8 may not yield significant benefits in this case.
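As a rough illustration of the setup described above, here is a minimal sketch (not this repository's actual training code) using Hugging Face transformers and peft: the frozen base model is loaded in BF16 and a LoRA adapter holds the only trainable weights. The model id and `target_modules` names are placeholders; casting the adapter itself to FP8 would additionally require hardware and library support (e.g. NVIDIA Transformer Engine), and, as noted, the savings are small because LoRA has few parameters.

```python
# Minimal sketch, assuming a Hugging Face causal LM fine-tuned with peft.
# The model id and target module names below are placeholders.
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained(
    "your/base-model",            # placeholder model id
    torch_dtype=torch.bfloat16,   # frozen base weights stay in BF16
)

lora_cfg = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],  # adjust to the model's attention layers
    lora_dropout=0.05,
)
model = get_peft_model(base, lora_cfg)
model.print_trainable_parameters()  # LoRA adds only a small fraction of parameters

# Running the LoRA weights in FP8 would need supported hardware and a library
# such as NVIDIA Transformer Engine; since the adapter is small, the memory
# saved over BF16 is usually marginal.
```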
