Soft Labels in SRe2L #26
Dear Authors,
Thank you for your excellent work. I have a question regarding the use of additional soft labels in SRe2L: I am concerned that this might introduce unfairness and conflict with the objectives of dataset distillation (DD).
I look forward to your response. Once again, thank you for your outstanding contribution.

Hi @qwrazdf Thanks for your interest in our work. We emphasize that dataset distillation (DD) includes both the images and their corresponding soft or hard labels as integral components. Our soft labels are independent of the teacher model during post-evaluation and actual usage, so no information from the original dataset is involved at that stage. This is why we use FKD rather than conventional KD to generate the soft labels. As a result, our setup is entirely fair and reasonable.

Thank you for your prompt response. Could this approach be understood as trading some storage cost for downstream training efficiency and outstanding performance?

Hi @qwrazdf Yes, you can think of it that way, but the storage cost of the soft labels can be significantly reduced; several label compression/quantization strategies are discussed in the FKD paper. Of course, in the DD scenario, this may cause some performance degradation.
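The label compression/quantization mentioned in the discussion can be illustrated with a minimal sketch. This is not the authors' implementation; it only assumes the common idea of storing a soft label as its top-k probabilities (here quantized to float16) instead of the full class-probability vector:

```python
import numpy as np

def compress_soft_label(probs: np.ndarray, k: int = 5):
    """Keep only the top-k probabilities of a soft label (illustrative sketch).

    Returns (indices, values): storage shrinks from num_classes float32
    values to k int16 indices plus k float16 values.
    """
    idx = np.argsort(probs)[-k:]           # indices of the k largest probabilities
    vals = probs[idx].astype(np.float16)   # quantize kept values to half precision
    return idx.astype(np.int16), vals

def decompress_soft_label(idx, vals, num_classes: int):
    """Rebuild a dense soft label; the dropped mass is spread uniformly."""
    dense = np.zeros(num_classes, dtype=np.float32)
    dense[idx] = vals.astype(np.float32)
    leftover = max(0.0, 1.0 - float(dense.sum()))
    mask = np.ones(num_classes, dtype=bool)
    mask[idx] = False
    dense[mask] = leftover / max(1, num_classes - len(idx))
    return dense

# Example: a 1000-class soft label stored as 5 indices + 5 fp16 values.
rng = np.random.default_rng(0)
logits = rng.normal(size=1000)
probs = np.exp(logits) / np.exp(logits).sum()
idx, vals = compress_soft_label(probs, k=5)
restored = decompress_soft_label(idx, vals, 1000)
```

Dropping the non-top-k mass is exactly the kind of lossy step the reply alludes to: it saves storage but can degrade performance in the DD setting.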