issue about MultiheadAttention #1
Comments
Thanks for your reminder. Because our tokens are sparse (only 29 to 98), the MultiheadAttention actually accounts for a very small part of our model. We use another toolkit (fvcore) to count the params and FLOPs; the FLOPs are 6.123G, 5.173G, and 3.988G for 98, 68, and 29 landmarks respectively. I will update the results. Moreover, we found that the main issue affecting the inference speed is that the interpolation code is not efficient. I modified the interpolation code yesterday, and the inference speed improved by 1.5×. I will update the code after testing. Thank you for finding this issue.
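For reference, a minimal, self-contained sketch of how fvcore's FlopCountAnalysis and parameter_count can be used for this kind of counting. The toy encoder, token count (98), and embedding width (256) below are illustrative assumptions, not the actual model from this repository; fvcore may also print warnings for ops it does not count (e.g. softmax).

```python
import torch
import torch.nn as nn
from fvcore.nn import FlopCountAnalysis, parameter_count

class ToyEncoder(nn.Module):
    """Stand-in for a transformer block over sparse landmark tokens (assumed sizes)."""
    def __init__(self, dim=256, heads=8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.mlp = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))

    def forward(self, x):
        x = x + self.attn(x, x, x)[0]   # self-attention with residual connection
        return x + self.mlp(x)          # feed-forward with residual connection

model = ToyEncoder().eval()
tokens = torch.randn(1, 98, 256)        # e.g. 98 landmark tokens of assumed width 256
with torch.no_grad():
    flops = FlopCountAnalysis(model, tokens)
    print(f"FLOPs : {flops.total() / 1e9:.3f} G")
    print(f"Params: {parameter_count(model)[''] / 1e6:.3f} M")
```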
This nice work sets a new record in face alignment, and I want to cite it in my paper. I have calculated the FLOPs and params on WFLW to be 6.110G and 13.134M for the 6-layer model, and 8.138G and 19.445M for the 12-layer model. Could you check their correctness or give detailed information for the 6-layer and 12-layer models?
I think your results are correct. Could you please give me your email? I will send the details through email, and we can discuss more via WeChat.
Hi, I just sent an email to the address provided in the paper, but there has been no response. I'm not sure whether you received it.
Hi, I want to cite your results for the 12-layer model on the WFLW subsets.
NME: 4.128, 6.988, 4.368, 4.023, 4.032, 5.005, 4.790
Thanks!
Hi, great work in face alignment!
However, I have a question about the params and FLOPs reported in the paper.
I have tried to run your code to count the params and FLOPs for the 6-layer and 12-layer models.
I guess your result comes from the thop tool, but it has a shortcoming with MultiheadAttention, which accounts for a major part of the Transformer, so the result in the paper may be wrong. Could you check this issue and update the real FLOPs if the error indeed exists?
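For context, thop skips nn.MultiheadAttention unless a custom counter is supplied through its custom_ops argument. Below is a hedged sketch of such a counter: the FLOP formula (Q/K/V and output projections plus the two attention matmuls, counted as MACs) and the tiny wrapper module are my assumptions rather than this repository's code, and whether thop additionally counts the attention's internal output projection depends on the thop version, so the result should be verified against your own runs.

```python
import torch
import torch.nn as nn
from thop import profile

def count_multihead_attention(m: nn.MultiheadAttention, inputs, output):
    # Assumed formula for self-attention with equal query/key/value length.
    q = inputs[0]
    L = q.shape[1] if m.batch_first else q.shape[0]   # sequence length
    E = m.embed_dim                                    # embedding width
    proj = 4 * L * E * E       # Q, K, V and output projections (MACs)
    attn = 2 * L * L * E       # Q K^T scores and attention-weighted sum of V (MACs)
    m.total_ops += torch.DoubleTensor([int(proj + attn)])

class TinyAttnBlock(nn.Module):
    """Stand-in module; the real 6-/12-layer model would go here."""
    def __init__(self, dim=256, heads=8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x):
        return self.attn(x, x, x)[0]

tokens = torch.randn(1, 98, 256)   # e.g. 98 landmark tokens of assumed width 256
macs, params = profile(TinyAttnBlock(), inputs=(tokens,),
                       custom_ops={nn.MultiheadAttention: count_multihead_attention})
print(f"MACs: {macs / 1e9:.4f} G, Params: {params / 1e6:.3f} M")
```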