PET results differ from those reported in the Hugging Face blog "How many data points is a prompt worth?" study #69

Open
luffycodes opened this issue Nov 22, 2021 · 1 comment

Comments


luffycodes commented Nov 22, 2021

For MNLI, the blog (https://huggingface.co/blog/how_many_data_points/) reports an accuracy of 0.83 for 1,000 data samples.

In the PET paper (https://arxiv.org/pdf/2001.07676.pdf, Table 1), the reported MNLI accuracy is 0.85 for 1,000 data samples.

I was wondering how the accuracy is reported in the PET paper.

timoschick (Owner) commented
Hi @luffycodes, the accuracy reported in the PET paper is exactly what you obtain using this library. You can check out the details of the "How many data points is a prompt worth?" study in their paper; one important difference from our experiments is that they

[...] run every experiment 4 times in order to reduce variance,

Also, I would assume that they used a different random selection of 1,000 training examples (but to verify this, you should reach out to the authors directly).
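To illustrate why those two choices can shift the reported number, here is a minimal, hypothetical sketch (not part of the PET codebase): `train_and_evaluate`, `mnli_train`, the seeds, and the subset size are all assumptions. Different seeds pick different 1,000-example training subsets, and averaging over several runs gives a different (and usually more stable) figure than a single run.

```python
# Hypothetical sketch (not part of PET): average accuracy over several runs,
# each trained on a different random 1,000-example subset of the training set.
import random
import statistics

def run_experiment(train_pool, seed, subset_size=1000):
    rng = random.Random(seed)
    # A different seed yields a different 1,000-example training subset.
    subset = rng.sample(train_pool, subset_size)
    # train_and_evaluate is a placeholder for a full PET training + eval run.
    return train_and_evaluate(subset)

# e.g. 4 runs, as in the "How many data points is a prompt worth?" study:
# accuracies = [run_experiment(mnli_train, seed) for seed in range(4)]
# print(statistics.mean(accuracies), statistics.stdev(accuracies))
```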
