Add BLiMP task #72

jumelet · 2022-01-24T15:46:13Z

One thing I was unsure about is how to split up model performance on individual subtasks: within BLiMP it would be a bit odd to just merge all accuracies together into a single number, but I can imagine that given the scale of different datasets that are considered we don't necessarily want to split up tasks into subtasks as well.

However, if we would want that split to be present as well I can easily add it. Can the self.metrics dictionary contain any kind of entry?

Add BLiMP task

3a8de2d

jumelet mentioned this pull request Jan 24, 2022

Add BLiMP to Full Benchmark #22

Open

Remove slice from task_names

660bd1c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add BLiMP task #72

Add BLiMP task #72

jumelet commented Jan 24, 2022

Add BLiMP task #72

Are you sure you want to change the base?

Add BLiMP task #72

Conversation

jumelet commented Jan 24, 2022