My own implementation of ELC-BERT from the BabyLM Challenge, as described in the 2023 paper "Not all layers are equally as important: Every Layer Counts BERT" by Charpentier and Samuel.
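
The core idea of ELC-BERT is that each transformer layer reads not just the previous layer's output, but a learned weighted combination of the outputs of all earlier layers (including the embeddings). The snippet below is only a minimal sketch of that mechanism; the class name `ELCEncoder`, the use of `nn.TransformerEncoderLayer`, the uniform weight initialization, and the hyperparameters are illustrative assumptions, not this repository's code or the paper's exact formulation.

```python
# Minimal sketch of the ELC idea: each layer's input is a learned,
# softmax-normalized weighted sum of the embedding output and all
# preceding layer outputs. Names and hyperparameters are illustrative.
import torch
import torch.nn as nn


class ELCEncoder(nn.Module):
    def __init__(self, hidden_size: int = 256, num_layers: int = 4, num_heads: int = 4):
        super().__init__()
        self.layers = nn.ModuleList(
            nn.TransformerEncoderLayer(
                d_model=hidden_size, nhead=num_heads, batch_first=True
            )
            for _ in range(num_layers)
        )
        # One weight vector per layer i over its i+1 previous outputs
        # (embeddings + layers 0..i-1); zero init gives uniform weights
        # after softmax, chosen here only for simplicity.
        self.layer_weights = nn.ParameterList(
            nn.Parameter(torch.zeros(i + 1)) for i in range(num_layers)
        )

    def forward(self, embeddings: torch.Tensor) -> torch.Tensor:
        outputs = [embeddings]  # index 0: embedding output
        for layer, weights in zip(self.layers, self.layer_weights):
            probs = torch.softmax(weights, dim=0)
            # Weighted combination of every previous output becomes this layer's input.
            mixed = sum(p * h for p, h in zip(probs, outputs))
            outputs.append(layer(mixed))
        return outputs[-1]


if __name__ == "__main__":
    enc = ELCEncoder()
    x = torch.randn(2, 8, 256)  # (batch, sequence, hidden)
    print(enc(x).shape)         # torch.Size([2, 8, 256])
```
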
# Bibliography
```bibtex
@inproceedings{georges-gabriel-charpentier-samuel-2023-layers,
    title = "Not all layers are equally as important: Every Layer Counts {BERT}",
    author = "Georges Gabriel Charpentier, Lucas and
      Samuel, David",
    editor = "Warstadt, Alex and
      Mueller, Aaron and
      Choshen, Leshem and
      Wilcox, Ethan and
      Zhuang, Chengxu and
      Ciro, Juan and
      Mosquera, Rafael and
      Paranjabe, Bhargavi and
      Williams, Adina and
      Linzen, Tal and
      Cotterell, Ryan",
    booktitle = "Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning",
    month = dec,
    year = "2023",
    address = "Singapore",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.conll-babylm.20",
    doi = "10.18653/v1/2023.conll-babylm.20",
    pages = "238--252",
}
```