I know that SciBERT is pre-trained on the Semantic Scholar corpus. I also know that the Semantic Scholar corpus is not publicly available.
I am wondering how recent the papers in the pre-training data are. For example, are papers from ACL 2018 included?
The Semantic Scholar Corpus paper was published in 2018 or so, so I'm guessing that's right around the borderline between having a paper...