Code to analyse accepted papers at COLT 2023
Link to accepted papers - https://learningtheory.org/colt2023/accepted-papers.html
The file data/raw_data.txt contains raw data copied from the above link. Each paper is represented by 2 lines: the first line is the title of the paper and the second line is the list of its authors' names.
There are 330 lines in total in data/raw_data.txt, which means that 330/2 = 165 papers were accepted at COLT 2023.
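The two-lines-per-paper layout described above can be parsed along the following lines. This is only a minimal sketch, not the repository's actual code: the function name `parse_raw_lines`, the comma as author separator, the zero-based `paper_id`, and the `None` placeholder for affiliations are all assumptions made for illustration.

```python
from typing import Dict, List


def parse_raw_lines(lines: List[str]) -> List[Dict]:
    """Pair up (title, authors) lines into one record per paper."""
    # Assumption: blank lines carry no meaning and can be dropped.
    cleaned = [line.strip() for line in lines if line.strip()]
    if len(cleaned) % 2 != 0:
        raise ValueError("expected an even number of lines (title + authors per paper)")

    papers = []
    for paper_id, start in enumerate(range(0, len(cleaned), 2)):
        title, author_line = cleaned[start], cleaned[start + 1]
        # Assumption: author names on the second line are comma-separated.
        names = [name.strip() for name in author_line.split(",") if name.strip()]
        papers.append({
            "paper_id": paper_id,  # assumption: zero-based index in the raw data
            "paper_title": title,
            # Affiliations are not in the raw data, so they are left empty here.
            "authors": [{"author_name": n, "affiliation_name": None} for n in names],
        })
    return papers
```

Passing the lines of data/raw_data.txt (for example via `open("data/raw_data.txt").read().splitlines()`) should give one record per pair of lines, so 330 lines would produce 165 records.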
The structured data stores each paper in the following JSON format
{
  "paper_id": Index of the paper in the raw data,
  "paper_title": Title of the paper,
  "authors": [
    {
      "author_name": Name of the author,
      "affiliation_name": Name of the author's affiliation
    }
  ]
}
Note - the data has been partially cleaned
Results of basic analysis
- There are 165 accepted papers
- There are 438 unique authors
- There are 131 unique affiliations
- Steve Hanneke is the author with the highest number of accepted papers. He has 7 accepted papers.
- Massachusetts Institute of Technology (MIT) is the affiliation with the highest number of authors. It has 38 authors.
- Massachusetts Institute of Technology (MIT) is the affiliation with the highest number of papers. It has 29 papers.
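The headline numbers above could be computed from the structured records roughly as follows. This is a hedged sketch rather than the code used for this analysis: `basic_stats` is a hypothetical helper, authors are identified by name only, and the input is assumed to be a non-empty list of records in the JSON format shown earlier.

```python
from collections import Counter, defaultdict


def basic_stats(papers):
    """Compute headline counts from a non-empty list of paper records."""
    papers_per_author = Counter()
    authors_per_affiliation = defaultdict(set)
    papers_per_affiliation = defaultdict(set)

    for paper in papers:
        seen_names = set()  # avoids double counting a name repeated on one paper
        for author in paper["authors"]:
            name = author["author_name"]
            affiliation = author["affiliation_name"]
            if name not in seen_names:
                papers_per_author[name] += 1
                seen_names.add(name)
            authors_per_affiliation[affiliation].add(name)
            papers_per_affiliation[affiliation].add(paper["paper_id"])

    def largest(groups):
        # Return the key with the most members, together with its size.
        key, members = max(groups.items(), key=lambda kv: len(kv[1]))
        return key, len(members)

    return {
        "num_papers": len(papers),
        "num_authors": len(papers_per_author),
        "num_affiliations": len(authors_per_affiliation),
        "top_author": papers_per_author.most_common(1)[0],
        "top_affiliation_by_authors": largest(authors_per_affiliation),
        "top_affiliation_by_papers": largest(papers_per_affiliation),
    }
```

Because authors are matched by name, two different people with the same name would be merged, and one person listed under two spellings would be counted twice.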
Distribution of authors by number of accepted papers
- Number of authors with 1 accepted paper - 373
- Number of authors with 2 accepted papers - 54
- Number of authors with 3 accepted papers - 4
- Number of authors with 4 accepted papers - 3
- Number of authors with 6 accepted papers - 3
- Number of authors with 7 accepted papers - 1
There are 11 authors with 3 or more papers.
- Steve Hanneke - 7 papers
- Yury Polyanskiy - 6 papers
- Shay Moran - 6 papers
- Ilias Diakonikolas - 6 papers
- Constantinos Daskalakis - 4 papers
- Dylan J Foster - 4 papers
- Noah Golowich - 4 papers
- Nikita Zhivotovskiy - 3 papers
- Nikos Zarifis - 3 papers
- Alexander S Wein - 3 papers
- Kaiqing Zhang - 3 papers
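A distribution and a "3 or more" list like the ones above can both be derived from a single mapping of author name to paper count. The sketch below assumes such a mapping already exists; `count_distribution` and `at_least` are hypothetical helper names, not functions from this repository.

```python
from collections import Counter


def count_distribution(counts):
    """Map each count value to the number of keys that have it, sorted by value."""
    return dict(sorted(Counter(counts.values()).items()))


def at_least(counts, threshold):
    """List (key, count) pairs with count >= threshold, largest count first."""
    selected = [(key, n) for key, n in counts.items() if n >= threshold]
    # Ties are broken alphabetically so the output order is deterministic.
    return sorted(selected, key=lambda kv: (-kv[1], kv[0]))
```

The same two helpers apply unchanged to authors per affiliation and papers per affiliation, since each of those is also a mapping from a name to a count.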
Distribution of affiliations by number of authors
- Number of affiliations with 1 author - 63
- Number of affiliations with 2 authors - 17
- Number of affiliations with 3 authors - 18
- Number of affiliations with 4 authors - 7
- Number of affiliations with 5 authors - 4
- Number of affiliations with 6 authors - 3
- Number of affiliations with 7 authors - 8
- Number of affiliations with 8 authors - 1
- Number of affiliations with 9 authors - 2
- Number of affiliations with 10 authors - 1
- Number of affiliations with 12 authors - 1
- Number of affiliations with 13 authors - 1
- Number of affiliations with 14 authors - 1
- Number of affiliations with 15 authors - 1
- Number of affiliations with 18 authors - 1
- Number of affiliations with 22 authors - 1
- Number of affiliations with 38 authors - 1
There are 10 affiliations with 9 or more authors
- Massachusetts Institute of Technology - 38 authors
- Google - 22 authors
- Stanford University - 18 authors
- University of Wisconsin-Madison - 15 authors
- Georgia Institute of Technology - 14 authors
- EPFL - 13 authors
- University of California, Berkeley - 12 authors
- Cornell University - 10 authors
- University of Texas at Austin - 9 authors
- Technion - Israel Institute of Technology - 9 authors
Distribution of affiliations by number of papers
- Number of affiliations with 1 paper - 76
- Number of affiliations with 2 papers - 18
- Number of affiliations with 3 papers - 13
- Number of affiliations with 4 papers - 4
- Number of affiliations with 5 papers - 4
- Number of affiliations with 6 papers - 3
- Number of affiliations with 7 papers - 3
- Number of affiliations with 8 papers - 3
- Number of affiliations with 9 papers - 2
- Number of affiliations with 11 papers - 2
- Number of affiliations with 13 papers - 2
- Number of affiliations with 29 papers - 1
There are 10 affiliations with 8 or more papers
- Massachusetts Institute of Technology - 29 papers
- Google - 13 papers
- Stanford University - 13 papers
- University of California, Berkeley - 11 papers
- University of Wisconsin-Madison - 11 papers
- Purdue University - 9 papers
- Georgia Institute of Technology - 9 papers
- Microsoft - 8 papers
- Unknown - 8 papers
- EPFL - 8 papers
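As a sanity check, the three distributions reported above can be compared against the totals in the basic analysis (438 unique authors, 131 unique affiliations) and against the sizes of the "or more" lists. The sketch below copies the numbers from this file; the variable and function names are chosen for illustration only.

```python
# Each dict maps a count value to the number of authors/affiliations with that count.
authors_by_paper_count = {1: 373, 2: 54, 3: 4, 4: 3, 6: 3, 7: 1}
affiliations_by_author_count = {
    1: 63, 2: 17, 3: 18, 4: 7, 5: 4, 6: 3, 7: 8, 8: 1, 9: 2,
    10: 1, 12: 1, 13: 1, 14: 1, 15: 1, 18: 1, 22: 1, 38: 1,
}
affiliations_by_paper_count = {
    1: 76, 2: 18, 3: 13, 4: 4, 5: 4, 6: 3, 7: 3, 8: 3, 9: 2, 11: 2, 13: 2, 29: 1,
}


def total(dist):
    """Total number of entities covered by a distribution."""
    return sum(dist.values())


def tail(dist, threshold):
    """Number of entities whose count is at least `threshold`."""
    return sum(n for value, n in dist.items() if value >= threshold)


assert total(authors_by_paper_count) == 438        # unique authors
assert total(affiliations_by_author_count) == 131  # unique affiliations
assert total(affiliations_by_paper_count) == 131   # unique affiliations
assert tail(authors_by_paper_count, 3) == 11       # authors with 3 or more papers
assert tail(affiliations_by_author_count, 9) == 10 # affiliations with 9 or more authors
assert tail(affiliations_by_paper_count, 8) == 10  # affiliations with 8 or more papers
```

All six checks pass with the figures listed in this file, so the distributions and the summary lists are mutually consistent.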