This is the codebase of CollegeScoreCard.io, which is built with Dash by Plotly, and powered (mostly) by the official CollegeScoreCard dataset. This is a work-in-progress, but may be useful to others.
-
The "requirements.txt" file is embarrassingly long. I plan to fix it later.
-
The official dataset is stored into two MySQL databases (or, two SQLite databases) following some processing steps. Here is the codebase to generate the databases.
-
I plan to publish the processing steps, once the related jupyter notebooks are clean enough! For now, you can find a short description of the processes and the variables here.
-
Most of the impotant data in CollegeScoreCard.io comes from the official sources, but errors might have crept in during my processing. Please do not make important life/financial decisions based on CollegeScoreCard.io or this codebase, at least for now.
-
I highly appreciate your feedback -- please do let me know if you catch mistakes in the data analysis, or catch bugs, or the general usability of the website.
- Dash user guide
- Dash/Plotly community forum
- Flask -- a Python-based web framework
- The Flask Mega-Tutorial by Miguel Grinberg
- Official CollegeScoreCard dataset
At collegescorecard.ed.gov, run by the US Department of Education, we can see how the US higher ed institutions fare on a variety of metrics. For example, one can find quantities such as the median debt of the graduates, the standardized test scores required to get in and so on. The goal of the official scorecard website is to help the higher ed stakeholders (e.g., students, parents, teachers, counselors etc.) make a quantitative understanding of the institutions. This is a fantastic goal, especially so, since the dataset that powers the website is publicly available!
While the official CollegeScoreCard website is a good starting point for viewing the dataset, I wanted better ways to analyze and compare different schools, and visualize the results more interactively. Having come across Dash by Plotly recently, I decided to build CollegeScoreCard.io to incorporate some of these ideas.
A special shoutout to the Plotly team for creating the amazing Dash library that makes CollegeScoreCard.io possible!