Skip to content
@comp-strat

Computational Stratification: Data-Intensive Social Science for Inequality & Policy

Computational social science of stratification & policy--esp. education, organizations, politics--and human costs, using web-scraping, text analysis, & modeling

Pinned Loading

  1. web_scraping web_scraping Public

    Code and data for the research team scraping charter websites using scrapy, requests, Selenium, and wget with Python, shell, and Docker. This is the foundation of analyses into charter schools' lin…

    Python 11 8

  2. scraping_server scraping_server Public

    code for universal web-crawling UI

    JavaScript 3

  3. web-crawling-ic2s2-2022 web-crawling-ic2s2-2022 Public

    Forked from jhaber-zz/web-crawling-intro-2021

    An introduction to web-crawling/scraping for beginners with some Python know-how. Created for IC2S2 Summer 2022 by Jaren Haber, PhD

    Jupyter Notebook 3 3

  4. sorting-schools-2020 sorting-schools-2020 Public

    Replication code for "Sorting Schools: A Computational Analysis of Charter School Identities and Stratification" research article by Jaren Haber, UC Berkeley. Paper investigates the relationships b…

    Jupyter Notebook 1 1

  5. text_analysis text_analysis Public

    Code and data for research team that does text analysis: word counts, word embeddings, topic models, parsing HTML, unsupervised clustering, etc.

    Jupyter Notebook 3 3

  6. data_management data_management Public

    Code for managing large data sets in Python, usually with Pandas. These scripts mostly merge, filter, inspect, and count things. Developed for a charter school database of 10K+ units based on web-c…

    Jupyter Notebook 1

Repositories

Showing 10 of 10 repositories
  • web_scraping Public

    Code and data for the research team scraping charter websites using scrapy, requests, Selenium, and wget with Python, shell, and Docker. This is the foundation of analyses into charter schools' linguistic strategies and social implications.

    comp-strat/web_scraping’s past year of commit activity
    Python 11 8 0 10 Updated Feb 8, 2023
  • scrape_obituaries Public

    Code for scraping obituaries from Legacy.com. 3 steps: scrape URLs & paragraphs then extract age, sex, & race

    comp-strat/scrape_obituaries’s past year of commit activity
    Jupyter Notebook 9 GPL-3.0 1 1 0 Updated Dec 9, 2022
  • scraping_server Public

    code for universal web-crawling UI

    comp-strat/scraping_server’s past year of commit activity
    JavaScript 3 0 0 4 Updated Jul 31, 2022
  • web-crawling-ic2s2-2022 Public Forked from jhaber-zz/web-crawling-intro-2021

    An introduction to web-crawling/scraping for beginners with some Python know-how. Created for IC2S2 Summer 2022 by Jaren Haber, PhD

    comp-strat/web-crawling-ic2s2-2022’s past year of commit activity
    Jupyter Notebook 3 GPL-3.0 7 0 0 Updated Jul 20, 2022
  • edunomics_arrays Public

    Arrays of school level spending across student poverty/disadvantage for Edunomics Lab. DC and maybe other states/districts.

    comp-strat/edunomics_arrays’s past year of commit activity
    Jupyter Notebook 0 MIT 0 0 0 Updated Apr 4, 2022
  • text_analysis Public

    Code and data for research team that does text analysis: word counts, word embeddings, topic models, parsing HTML, unsupervised clustering, etc.

    comp-strat/text_analysis’s past year of commit activity
    Jupyter Notebook 3 3 0 0 Updated Oct 28, 2021
  • data_management Public

    Code for managing large data sets in Python, usually with Pandas. These scripts mostly merge, filter, inspect, and count things. Developed for a charter school database of 10K+ units based on web-crawling and federal data sources (CCD, ACS, etc).

    comp-strat/data_management’s past year of commit activity
    Jupyter Notebook 1 0 0 0 Updated Apr 20, 2021
  • sorting-schools-2020 Public

    Replication code for "Sorting Schools: A Computational Analysis of Charter School Identities and Stratification" research article by Jaren Haber, UC Berkeley. Paper investigates the relationships between charter school and school district poverty & race, on one hand, and school ideology and academic performance, on the other.

    comp-strat/sorting-schools-2020’s past year of commit activity
    Jupyter Notebook 1 MIT 1 0 0 Updated Apr 6, 2021
  • geospatial Public

    Code that examines geographic patterns in charter school proliferation, size, performance, and especially ideology within race- and class-structured school districts and Census tracts. Key packages include matplotlib, folium, and geoplotlib.

    comp-strat/geospatial’s past year of commit activity
    HTML 1 MIT 0 0 0 Updated Apr 16, 2019
  • scrapy-cluster Public Forked from istresearch/scrapy-cluster

    This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.

    comp-strat/scrapy-cluster’s past year of commit activity
    Python 0 MIT 328 0 0 Updated Dec 3, 2018

Top languages

Loading…

Most used topics

Loading…