GitHub - rochelleterman/worlds-women: Analysis of US media's representation of women worldwide

This project:

Collects articles about women from the New York Times and Washington Post, 1980-2014
Categorizes each article by country + region
Uses Stanford's Named Entity Recognizer to remove proper nouns from article texts
Uses STM (R package) to analyze topical trends in the corpus over time and across region
Compare coverage across region using word separating alogrithms and other techniques.
Conducts statistical analysis regressing number of documents and mean topic distributions on country level variables (note the country level dataset is not included in this repo)

Name		Name	Last commit message	Last commit date
Latest commit History 127 Commits
NYT-scraping		NYT-scraping
Results		Results
Scraps		Scraps
smd_equation		smd_equation
.gitignore		.gitignore
01_clean-and-categorize.R		01_clean-and-categorize.R
02_NER.ipynb		02_NER.ipynb
03_stm_estimate.R		03_stm_estimate.R
04_stm_analysis.R		04_stm_analysis.R
05_alt-DV.R		05_alt-DV.R
06_country-year.R		06_country-year.R
07_same-topic-different-region.R		07_same-topic-different-region.R
08_descriptive.R		08_descriptive.R
09_model-tests.R		09_model-tests.R
10_model.Rmd		10_model.Rmd
LICENSE		LICENSE
README.md		README.md
country_codes.csv		country_codes.csv
interaction_plots.R		interaction_plots.R
ner.sh		ner.sh

Provide feedback