Prediction: Bias follow-up

How we assessed the accuracy of predictive policing software

The repository contains code and data supporting The Markup's investigation, "Predictive Policing Software Terrible At Predicting Crimes." Read our methodology to understand the context for the code and data in this repository.

For the analyses in this repository, we use Geolitica (formerly PredPol) prediction data from our previous investigation, Prediction: Bias, as well as crime reports obtained from the Plainfield, New Jersey police department (PD).

Using that data, the code in this repository carries out the following functions:

Join Geolitica's predictions to the crime reports we received from Plainfield PD for the same time period
Calculate the software's crime prediction success rate

Repository contents

File/Folder	Description
`Makefile`	Lists all steps taken to prepare data for analysis conducted in notebooks
`scripts/`	Folder contains code used to prepare data for analysis; takes files from `data/inputs` and `data/manual` as inputs and saves generated dataframes in `data/processed`
`notebooks/`	Folder contains analysis notebook

`data/` folder

File	Description
`inputs/plainfield_dosage.csv`	Dosage data received from Plainfield PD
`inputs/plainfieldpdnj-geojson-with-data.json`	Shapefile of Plainfield census block groups, with race and household income demographics
`inputs/plainfield_predictions_with_shifts.csv`	Geolitica predictions for Plainfield from 2018 to 2020
`manual/vague-addresses.csv`	Spreadsheet of manually checked addresses to exclude from analysis
`processed/crime_with_predictions.csv`	Crime reports with columns describing whether crime type could have been predicted by Geolitica software and whether crime reports had any associated predictions
`processed/predictions_with_crime_dosage.csv`	Geolitica predictions with columns describing whether crime report was generated during that prediction

Reproducibility

Reproducing the notebook’s calculations requires having Python 3.8 or greater installed on your computer and installing the Python libraries defined in this repository’s requirements.txt file, ideally in a Python virtual environment. To re-run the scripts and notebooks, use the command make reproduce. You can also open the notebooks in Jupyter and run them manually.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data		data
notebooks		notebooks
scripts		scripts
.gitignore		.gitignore
Makefile		Makefile
readme.md		readme.md
requirements.in		requirements.in
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Prediction: Bias follow-up

How we assessed the accuracy of predictive policing software

Repository contents

`data/` folder

Reproducibility

About

Releases

Packages

Languages

the-markup/investigation-geolitica-plainfield

Folders and files

Latest commit

History

Repository files navigation

Prediction: Bias follow-up

How we assessed the accuracy of predictive policing software

Repository contents

data/ folder

Reproducibility

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

`data/` folder

Packages