This repository re-implements *Fairness without Demographics through Adversarially Reweighted Learning* (ARL) in PyTorch. The goal was to reproduce the results from the paper and to extend ARL to image data.
For a guided end-to-end tour through our code, open results.ipynb. It contains the final results that we used for our report; you can also rerun parts of it or the entire notebook, and you can play around with many hyperparameters. The easiest way to run the notebook is to open it in Google Colab. Please note the instructions at the top of the notebook (you need to uncomment a cell if you're using Colab).

If you want to run the notebook locally, you need to install the right Python environment first (see below). The notebook can reproduce all of our results except for the grid search to find the optimal hyperparameters. If you want to run the grid search yourself, see below or the end of the notebook for instructions.
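For the local route, launching the notebook is a single command once the environment from the installation section below is active. A minimal sketch, assuming Jupyter is included in the provided environment (install it separately otherwise):

```bash
# Start a local notebook server and open the results notebook
# (assumes the fact-ai conda environment is active and ships Jupyter)
jupyter notebook results.ipynb
```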
/data
The datasets used in the experiments:
- Adult
- LSAC
- COMPAS
- EMNIST_35
- EMNIST_10
The tabular data is usable out of the box (though you can also recreate it from scratch; see below). If you want to use the image data (EMNIST), you need to download and preprocess it first (see below).
/paper_results
Contains the results achieved by the authors of the original ARL paper, in JSON format.
/job_scripts
SLURM job scripts used to create our results. These can be ignored.
/grid_search
Raw outputs, checkpoints, and logs of our grid search.
/training_logs
Raw outputs, checkpoints, and logs of scripts you run yourself.
/final_logs
Raw outputs, checkpoints, and logs of our final runs.
./
The root folder contains the code necessary to prepare the data, run all experiments, and analyse the results. For a guided tour we recommend checking out results.ipynb.
Execute the following commands to install the required packages and activate the environment. Note that to install on macOS, you need to remove the cudatoolkit package from the environment file or select a different available version.
```bash
conda env create -f environment.yml
conda activate fact-ai
```
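On macOS, the cudatoolkit edit mentioned above can also be scripted; a minimal sketch (hand-editing environment.yml works just as well, and the in-place sed flag below behaves the same on BSD and GNU sed):

```bash
# Drop the cudatoolkit line from the environment file (a .bak backup is kept),
# then create and activate the environment as usual
sed -i.bak '/cudatoolkit/d' environment.yml
conda env create -f environment.yml
conda activate fact-ai
```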
Simply run prepare_data.sh in the project root directory to download and preprocess the datasets. Alternatively, you can download the datasets yourself and then run python prepare_data.py (URLs and file paths can be found in the shell script).
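Both routes in shell form (the second assumes you have already placed the downloaded files where the script expects them; check prepare_data.sh for the exact URLs and paths):

```bash
# One step: download and preprocess everything
bash prepare_data.sh

# Two steps: after downloading the raw files yourself, run only the preprocessing
python prepare_data.py
```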
To execute grid searches for all models and datasets with default settings, run the following command:

```bash
python get_opt_hparams.py --num_workers 2
```
(You can of course adjust --num_workers and may also want to set --num_cpus.)
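For instance, on a machine with more cores you might raise both limits; the values below are illustrative, not tuned recommendations:

```bash
# Grid search with more parallelism; scale the numbers to your hardware
python get_opt_hparams.py --num_workers 4 --num_cpus 8
```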
The optimal hyperparameters will be saved to optimal_hparams.json. More details can be found at the end of the notebook.
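If you just want to peek at the selected values, Python's built-in JSON pretty-printer works:

```bash
# Pretty-print the saved hyperparameters
python -m json.tool optimal_hparams.json
```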