Generating set of counterfactuals within distance to optimum

This code started from an implementation of the paper "Efficient Search for Diverse Coherent Explanations" by Chris Russell. Only the implementation of the mixed polytope input encoding is used here.

The original code by Chris Russell can be found in the original repository.

He used logistic regression and focused on generating diverse explanations. This codebase uses his formulation of mixed-type inputs (polytopes) and his modelling of input changes relative to the original value of the factual. Other than that, the code is my own. The reused code was also significantly improved and documented to clarify what its various parts mean.

Instead of diverse coherent explanations, the focus here is on generating a set of counterfactuals closest to the original factual.
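To illustrate what "a set of counterfactuals within distance to optimum" means, here is a minimal sketch in plain Python. It is not the repository's method, which solves a MIP. The L1 distance, the `relative_gap` parameter and the function names are assumptions made for this example only.

```python
from typing import List, Sequence, Tuple


def l1_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Sum of absolute coordinate differences (assumed metric for this sketch)."""
    return sum(abs(x - y) for x, y in zip(a, b))


def within_distance_of_optimum(
    factual: Sequence[float],
    candidates: Sequence[Sequence[float]],
    relative_gap: float = 0.1,
) -> List[Tuple[float, List[float]]]:
    """Keep candidates whose distance is at most (1 + relative_gap) times the best one.

    Returns (distance, candidate) pairs sorted by distance.
    """
    scored = sorted(
        ((l1_distance(factual, c), list(c)) for c in candidates),
        key=lambda pair: pair[0],
    )
    if not scored:
        return []
    limit = scored[0][0] * (1 + relative_gap)
    return [(d, c) for d, c in scored if d <= limit]
```

For example, with a factual of `[0, 0]` and a `relative_gap` of 0.25, a candidate at distance 3 is dropped when the closest candidate is at distance 2.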

The provided examples use the adult dataset (included), or MNIST for the multi-class problem.

The code uses Gurobi as the MIP solver and the gurobi-machinelearning package for the NN computation.

There is also a custom NN implementation, using the methods presented by M. Fischetti and J. Jo in "Deep neural networks and mixed integer linear optimization". That implementation is faster but yields lower-quality solutions, because it generates duplicate counterfactuals. If that does not bother you, use the code in the custom_nn_implementation/ folder.
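For orientation, the ReLU encoding at the core of that paper can be written as below. This is a sketch of the published formulation with index names chosen here for illustration; the code in custom_nn_implementation/ may differ in details such as variable bounds.

```latex
% Unit j in layer k: weights w, bias b, output x >= 0, slack s >= 0, binary z.
\begin{align}
\sum_{i} w^{k-1}_{ij}\, x^{k-1}_{i} + b^{k-1}_{j} &= x^{k}_{j} - s^{k}_{j}, \\
x^{k}_{j} \ge 0, \quad s^{k}_{j} &\ge 0, \quad z^{k}_{j} \in \{0, 1\}, \\
z^{k}_{j} = 1 &\implies x^{k}_{j} \le 0, \\
z^{k}_{j} = 0 &\implies s^{k}_{j} \le 0.
\end{align}
```

The binary variable forces either the output or the slack to zero, so the output equals the maximum of zero and the pre-activation value.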

Encoder for data

The encoder explicitly targets the FICO dataset and makes a couple of simple assumptions about the form the dataset takes. Each variable is assumed to take a range of continuous values and a set of discrete values. As a simplifying assumption, all strictly negative values are treated as the discrete values, while the non-negative values are the continuous ones.
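The sign convention above can be sketched in a few lines of Python. This is a simplified illustration, not the repository's encoder: `split_column` and `encode_value` are hypothetical names, and the layout of the indicator vector is an assumption.

```python
from typing import List, Optional, Sequence, Tuple


def split_column(
    values: Sequence[float],
) -> Tuple[Optional[Tuple[float, float]], List[float]]:
    """Split a column into its continuous range and its discrete codes.

    Assumption from the text: strictly negative values are discrete codes,
    non-negative values belong to the continuous range.
    """
    continuous = [v for v in values if v >= 0]
    discrete = sorted({v for v in values if v < 0})
    bounds = (min(continuous), max(continuous)) if continuous else None
    return bounds, discrete


def encode_value(value: float, discrete: Sequence[float]) -> Tuple[float, List[int]]:
    """Encode one value as (continuous part, indicators).

    indicators[0] marks "value lies in the continuous range"; the remaining
    entries are a one-hot over the discrete codes. Exactly one entry is 1.
    """
    indicators = [0] * (len(discrete) + 1)
    if value >= 0:
        indicators[0] = 1
        return float(value), indicators
    # Raises ValueError for a negative code that was not seen in the column.
    indicators[1 + list(discrete).index(value)] = 1
    return 0.0, indicators
```

A column such as `[12, -9, 40, -8, 0, -9]` would then have the continuous range `(0, 40)` and the discrete codes `[-9, -8]`.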

If you wish to add an entirely discrete variable, i.e. one without a continuous range, its values should be indexed from zero. For example, in the adult dataset the 'workclass' variable takes the following values: {0: 'Government', -3: 'Other/Unknown', -2: 'Private', -1: 'Self-Employed'}

If this is not the case for your dataset, the code can be adapted to match your assumptions, but it is probably easier to manipulate the data so that it follows these assumptions. This manipulation has already been done for the adult dataset.
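One possible way to bring a purely categorical column into this form is sketched below. The coding rule (first category in sorted order gets 0, the rest get negative codes) is an assumption chosen because it reproduces the 'workclass' mapping shown above; the preprocessing actually applied to the adult dataset may differ, and `build_codes` and `remap` are hypothetical helpers.

```python
from typing import Dict, Iterable, List


def build_codes(categories: Iterable[str]) -> Dict[str, int]:
    """Map category names to integer codes: 0 for the first, negatives for the rest."""
    ordered = sorted(set(categories))
    n = len(ordered)
    return {name: (0 if i == 0 else i - n) for i, name in enumerate(ordered)}


def remap(column: Iterable[str], codes: Dict[str, int]) -> List[int]:
    """Replace every category name in the column by its integer code."""
    return [codes[value] for value in column]


raw = ["Private", "Government", "Self-Employed", "Other/Unknown", "Private"]
codes = build_codes(raw)
encoded = remap(raw, codes)
```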

Further contribution

The input encoder was improved from the work of Chris Russell. The handling of categorical variables is corrected, so the model now works well for categorical, numerical and mixed input features.

Objective functions

This repository also contains a couple of attempts at a utility function over the set of counterfactuals.

See example_objective.py for further details about the functions.

This is still a work-in-progress.

Master's Thesis

This repository is part of Jiří Němeček's Master's Thesis at FEE CTU in Prague.


Epanemu/counterfactual_explanations
