Creates near duplicates of the records in datasets by adding different kinds of noise. This system can be used to test the efficacy of an entity resolution framework.
A submodule fakedata
that generates fake data that resembles real data is also under development.
This project is inspired by and borrows heavily from the FEBRL project.