Harold Erbin, Riccardo Finotello, Bio Wahabou Kpera, Vincent Lahoche, Dine Ousmane Samary
Signal detection is one of the main challenges of data science.
As it often happens in data analysis, the signal in the data may be corrupted by noise.
There is a wide range of techniques aimed at extracting the relevant degrees of freedom from data.
However, some problems remain difficult.
It is notably the case of signal detection in almost continuous spectra when the signal-to-noise ratio is small enough.
This paper follows a recent bibliographic line which tackles this issue with field-theoretical methods.
Previous analysis focused on equilibrium Boltzmann distributions for some effective field representing the degrees of freedom of data.
It was possible to establish a relation between signal detection and
You can install all dependencies using the requirements.txt
list:
python -m venv venv/
source activate venv/bin/activate
pip install -r requirements.txt
Notice that the code depends crucially on py-pde
.
At the time of writing, python==3.10.12
was used for the development.
The full documentation is available at ReadTheDocs.
The documentation can be built using sphinx
(pip install sphinx sphinx_rtd_theme
):
sphinx-build -b html docs/source <build directory>
You will be able to open the file <build directory>/index.html
in your browser.
Though the easiest way to get started with the library is to use the script simulation_temp.py
(to scan using a single temperature parameter) or simulation_traj.py
(to scan using a different parametrization), we include two jupyter notebooks to show the basic usage of the library. In particular, we show the case of the analytical Marchenko-Pastur distribution and the case of the empirical distribution of the eigenvalues of a random matrix.
For more information on the scripts, you can use python <script> --help
to get the definition of all command line parameters. Notice that numerical results will be saved in a SQLite database (path provided by the user from command line): you should foresee a utility to explore such database for further processing.