Python repository for a small terrestrial mammals classifier

Python repository for training, prediction, and evaluation of a small terrestrial mammal classifier

Description

This Python project contains all steps to build a machine-learning classifier designed to recognize and classify small terrestrial mammal calls, including species groups like rodents, voles, and shrews. Also, scripts for prediction and evaluation are available, i.e. using the provided models. Built with Python, this tool can be used by researchers and enthusiasts for detecting calls in Passive Acoustic Monitoring (PAM) audio files, understanding habitat diversity, and aiding conservation efforts.

Components

Species Detection: Identify calls of various small terrestrial mammals in audio files.
Pre-trained Model: Use a pre-trained model to get started immediately or train your own model.
Audio Preprocessing: Automatically creates audio chunks from long audio files for easier postprocessing.
Easy Integration: Simple to use with any Python environment and customizable for specific use cases.

Installation

To get started with the Small Terrestrial Mammals Classifier, clone the repository and install the required dependencies.

Execute these lines on a Linux OS:

# Make sure to have relevant packages installed
$ sudo apt-get install git  && sudo apt-get install python3-venv

# Clone the repository to your desired directory
$ cd /path/to/projects && git clone https://github.com/riffelmaria/smallmammalsclassifier.git

# Change directory to the project
$ cd smallmammalsclassifier

# Pull repository
$ git pull

# Install a Python venv
$ python3.11 -m venv /path/to/.venv

# Activate the venv
$ source /path/to/.venv/bin/activate

# Adjusting local variables ...?

# Install Python project ...
$ pip install -e .

# Now, all functions are ready to use!

And these lines on a Windows OS:

# Make sure to have relevant packages installed
$ winget install --id Git.Git -e --source winget
$ winget install -e --id Python.Python.3.11 --scope machine

# Clone the repository to your desired directory
$ cd C:\path\to\projects && git clone https://github.com/riffelmaria/smallmammalsclassifier.git

# Change directory to the project
$ cd smallmammalsclassifier

# Make sure to have the virtual environment (venv) installed
$ pip install virtualenv    

# Create a new Python venv
$ py -m venv C:\path\to\new\virtual\environment\myvenv

# Activate the venv in PowerShell
$ C:\path\to\new\virtual\environment\myvenv\Scripts\Activate.ps1

# Alternatively, activate the venv in cmd.exe
$ C:\path\to\new\virtual\environment\myvenv\Scripts\activate.bat

# Adjusting local variables ...?

# Install Python project ...
$ pip install -e .

# Now, all functions are ready to use!

Usage

After installing, you can use all the functions in the project and the classifier on your data. Here's a quick example of how to get started with a CLI:

# Split audio data for prediction into clips
$ AudioSnippetWriter [base_path] [--snippet_folder] [--duration]

# Classify data with a chosen model (detemined by the model_path)
$ MLClassification [model_path] [folder_path] [threshold] [--workers]

# Evaluate performance of chosen model on test dataset
# ! Important: classify the test dataset before evaluation!
$ EvaluateTestDF [model_path] [--testdf_path] [--no-display]

Dataset

The dataset used for training the provided models includes audio data of various small terrestrial mammals. The dataset itself is not included in this repository for storage and licensing reasons. However, you can find available data under BTO (British Trust of Ornithology) and ChiroVox. Additionally, if you need the data I annotated with a coarse class scale (small terrestrial mammals, noise, and bats), please contact me at [email protected] or the Chair of Geobotany at the University of Freiburg, Germany.

Please make sure to follow the annotation scheme as in ./src/shared/groups_minimal.py or ./src/shared/genera.py, or simply add your own annotations to match the class scales.

To train the model on your own dataset, please confirm that your audio and Raven data can be found in the same folder, like so:

dataset/
├── species_1/
│   ├── audio1.WAV
│   ├── audio1.Table.1.selections.txt
│   ├── audio2.WAV
│   ├── audio2.Table.1.selections.txt
├── species_2/
│   ├── audio1.WAV
│   ├── audio1.Table.1.selections.txt
│   ├── audio2.WAV
│   ├── audio2.Table.1.selections.txt
...

Models

The models provided are built using the OpenSoundscape library (internally uses PyTorch). They are trained on an audio dataset of small terrestrial mammal calls. The model G2-1-long achieves approximately 25% target recall with an F1-Score of 0.23 in identifying the species groups of small terrestrial mammals. The model G2-6-long achieves approxiamtely 79% target recall. You can find deailt for each model in "resources".

If you want to train your own model, please adjust the parameter with the SetParameters script before executing train.py.

Contributing

Contributions are welcome :) If you have ideas for improving the code, the classifier, adding new species, or enhancing recall, precision, or F1, please submit a pull request.

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
resources/models		resources/models
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Python repository for a small terrestrial mammals classifier

Contents

Description

Components

Installation

Usage

Dataset

Models

Contributing

License

About

Releases

Packages

Languages

License

armsto88/smallmammalsclassifier

Folders and files

Latest commit

History

Repository files navigation

Python repository for a small terrestrial mammals classifier

Contents

Description

Components

Installation

Usage

Dataset

Models

Contributing

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages