Deforestation risk prediction (EXPERIMENTS)

Experiments for the DSSGx-UK 2023 Deforestation project with UN-REDD.

Overview

This project provides predictive models and tools for understanding deforestation risk in the Brazilian Amazon, targeting predictions up to three years into the future. It was conceived and executed over three months, and it produced promising results and avenues for future work.

Models

We employed two primary models for our prediction tasks:

- A Deep Learning model using a UNet architecture, trained on the MapBiomas dataset and some auxiliary data.
- A Random Forest model.

These models were benchmarked against the Jurisdictional and Nested REDD+ (JNR) methodology, a widely recognized standard for evaluating deforestation risk. The models are designed to predict both short-term (one-year) and medium-term (three-year) deforestation risk, although the three-year predictions currently don't perform as robustly as the one-year predictions.

Features

- Google Earth Engine App: A user-friendly application to visualize the deforestation risk across various regions. App link
- Comprehensive Data Pipeline: Includes various jobs for data preparation, normalization, and feature engineering.
- UNet Model Pipeline: A dedicated pipeline for the deep learning UNet model that includes training, evaluation logging, and output generation.
- Feature Ablation: Insights into the feature importances identified by the Deep Learning model.
- Modular Code Base: Code for each part of the pipeline and models is maintained separately for better readability and usability.

Performance Metrics

For the deep learning model, a generalised (see below) F1 score of 0.4757 was achieved for three-year predictions, slightly underperforming the JNR benchmark of 0.5043. For one-year predictions, the F1 score stood at 0.3644 against the JNR score of 0.4372.

Next Steps

Future work could focus on increasing model capacity or sourcing more recent data to enhance performance.

The Continuous Precision and Recall Metrics

The repository provides a function that returns the normalized continuous precision and recall values used to validate the 5-year average benchmark and JNR.
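
To make the idea concrete, here is a minimal sketch of one common way to generalise precision and recall to continuous maps. It is an assumption for illustration, not necessarily the definition used in this repository: the overlap between prediction and ground truth is taken as the cell-wise minimum, then normalized by the predicted total (precision) or the observed total (recall). The function name `continuous_precision_recall` and the toy arrays are hypothetical.

```python
import numpy as np

def continuous_precision_recall(pred, truth, eps=1e-12):
    """Soft precision, recall and F1 for non-negative continuous maps.

    Assumed definition (illustration only): overlap = sum of cell-wise minima.
    """
    pred = np.asarray(pred, dtype=float)
    truth = np.asarray(truth, dtype=float)
    # Predicted deforestation that is matched by observed deforestation.
    overlap = np.minimum(pred, truth).sum()
    precision = overlap / (pred.sum() + eps)
    recall = overlap / (truth.sum() + eps)
    f1 = 2 * precision * recall / (precision + recall + eps)
    return precision, recall, f1

# Toy 2x2 grids of deforested area per cell (made-up values).
pred = np.array([[0.0, 2.0], [1.0, 1.0]])
truth = np.array([[0.0, 1.0], [3.0, 0.0]])
p, r, f1 = continuous_precision_recall(pred, truth)
```

With binary inputs this reduces to the usual precision and recall, which is why it can be read as a generalised F1 score.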

Benchmarks

JNR

Creating and generating predictions using JNR risk maps:
1. Create Input: Process and obtain maps of forest cover change (FCC) for the required years.
2. Run JNR: Run the JNR algorithm to obtain the risk maps.
3. Create Output: Create the deforestation prediction and ground truth maps using the risk maps.
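
The first step above can be illustrated with a small sketch that derives a forest cover change map from two binary forest masks. The class codes, the function name `forest_cover_change`, and the treatment of regrowth as non-forest are assumptions made for this example; the actual JNR scripts may encode the map differently.

```python
import numpy as np

# Hypothetical class codes, used only in this sketch.
NON_FOREST, DEFORESTED, FOREST = 0, 1, 2

def forest_cover_change(forest_start, forest_end):
    """Label each pixel by comparing forest masks at two dates."""
    forest_start = np.asarray(forest_start, dtype=bool)
    forest_end = np.asarray(forest_end, dtype=bool)
    # Everything starts as non-forest; regrowth is also left as non-forest here.
    fcc = np.full(forest_start.shape, NON_FOREST, dtype=np.uint8)
    fcc[forest_start & forest_end] = FOREST        # forest at both dates
    fcc[forest_start & ~forest_end] = DEFORESTED   # forest lost in between
    return fcc

start = np.array([[1, 1], [0, 1]])
end = np.array([[1, 0], [0, 1]])
fcc = forest_cover_change(start, end)
```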

5-year-avg Benchmark

The ART/TREES-inspired benchmark computes the average deforestation over a 5-year window and uses it to predict deforestation in the 3 subsequent years. It uses the MapBiomas dataset and has been validated on a 6x6 km validation grid.
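
A minimal sketch of this kind of benchmark is shown below. It assumes the input is a stack of five annual deforestation maps on the validation grid, and that the historical mean annual rate is simply repeated for each forecast year; the function name `five_year_average_forecast` and the toy data are hypothetical, and the repository's implementation may aggregate differently.

```python
import numpy as np

def five_year_average_forecast(annual_deforestation, horizon_years=3):
    """annual_deforestation: array of shape (5, H, W), deforested area per cell per year."""
    annual = np.asarray(annual_deforestation, dtype=float)
    if annual.shape[0] != 5:
        raise ValueError("expected exactly five annual maps")
    mean_rate = annual.mean(axis=0)
    # Assumption: the historical mean annual rate persists in every future year.
    return np.repeat(mean_rate[np.newaxis], horizon_years, axis=0)

# Toy history: every cell loses 1, 2, 3, 4, then 5 units in successive years.
history = np.stack([np.full((2, 2), v) for v in (1.0, 2.0, 3.0, 4.0, 5.0)])
forecast = five_year_average_forecast(history)
```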

Data Pipeline

The data_pipeline directory contains scripts and configurations for pre-processing, transforming, and sampling the data used in the experiments. It is designed to be modular and configurable, enabling the user to customize the data processing steps as needed.

Deep Learning Model - UNet

The pipeline for the UNet model is implemented in the folder model_pipeline and includes functionality for training, testing, and running feature ablation. The model is trained on the MapBiomas dataset, and all metrics are logged to wandb.
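
Feature ablation can be sketched generically as follows: one input channel at a time is zeroed, and the drop in a score is recorded. This is an illustrative assumption about the technique, not the code in model_pipeline; `feature_ablation`, `toy_model`, and `negative_mae` are hypothetical stand-ins for the trained UNet and its metric.

```python
import numpy as np

def feature_ablation(predict_fn, score_fn, features, target):
    """features: array (C, H, W). Returns the score drop when each channel is zeroed."""
    baseline = score_fn(predict_fn(features), target)
    drops = []
    for channel in range(features.shape[0]):
        ablated = features.copy()
        ablated[channel] = 0.0  # remove the information carried by one channel
        drops.append(baseline - score_fn(predict_fn(ablated), target))
    return np.array(drops)

def toy_model(x):
    # Stand-in for a trained model: depends only on channel 0.
    return 1.0 * x[0] + 0.0 * x[1]

def negative_mae(pred, target):
    # Higher is better, so a positive drop means the channel mattered.
    return -np.abs(pred - target).mean()

rng = np.random.default_rng(0)
features = rng.random((2, 4, 4))
target = toy_model(features)
drops = feature_ablation(toy_model, negative_mae, features, target)
```

A larger drop suggests the model relies more on that channel; here only channel 0 produces a non-zero drop.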

Additional Experiments

Data preprocessing:
- Downsample features: process the MapBiomas dataset over given time periods to get downsampled forest cover, deforestation, and forest edge density maps.
- Average values: script to run averaging over several tiles in parallel, while converting from 900 m^2 pixels to hectares.
Creating JNR Risk maps:
- Create Forest Cover Change Map: download, process, and obtain maps of forest cover change (FCC) for the years 2000-2022 using the Global Forest Change dataset.
- Generate JNR Risk Map: obtain maps of the spatial risk of deforestation and forest degradation following the methodology of REDD+.
k-Means Clustering: perform k-Means clustering on the PRODES data to obtain the deforested regions. The deforested/no-forest regions are segmented in deep blue, whereas the remaining forest cover stays green.
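
The downsampling and unit conversion in the data preprocessing steps above can be sketched as a block sum over a binary map. One 900 m^2 pixel (30 m x 30 m) equals 0.09 ha. The function name `downsample_to_hectares` and the tiny grid are hypothetical; at 30 m resolution, a 6 km cell would correspond to `factor=200`.

```python
import numpy as np

PIXEL_AREA_HA = 900 / 10_000  # one 900 m^2 pixel expressed in hectares (0.09 ha)

def downsample_to_hectares(binary_map, factor):
    """Sum a binary (0/1) map over factor x factor blocks and convert to hectares."""
    binary_map = np.asarray(binary_map)
    h, w = binary_map.shape
    if h % factor or w % factor:
        raise ValueError("map dimensions must be divisible by factor")
    # Split into blocks, then count the flagged pixels inside each block.
    blocks = binary_map.reshape(h // factor, factor, w // factor, factor)
    pixel_counts = blocks.sum(axis=(1, 3))
    return pixel_counts * PIXEL_AREA_HA

deforested = np.ones((4, 4), dtype=np.uint8)
deforested[0, 0] = 0  # one pixel that was not deforested
coarse = downsample_to_hectares(deforested, factor=2)
```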


DSSGxUK/s23_deforestation_exp
