Semantic segmentation with MobileNetV3

This repository contains the code for training of MobileNetV3 for segmentation as well as default model for classification. Every module here is subject for subsequent customizing.

Content

Requirements
Quick setup and start
- Preparations
- Run
CNN architectures
Loss functions
Augmentations
Training
Convert to TensorFlow Lite
Pretrained models
Projects use the MobileNetV3-segm model implementation

Requirements

Machine with an NVIDIA GPU
NVIDIA driver >= 418
CUDA >= 10.1
Docker >= 19.03
NVIDIA Container Toolkit (https://github.com/NVIDIA/nvidia-docker)

Quick setup and start

Preparations

Clone the repo, build a docker image using provided Makefile and Dockerfile.
```
git clone 
make build
```

The final folder structure should be:

Semantic-segmentation-with-MobileNetV3
├── data
├── notebooks
├── modules
├── train
├── Dockerfile
├── Makefile
├── requirements.txt
├── README.md

Run

The container could be started by a Makefile command. Training and evaluation process was made in Jupyter Notebooks so Jupyter Notebook should be started.
```
make run
jupyter notebook --allow-root
```

CNN architectures

MobileNetV3 backnone with Lite-RASSP modules were implemented. Architecture may be found in modules/keras_models.py

Loss functions

F-beta and FbCombinedLoss (F-beta with Cross Entropy) losses were implemented. Loss functions may be found in modules/loss.py

Augmentations

There were implemented the following augmentations: Random rotation, random crop, scaling, horizontal flip, brightness, gamma and contrast augmentations, Gaussian blur and noise.

Details of every augmentation may be found in modules/segm_transforms.py

Training

Training process is implemented in notebooks/train_mobilenet.ipynb notebook.

Provided one has at least PicsArt AI Hackathon dataset and Supervisely Person Dataset it is only needed to run every cell in the notebook subsequently.

Convert to TensorFlow Lite

To successfully convert this version of MobileNetV3 model to TFLite optional argument "training" must be removed from every batchnorm layer in the model and after that pretrained weights may be loaded and notebook cells for automatic conversion may be executed.

notebooks/convert2tflite.ipynb notebook contains model conversion sample scripts with and without quanization.

Pretrained models

Only person segmentation datasets were used for training models in this project: PicsArt AI Hackathon dataset and Supervisely Person Dataset.

Trained Keras model (input size 224x224 px) may be found here.

Trained model converted to a TensorFlow Lite FlatBuffer may be found here.

The same model but quantized after training may be downloaded via this link.

Note: The model was trained with TF2.0, so, it may contain some bugs as compared with the current TF version.

Projects use the MobileNetV3-segm model implementation

Real-time CPU person segmentation in video calls: repo

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Semantic segmentation with MobileNetV3

Content

Requirements

Quick setup and start

Preparations

Run

CNN architectures

Loss functions

Augmentations

Training

Convert to TensorFlow Lite

Pretrained models

Projects use the MobileNetV3-segm model implementation

About

Releases

Packages

Contributors 3

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
modules		modules
notebooks		notebooks
train		train
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE.md		LICENSE.md
Makefile		Makefile
README.md		README.md
requirements.txt		requirements.txt

License

OniroAI/Semantic-segmentation-with-MobileNetV3

Folders and files

Latest commit

History

Repository files navigation

Semantic segmentation with MobileNetV3

Content

Requirements

Quick setup and start

Preparations

Run

CNN architectures

Loss functions

Augmentations

Training

Convert to TensorFlow Lite

Pretrained models

Projects use the MobileNetV3-segm model implementation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages