Unbalanced Taxonomy Instance Segmentation (Master's Thesis)

This is a code repository for my Master's thesis which explores different approaches to instance segmentation in computer vision, with a special focus on conventional deep learning models and models which use language embeddings. For more details and results check out thesis.pdf.

Classical instance segmentation models

In the first part of the thesis, a detailed evaluation of classical models such as Mask R-CNN and Mask2Former, trained on the TACO dataset for waste segmentation, was conducted. The results show that Mask R-CNN provides solid performance, while Mask2Former encounters challenges with contextual understanding of small and visually similar waste objects. The models were then modified with a module for dynamic class weights balancing of the segmentation loss. The technique is based on class frequencies and evaluation recall. The experiments were run again ot explore possible improvements.

Mask R-CNN Architecture

Mask2Former Architecture

Instance segmentation model with language embeddings

In the second part of the thesis, the FC-CLIP model with language embeddings was investigated. It combines visual and textual information for zero-shot instance segmentation. FC-CLIP uses the CLIP model as it's backbone and demonstrated interesting results in instance segmentation with richer hand-created prompts, suggesting the need for further adjustments and exploration.

Name		Name	Last commit message	Last commit date
Latest commit History 93 Commits
data		data
logs		logs
models		models
modules		modules
notebooks		notebooks
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
build_datasets.sh		build_datasets.sh
fcclip.webp		fcclip.webp
m2f.jpg		m2f.jpg
mrcnn.png		mrcnn.png
requirements.txt		requirements.txt
setup.sh		setup.sh
taco.webp.avif		taco.webp.avif
thesis.pdf		thesis.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Unbalanced Taxonomy Instance Segmentation (Master's Thesis)

Classical instance segmentation models

Mask R-CNN Architecture

Mask2Former Architecture

Instance segmentation model with language embeddings

FC-CLIP Architecture

About

Releases

Packages

Languages

License

rejsafranko/Unbalanced-Taxonomy-Instance-Segmentation

Folders and files

Latest commit

History

Repository files navigation

Unbalanced Taxonomy Instance Segmentation (Master's Thesis)

Classical instance segmentation models

Mask R-CNN Architecture

Mask2Former Architecture

Instance segmentation model with language embeddings

FC-CLIP Architecture

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages