Visionbased Multipage Classifier

Overview

This repository contains the code and documentation for my master's thesis project titled "Visuelle Klassifizierung mehrseitiger Dokumente ohne OCR". The project, developed in collaboration with lector.ai GmbH, explores the use of self-attention in image-based transformer architectures for multi-page document understanding, specifically for the separation and classification of document stacks.

Usage Guide

Training and Evaluation:
- To train one of the architectures, please customise the train.py according to your needs and implement your own data source
Trained Architecture:
- To use one of the trained models, simply load it with the LightningModule class and refer to the desired checkpoint

Contact

For questions or suggestions, you can reach me at [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
multipage_classifier		multipage_classifier
training		training
.gitignore		.gitignore
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Visionbased Multipage Classifier

Overview

Contents

Usage Guide

Contact

About

Releases

Packages

Languages

johannesscheibe/visionbased-multipage-classifier

Folders and files

Latest commit

History

Repository files navigation

Visionbased Multipage Classifier

Overview

Contents

Usage Guide

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages