Creating a document scanner and extracting text and numbers from the extracted document using python libraries

pranav1698/Document-Scanner-Extractor

Document-Scanner and Text-Extractor

Document-Scanner

This is a simple document scanner built with OpenCV, a popular computer vision library, with scikit-image used to give the result a black-and-white look. The process first resizes the image to a desired height, converts it to grayscale, and then finds all contours present in the image.

It assumes that the piece of paper to be scanned has exactly four sides.

Then we get a top-down view of the document and, finally, give it a black-and-white look.

Text-Extractor

We take the scanned image and use the pytesseract library to extract text and numbers from it.
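The extraction step might look something like the sketch below. `pytesseract.image_to_string` is the library's standard OCR call and needs the Tesseract binary installed on the system; the `extract_text` wrapper, the `extract_numbers` helper, and its regular expression are hypothetical additions for illustration, not the repository's code.

```python
import re


def extract_text(image):
    """Run Tesseract OCR on a scanned image (NumPy array, PIL image, or path)."""
    # Imported here so the number helper below can be used on its own,
    # even on a machine without pytesseract or Tesseract installed.
    import pytesseract

    return pytesseract.image_to_string(image)


def extract_numbers(text):
    """Pull unsigned integers and decimals out of OCR text as floats.

    Hypothetical helper: the pattern is deliberately simple and ignores
    signs, thousands separators, and locale-specific formats.
    """
    return [float(match) for match in re.findall(r"\d+(?:\.\d+)?", text)]
```

For example, `extract_numbers(extract_text(scanned))` would return the numeric values found in a scanned page, where `scanned` stands for the black-and-white image produced by the scanner step.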
