Document-Scanner and Text-Extractor

Document-Scanner

This is a simple a document scanner, made with very popular library for computer vision OpenCV, then I have used scikit-image for giving a black and white touch to the file. The process is to first resizing the image to a desired height, converting the image to grayscale, then finding all contours present in the file.

There is a assumption that the numbers of the sides in the piece of paper to be scanned is equal to four.

Then, we get the top-view of the image and atlast, we give it a black and white touch.

Text-Extractor

We will take the scanned image and then use the pytesseract library to extract from the scanned image.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
__pycache__		__pycache__
images		images
Output Image.PNG		Output Image.PNG
README.md		README.md
output.txt		output.txt
scan.py		scan.py
text_extractor.py		text_extractor.py
transform.py		transform.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Document-Scanner and Text-Extractor