Skip to content

ChaitanyaYeole02/ocr-using-gcp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Optical Character Recognition using GCP

Optical Character Recognition (OCR for short) is a technique that converts digital images of text into machine-readable data. OCR is the process of converting scanned images of machine printed or handwritten text (numerals, letters, and symbols), into machine readable character streams, plain (e.g. text files) or formatted (e.g. HTML files). OCR is a field of research in pattern recognition, artificial intelligence and computer vision. It is a common method of digitizing printed texts.
OCR helps users by recognizing letters, numbers, and other written characters. What is even more special about this is that it can convert images and scanned paper documents into electronic data. These documents can be searchable through digital means, making for seamless access.

Block Diagram

ocr-using-gcp architecture

Methodology

  1. Set up the Google Vision API.
  2. After creating the JSON key file, set the environment variable as GOOGLE_APPLICATION_CREDENTIALS='/path_to_the_file/key.json'
  3. Run main.py using Python3.9

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages