OCR using pytesseract and OpenCV

This repository provides an example of how to perform Optical Character Recognition (OCR) in Python using the pytesseract and OpenCV libraries.

Overview

OCR is the process of electronically extracting text from images or scanned documents. In this repository, we use pytesseract and OpenCV to extract text from an image and save it to a text file.

Pytesseract is a Python wrapper for Google's Tesseract-OCR Engine, which can recognize a wide variety of fonts and languages. OpenCV is an open-source computer vision and machine learning library, used here for image processing tasks such as image enhancement and image segmentation.
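The flow described above (read an image, clean it up with OpenCV, recognize the text with pytesseract, save it to a text file) might look roughly like the sketch below. It is not the code in OCR.py. The function names, the Otsu thresholding step, and the choice to write the output next to the input image are all assumptions made for illustration.

```python
from pathlib import Path


def text_output_path(image_path):
    """Return the .txt path that sits next to the given image file."""
    return Path(image_path).with_suffix(".txt")


def image_to_text_file(image_path, lang="eng"):
    """Run OCR on one image and save the recognized text beside it."""
    # Imported here so the path helper above works even when the
    # OCR dependencies are not installed.
    import cv2
    import pytesseract

    image = cv2.imread(str(image_path))
    if image is None:
        # cv2.imread returns None instead of raising when a file cannot be read.
        raise FileNotFoundError(f"Could not read image: {image_path}")

    # Assumed cleanup step: grayscale plus Otsu thresholding often helps
    # on scanned pages, but the right preprocessing depends on the input.
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)

    # image_to_string accepts a NumPy array and returns the recognized text.
    text = pytesseract.image_to_string(binary, lang=lang)

    out_path = text_output_path(image_path)
    out_path.write_text(text, encoding="utf-8")
    return out_path
```

Under these assumptions, calling image_to_text_file("page.png") would write the recognized text to page.txt, where page.png is a placeholder file name.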

Setup

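To use this repository, you'll need Python installed on your computer, along with the Tesseract OCR engine itself, because pytesseract only wraps the Tesseract executable. You'll also need to install the following Python libraries: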

pytesseract
opencv-python
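Both libraries are available from PyPI under the names listed above. Note that opencv-python is imported as cv2, and that pytesseract calls a separately installed Tesseract program. A quick way to check the setup could be a small script like the following. The function name missing_requirements is hypothetical and not part of this repository.

```python
import importlib.util
import shutil


def missing_requirements(modules=("pytesseract", "cv2"), executable="tesseract"):
    """Return the names of requirements that could not be found."""
    missing = []
    for name in modules:
        # find_spec returns None when a top-level module is not importable.
        if importlib.util.find_spec(name) is None:
            missing.append(name)
    # pytesseract runs the Tesseract program, so this checks that it is on
    # PATH. A custom location can instead be set through
    # pytesseract.pytesseract.tesseract_cmd, which this check does not see.
    if shutil.which(executable) is None:
        missing.append(executable)
    return missing


print(missing_requirements())
```

An empty list means both Python modules and the Tesseract executable were found.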

Application

This project is used to extract information from chemistry data books.

Files

OCR.py
cropImage.py
readMe.md
rotateImage.py

Source repository: sreekanth221998/OCR
