Skip to content

sreekanth221998/OCR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

OCR using pytesseract and OpenCV

This Git repository provides an example of how to perform Optical Character Recognition (OCR) using Python libraries pytesseract and OpenCV.

Overview

OCR is the process of electronically extracting text from images or scanned documents. In this repository, we use pytesseract and OpenCV to extract text from an image and save it to a text file.

Pytesseract is a Python wrapper for Google's Tesseract-OCR Engine, which can be used to recognize a wide variety of fonts and languages. OpenCV is an open-source computer vision and machine learning software library, which can be used for image processing tasks such as image enhancement, image segmentation, and more.

Setup

To use this repository, you'll need to have Python installed on your computer. You'll also need to install the following libraries:

  • pytesseract
  • opencv-python

Application

This project is used to extract information from chemistry data books.

About

Optical Character Recognition

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages