Optical Character Recognition

Abstract

Optical Character Recognition is a small Python project that uses Tesseract (through pytesseract) and OpenCV to recognize text in images. The script renders sample text into an image, extracts it with OCR, prints the result, and is run from the command line.

Prerequisites

Python 3.8 or above
A code editor or IDE
Basic understanding of OCR and computer vision
Required libraries: pytesseract, opencv-python, numpy
The Tesseract OCR engine installed on your system (pytesseract is only a wrapper around it)

Before you Start

Install Python and the required libraries:

Install dependencies

pip install pytesseract opencv-python numpy
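
The pip command above installs only the Python packages, not the Tesseract engine itself. The following is a minimal sketch of a pre-flight check, assuming the executable is named tesseract and is on your PATH (pytesseract's default lookup). The helper name tesseract_available is hypothetical and not part of the project.

```python
import shutil


def tesseract_available(executable: str = "tesseract") -> bool:
    """Return True if the given executable can be found on PATH."""
    return shutil.which(executable) is not None


if __name__ == "__main__":
    if tesseract_available():
        print("Tesseract found on PATH.")
    else:
        print("Tesseract not found; install it and make sure it is on PATH.")
```

If the check fails, install Tesseract with your operating system's package manager or installer before running the project.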

Getting Started

Create a Project

Create a folder named optical-character-recognition.
Open the folder in your code editor or IDE.
Create a file named optical_character_recognition.py.
Copy the code below into your file.

Write the Code

Optical Character Recognition

import cv2
import numpy as np
import pytesseract


class OpticalCharacterRecognition:
    """Thin wrapper around pytesseract for recognizing text in images."""

    def recognize_text(self, image):
        # image_to_string accepts a NumPy array (or PIL image) and returns the text
        text = pytesseract.image_to_string(image)
        print(f"Recognized text: {text}")
        return text

    def demo(self):
        # Black canvas, wide enough that the rendered text is not clipped
        img = np.zeros((100, 450, 3), dtype=np.uint8)
        # Draw white sample text onto the canvas
        cv2.putText(img, 'Python OCR', (5, 70), cv2.FONT_HERSHEY_SIMPLEX, 2, (255, 255, 255), 3)
        self.recognize_text(img)
        # Show the image for one second (requires a desktop session)
        cv2.imshow('OCR Demo', img)
        cv2.waitKey(1000)
        cv2.destroyAllWindows()


if __name__ == "__main__":
    print("Optical Character Recognition Demo")
    ocr = OpticalCharacterRecognition()
    ocr.demo()
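
The demo draws white text on a black canvas, while Tesseract generally does better with dark text on a light background. The following is a minimal preprocessing sketch using only NumPy. The function name to_dark_text_on_light and the fixed threshold of 128 are assumptions made for illustration and are not part of the project.

```python
import numpy as np


def to_dark_text_on_light(image: np.ndarray, threshold: int = 128) -> np.ndarray:
    """Return a binary uint8 image with black text on a white background."""
    # Collapse colour channels into a single grayscale channel
    if image.ndim == 3:
        gray = image.mean(axis=2)
    else:
        gray = image.astype(np.float64)

    # Binarise: pixels at or above the threshold become white, the rest black
    binary = np.where(gray >= threshold, 255, 0).astype(np.uint8)

    # If most pixels are black, assume a dark background and invert
    if binary.mean() < 128:
        binary = 255 - binary
    return binary
```

The result can be passed to recognize_text in place of the original image, for example self.recognize_text(to_dark_text_on_light(img)). A fixed threshold is a simplification; scanned or unevenly lit images usually need adaptive thresholding instead.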

Example Usage

Run OCR

python optical_character_recognition.py
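
The command above runs the built-in demo only. To read text from your own image files, the script could accept a path argument. The following is a hedged sketch using argparse; build_parser, main, the image argument, and the --lang option are hypothetical names that the project does not define.

```python
import argparse
import sys


def build_parser() -> argparse.ArgumentParser:
    """Build the command-line parser for the hypothetical OCR command."""
    parser = argparse.ArgumentParser(description="Recognize text in an image file.")
    parser.add_argument("image", help="path to the image file to read")
    parser.add_argument("--lang", default="eng",
                        help="Tesseract language code (default: eng)")
    return parser


def main(argv=None) -> int:
    args = build_parser().parse_args(argv)

    # Imported here so that --help works even if the libraries are missing
    import cv2
    import pytesseract

    image = cv2.imread(args.image)
    if image is None:  # cv2.imread returns None when the file cannot be read
        print(f"Could not read image: {args.image}", file=sys.stderr)
        return 1

    # OpenCV loads images as BGR; convert to RGB before handing to pytesseract
    rgb = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
    print(pytesseract.image_to_string(rgb, lang=args.lang))
    return 0
```

To use it, call sys.exit(main()) from the script's if __name__ == "__main__": block, then run the script with an image path of your own, optionally followed by --lang and a language code for which Tesseract data is installed.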

Explanation