OCR with Python, Tesseract, and OpenCV

This project demonstrates Optical Character Recognition (OCR) on images using Python, Tesseract, and OpenCV.

Requirements

Python
OpenCV
Tesseract

Setup

Install the required packages:
- Install the Python packages listed in requirements.txt.
- Download and install Tesseract for your operating system.
- Update the tesseract_cmd variable in run.py with the path to the Tesseract executable on your system.
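
As a rough illustration of the last setup step, the sketch below looks for a Tesseract executable and assigns it to pytesseract.pytesseract.tesseract_cmd. The helper names resolve_tesseract_cmd and configure_pytesseract are hypothetical and not part of run.py, and the candidate paths are whatever you pass in; treat it as one possible approach rather than how the project does it.

```python
import os
import shutil


def resolve_tesseract_cmd(candidates=()):
    """Return the first usable Tesseract executable path, or None."""
    # Explicit candidate paths take priority over whatever is on PATH.
    for path in candidates:
        if path and os.path.isfile(path) and os.access(path, os.X_OK):
            return path
    # Fall back to searching PATH for an executable named "tesseract".
    return shutil.which("tesseract")


def configure_pytesseract(candidates=()):
    """Point pytesseract at the resolved executable and return its path."""
    # Imported lazily so resolve_tesseract_cmd can be used without pytesseract.
    import pytesseract

    cmd = resolve_tesseract_cmd(candidates)
    if cmd is None:
        raise FileNotFoundError("Tesseract executable not found")
    pytesseract.pytesseract.tesseract_cmd = cmd
    return cmd
```

Calling configure_pytesseract() with no arguments relies on Tesseract being on PATH; passing an explicit list of paths mirrors hard-coding tesseract_cmd in run.py.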

Usage

Place the image you want to perform OCR on in the img directory.
Update the cv2.imread call in run.py with the path to your image.
Run the script:

python run.py

The script loads the image, optionally converts it to grayscale (whether this helps depends on the image), and then runs OCR on it with Tesseract. The recognized text is printed to the console.
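
The flow described above can be sketched roughly as follows. This is not the contents of run.py: the function name ocr_image and its parameters are assumptions made for illustration, and the default lang="por" simply reflects the Portuguese setting mentioned in the note below.

```python
import os


def ocr_image(image_path, lang="por", grayscale=True):
    """Load an image, optionally convert it to grayscale, and return the OCR text."""
    if not os.path.isfile(image_path):
        raise FileNotFoundError(f"Image not found: {image_path}")

    # Imported here so the path check above works without the OCR dependencies.
    import cv2
    import pytesseract

    image = cv2.imread(image_path)
    if image is None:
        # cv2.imread returns None instead of raising when it cannot decode a file.
        raise ValueError(f"Could not decode image: {image_path}")

    if grayscale:
        # OpenCV loads images in BGR channel order, hence COLOR_BGR2GRAY.
        image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)

    return pytesseract.image_to_string(image, lang=lang)


# Example call (the file name is hypothetical):
# print(ocr_image("img/example.png"))
```

Taking the path as an argument avoids editing the cv2.imread call for every new image, at the cost of differing from the script's current layout.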

Note

The current script is set to recognize Portuguese. To use another language, change the lang parameter of the pytesseract.image_to_string call to the appropriate Tesseract language code (for example, eng for English). The language data for that code must be installed alongside Tesseract. The supported language codes are listed in the Tesseract documentation.
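
To illustrate the lang parameter, the sketch below builds a language string and checks it against a list of installed languages. Tesseract accepts several codes joined with "+", such as por+eng. The helpers build_lang and missing_languages are hypothetical names introduced here, and the commented call to pytesseract.get_languages assumes a reasonably recent pytesseract release.

```python
def build_lang(*codes):
    """Join Tesseract language codes into one lang string, e.g. 'por+eng'."""
    cleaned = [code.strip() for code in codes if code and code.strip()]
    if not cleaned:
        raise ValueError("At least one language code is required")
    return "+".join(cleaned)


def missing_languages(lang, installed):
    """Return the codes in lang that are absent from the installed collection."""
    return [code for code in lang.split("+") if code not in installed]


# Possible usage, assuming pytesseract and Tesseract are installed:
# import pytesseract
# lang = build_lang("por", "eng")
# print(missing_languages(lang, pytesseract.get_languages(config="")))
```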

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

This project is licensed under the MIT License.

igorxcardoso/ocr-fast
