Persian OCR

This project implements a LeNet convolutional neural network (CNN) for recognizing Persian letters. The model has been trained on a dataset consisting of 570,000 images of Persian letters, and it can accurately classify individual letters from images.

Dataset

We could not find a suitable public dataset for classifying Persian letters, so we created our own. It is available on Kaggle: Persian Alphabets and Numbers. The dataset contains images of handwritten Persian letters and numbers, with over [number of images] samples, and covers a range of handwriting styles and letter-shape variations found in real-world writing. Feel free to use it in your own projects, and to send feedback or contribute additional samples.

Model Architecture

The LeNet neural network architecture consists of several layers, including convolutional layers, pooling layers, and fully connected layers. The exact architecture used in this project is as follows:

Convolutional Layer (input: 30x25x1, output: 28x23x32)
ReLU Activation
Average Pooling (output: 14x11x32)
Convolutional Layer (output: 12x9x64)
ReLU Activation
Average Pooling (output: 6x4x64)
Convolutional Layer (output: 4x3x128)
Flatten (output: 1536)
Fully Connected Layer (output: 120)
ReLU Activation
Fully Connected Layer (output: 84)
ReLU Activation
Output Layer (output: 73, one unit per character class)
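
The layer sizes above can be checked with a little shape arithmetic. The sketch below is not taken from the project code: it assumes stride-1 convolutions without padding and 2x2 pooling with stride 2, none of which the README states. With 3x3 kernels the first convolution comes out as 28x23x32 and the shapes follow the list down to 6x4x64. The last convolution is given a hypothetical 3x2 kernel purely so that the result matches the listed 4x3x128 and the flatten size of 1536; a 3x3 kernel would give 4x2x128 (1024 values) instead. The helper names are illustrative.

```python
def conv_out(shape, kernel, filters):
    """Stride-1 convolution without padding: each spatial dim shrinks by kernel - 1."""
    h, w, _ = shape
    kh, kw = kernel
    return (h - kh + 1, w - kw + 1, filters)


def pool_out(shape, size=2):
    """Non-overlapping pooling (stride == size); odd dimensions are floored."""
    h, w, c = shape
    return (h // size, w // size, c)


def trace_shapes(input_shape=(30, 25, 1), last_kernel=(3, 2)):
    """Return the feature-map shape after each layer, plus the flattened size.

    Kernel sizes are assumptions; last_kernel=(3, 2) is chosen only to
    reproduce the 4x3x128 shape listed in the README.
    """
    shapes = [input_shape]
    shapes.append(conv_out(shapes[-1], (3, 3), 32))        # first convolution
    shapes.append(pool_out(shapes[-1]))                    # average pooling
    shapes.append(conv_out(shapes[-1], (3, 3), 64))        # second convolution
    shapes.append(pool_out(shapes[-1]))                    # average pooling
    shapes.append(conv_out(shapes[-1], last_kernel, 128))  # third convolution
    h, w, c = shapes[-1]
    return shapes, h * w * c


if __name__ == "__main__":
    shapes, flat = trace_shapes()
    for shape in shapes:
        print("x".join(str(dim) for dim in shape))
    print("flatten:", flat)
```

Calling `trace_shapes(last_kernel=(3, 3))` shows the alternative result for comparison, which is a quick way to see which kernel size is consistent with the stated flatten size.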

Usage

To use the trained model for letter recognition, install the app and run it. The app also provides English image-to-text conversion using EasyOCR.
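
The README does not show how the app calls EasyOCR, so the following is only a minimal sketch of English text extraction using EasyOCR's public `Reader` and `readtext` API. The function name `read_english_text` and the image path in the usage note are hypothetical, and the optional `reader` argument is an illustrative convenience so an existing reader can be reused.

```python
def read_english_text(image_path, reader=None):
    """Return the text lines EasyOCR finds in an image, as a list of strings."""
    if reader is None:
        # Imported lazily so the helper can be loaded without EasyOCR installed.
        import easyocr

        # Creating a Reader loads the English models (downloaded on first use).
        reader = easyocr.Reader(["en"])
    # detail=0 returns plain strings instead of (bounding box, text, confidence).
    return reader.readtext(image_path, detail=0)
```

For example, `read_english_text("sample.png")` would return the recognized lines for a hypothetical image file. Creating the reader once and passing it in avoids reloading the models on every call.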

