document-ocr

Layout-preserving OCR for documents, covering text, tables, and figures. Useful for LEAP OCR and for API calls from Bhashini apps.

Step 1 : Create Virtual Environment

Make sure you are using Python 3.10, then create a virtual environment to hold the dependencies installed in the next step:

python3 -m venv <myenvpath>
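Before installing anything, it can help to confirm that the interpreter in use really is Python 3.10 and that the virtual environment is active. The following is a minimal sketch, not part of the repository; the function names `is_python_310` and `in_virtual_env` are illustrative assumptions.

```python
import sys


def is_python_310(version_info=sys.version_info):
    """Return True when the given version is Python 3.10.x."""
    return tuple(version_info[:2]) == (3, 10)


def in_virtual_env():
    """Return True when running inside an environment created by venv."""
    # Inside a venv, sys.prefix points at the environment directory,
    # while sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("Python 3.10:", is_python_310())
    print("Virtual environment active:", in_virtual_env())
```

Running this with the interpreter from the activated environment should report both checks as true; if either is false, recreate or activate the environment before continuing.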

Step 2 : Install Requirements

With the virtual environment activated, install the dependencies listed in requirements.txt:

pip install -r requirements.txt

Step 3 : Download Models

Download the two models from the Releases section. Place the figure-detector model in the 'figures/model' directory and place sprint.pt, the table structure recognition model, in the 'tables/model' directory.
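A missing model file is an easy mistake to make at this step, so a quick check before running the pipeline may save time. The sketch below is not part of the repository, and `check_models` is a hypothetical helper. It assumes it is run from the repository root. Because the figure-detector file name is not given here, it only verifies that 'figures/model' contains at least one file.

```python
from pathlib import Path


def check_models(repo_root="."):
    """Return a list of problems found with the model layout (empty if none)."""
    root = Path(repo_root)
    problems = []

    # The figure-detector file name is not stated in this README, so we only
    # check that the directory exists and holds at least one file.
    figures_dir = root / "figures" / "model"
    if not figures_dir.is_dir() or not any(p.is_file() for p in figures_dir.iterdir()):
        problems.append(f"no model file found in {figures_dir}")

    # The table structure recognition model is expected under this exact name.
    sprint_path = root / "tables" / "model" / "sprint.pt"
    if not sprint_path.is_file():
        problems.append(f"missing {sprint_path}")

    return problems


if __name__ == "__main__":
    issues = check_models()
    if issues:
        for issue in issues:
            print("Problem:", issue)
    else:
        print("Model files look in place.")
```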

Step 4 : Run the pipeline

In main.py, set the input file parameters, the output set name, the language, and the table and figure flags, then run it as follows:

python3 main.py

Step 5 : Using the UI

You can also run the pipeline from the Streamlit UI and download the compressed output:

streamlit run app.py
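Once the compressed output has been downloaded from the UI, it can be unpacked with the Python standard library. This is a minimal sketch under assumptions: the archive format is not stated here, so it relies on `shutil.unpack_archive` to infer the format (zip, tar, and similar) from the file extension, and `unpack_output` and the example file names are hypothetical.

```python
import shutil
from pathlib import Path


def unpack_output(archive_path, dest_dir):
    """Extract a downloaded archive and return the relative paths of its files."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)

    # unpack_archive picks the extraction method from the file extension.
    shutil.unpack_archive(str(archive_path), str(dest))

    return sorted(
        p.relative_to(dest).as_posix() for p in dest.rglob("*") if p.is_file()
    )


if __name__ == "__main__":
    # Hypothetical file name; replace with the archive you downloaded.
    archive = Path("output.zip")
    if archive.is_file():
        for name in unpack_output(archive, "output_unpacked"):
            print(name)
```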


IITB-LEAP-OCR/document-ocr
