searchable-pdf

Here are 9 public repositories matching this topic...

NanoNets / ocr-python

OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.

python pdf ocr tesseract pdf-to-text image-to-text textract pdf-to-csv pdf-to-json searchable-pdf pytesseract-ocr extract-table table-extract image-to-text-converter extract-text-from-image extract-text-from-pdf

Updated Dec 2, 2022
Jupyter Notebook

timberger / Searchable-Image-PDF-Creat-O-Mat

Star

This batch script creates a searchable PDF of a PDF with one or more scanned pages which contain images.

pdf ghostscript imagemagick converter ocr drag drop tesseract scan batch scanned-documents batch-script scanned-pages imagemagick-wrapper searchable-pdfs scanned-image-pdfs tesseract-wrapper ghostscript-wrapper searchable-pdf

Updated Oct 22, 2022
Batchfile

zaakki-ahamed / Arabic_OCR_From_PDF

Star

Perform Optical Character Recognition (OCR) on a scanned PDF file containing Arabic text and output a searchable PDF

optical-character-recognition arabic pytesseract searchable-pdf

Updated Dec 18, 2023
Python

Achiwilms / OCR-Wizard

Star

A powerful and user-friendly tool based on OCRmyPDF, offering a seamless GUI for conversion of image-based PDFs into searchable text.

python pdf ocrmypdf ocr-recognition pdf-ocr-extraction ocr-python searchable-pdf ocr-pdf pdf-ocr

Updated Oct 28, 2023
Python

pratik149 / pdf-table-extractor

Star

Extract tables from searchable as well as non-searchable pdf files

python console pdf opencv excel table extract-data searchable-pdf

Updated Oct 6, 2020
Jupyter Notebook

Haighton / create_searchable_pdf

Star

Create a searchable PDF with ALTO-XML and JP2 files.

pdf alto-xml searchable-pdf

Updated Nov 30, 2020
CSS

jidel / Searchable-PDF-Creator

Star

Quick proof of concept to perform OCR on images.

ocr wpf tesseract searchable-pdf

Updated Jul 22, 2020
C#

AlfredoCubitos / ocr2pdf

Star

Tool for creating searchable PDFs

ocr tesseract pdf-document searchable-pdf ocr2pdf

Updated Nov 27, 2019
Python

sxaxmz / handle_scanned_pdf

Star

A wrapper on top of python-OCR tools such as pytesseract and easyocr, to recognize and extract text embedded in images. Also, convert scanned-PDFs to text searchable PDFs.

tesseract-ocr pytesseract ocr-python scanned-image-pdfs searchable-pdf easyocr scanned-pdf-documents extract-text-from-image extract-text-from-pdf

Updated Jul 6, 2024
Python

Improve this page

Add a description, image, and links to the searchable-pdf topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the searchable-pdf topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

searchable-pdf

Here are 9 public repositories matching this topic...

NanoNets / ocr-python

timberger / Searchable-Image-PDF-Creat-O-Mat

zaakki-ahamed / Arabic_OCR_From_PDF

Achiwilms / OCR-Wizard

pratik149 / pdf-table-extractor

Haighton / create_searchable_pdf

jidel / Searchable-PDF-Creator

AlfredoCubitos / ocr2pdf

sxaxmz / handle_scanned_pdf

Improve this page

Add this topic to your repo