Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add OCR Support for PDF and Image Files to Enhance System Usability #597

Open
IANTHEREAL opened this issue Jan 16, 2025 · 0 comments
Open

Comments

@IANTHEREAL
Copy link
Contributor

IANTHEREAL commented Jan 16, 2025

To improve system usability and handle a broader range of document types, it is suggested to integrate OCR (Optical Character Recognition) capabilities. This enhancement will enable the system to process PDF files and images that do not contain embedded text but instead rely on scanned images or other visual formats.

Use Case:
A user uploads a PDF file that contains complicated structure (e.g., scanned documents). Currently, the system extracts 0 characters from such files, resulting in errors or failed processing steps like vector index construction. By integrating OCR, the system can extract meaningful text from these image-based files, allowing seamless processing.

@IANTHEREAL IANTHEREAL changed the title Support ingesting PDF files that contains picture Add OCR Support for PDF and Image Files to Enhance System Usability Jan 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant