Support saving of layouts to open-standard hOCR file format. #188

Open
Blue-PCB opened this issue Aug 1, 2023 · 0 comments


Blue-PCB commented Aug 1, 2023

Motivation
I would like layout-parser to support saving layouts in hOCR (HTML OCR), the open-standard file format for representing OCR results and document layouts. This would make it easier to create OCR'ed PDFs and to interoperate with other tools that read hOCR.

Related resources
hOCR Specification v1.2

Additional context
Ocropus hOCR-Tools supports the hOCR format, but hasn't been updated in a while.
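
To illustrate the kind of output being requested, here is a rough sketch of how layout blocks might be serialized to hOCR. It deliberately does not use any layout-parser API: the `blocks_to_hocr` helper and the plain `(x0, y0, x1, y1, text)` tuple input are hypothetical, made up for this example. Only the `ocr_page` and `ocr_carea` classes and the `bbox` title property are taken from the hOCR specification.

```python
from html import escape


def blocks_to_hocr(blocks, page_width, page_height):
    """Serialize layout blocks to a minimal single-page hOCR document.

    `blocks` is an iterable of (x0, y0, x1, y1, text) tuples in pixel
    coordinates. This input shape is an assumption for illustration,
    not layout-parser's actual data model.
    """
    parts = [
        "<!DOCTYPE html>",
        "<html>",
        "<head>",
        '<meta charset="utf-8"/>',
        '<meta name="ocr-capabilities" content="ocr_page ocr_carea"/>',
        "</head>",
        "<body>",
        # The page element carries the full page bounding box.
        f'<div class="ocr_page" id="page_1" '
        f'title="bbox 0 0 {int(page_width)} {int(page_height)}">',
    ]
    for i, (x0, y0, x1, y1, text) in enumerate(blocks, start=1):
        # hOCR bounding boxes are integer pixel coordinates.
        bbox = " ".join(str(int(round(v))) for v in (x0, y0, x1, y1))
        parts.append(
            f'<div class="ocr_carea" id="block_1_{i}" '
            f'title="bbox {bbox}">{escape(text)}</div>'
        )
    parts += ["</div>", "</body>", "</html>"]
    return "\n".join(parts)


if __name__ == "__main__":
    print(blocks_to_hocr([(10.2, 20.7, 110, 60, "Hello & bye")], 800, 600))
```

A real exporter would probably also emit `ocr_line` and `ocrx_word` elements when word-level OCR results are available, and would need to decide how layout-parser's block types map onto hOCR element classes.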
