You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Motivation
I hope layout-parser can support the open standard HTML OCR (hOCR) file format that represents document layouts. It would allow easier creation of OCR'ed PDFs and allow for interoperability with other tools.
Motivation
I hope layout-parser can support the open standard HTML OCR (hOCR) file format that represents document layouts. It would allow easier creation of OCR'ed PDFs and allow for interoperability with other tools.
Related resources
hOCR Specification v1.2
Additional context
Ocropus hOCR-Tools supports the hOCR format, but hasn't been updated in a while.
The text was updated successfully, but these errors were encountered: