No. Docling does not output raw OCR data. It assembles the output into a structured form whose units are paragraphs, list items, section headings, tables, pictures and more. It is not possible to trace which tokens in those units originated from OCR and which were encoded digitally in the PDF.
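To illustrate why the origin cannot be recovered, here is a toy sketch in plain Python. It is not Docling's API or data model: `Token`, `Unit`, `assemble`, and the `source` field are all hypothetical names made up for this example. It only shows the general idea that once tokens are merged into a structured unit, a per-token origin flag has nowhere to live.

```python
from dataclasses import dataclass


@dataclass
class Token:
    """Hypothetical low-level token; not a Docling type."""
    text: str
    source: str  # assumed values: "ocr" or "digital"


@dataclass
class Unit:
    """Hypothetical structured unit (paragraph, heading, ...); not a Docling type."""
    label: str
    text: str


def assemble(label: str, tokens: list[Token]) -> Unit:
    # Merge token texts into one unit; the per-token source is dropped here.
    return Unit(label=label, text=" ".join(t.text for t in tokens))


tokens = [
    Token("Scanned", "ocr"),
    Token("heading", "ocr"),
    Token("text", "digital"),
]
unit = assemble("section_header", tokens)
print(unit)
```

The resulting `unit` carries only a label and the merged text, so nothing in it says which words came from OCR.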
As the title says.