Incorrect Reading Order in Single-page Image-Text Layouts #570
Comments
@Bariskau what input format were you using? Is this a native PowerPoint file, a PDF, or something else? If you provide the source file, we can verify this more easily.
sample-cpu.pdf
I am also having the same issue. Is there any solution to this ordering problem?
LayoutReader (LayoutLM) ordering works well with Docling. However, Docling has limitations in providing line height and width values. Because of this limitation, dividing the layout bounding boxes into random smaller bounding boxes and then ordering them with the model generally yields good results. However, there are two significant issues with this approach:
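The splitting step mentioned in the comment above can be sketched roughly as follows. This is only an illustration, not code from Docling or LayoutReader: the `Box` type, the `split_into_line_boxes` helper, and the `approx_line_height` parameter are hypothetical names, the coordinates assume a top-left origin, and the sketch uses evenly sized horizontal strips rather than the random smaller boxes the comment describes.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Box:
    """Hypothetical bounding box; top-left origin is an assumption."""
    left: float
    top: float
    right: float
    bottom: float


def split_into_line_boxes(box: Box, approx_line_height: float) -> List[Box]:
    """Split one layout box into evenly sized horizontal strips.

    approx_line_height is a guessed value, since the real line height
    is not available from the layout box itself.
    """
    if approx_line_height <= 0:
        raise ValueError("approx_line_height must be positive")
    height = box.bottom - box.top
    # Always produce at least one strip, even for very short boxes.
    count = max(1, round(height / approx_line_height))
    step = height / count
    return [
        Box(box.left, box.top + i * step, box.right, box.top + (i + 1) * step)
        for i in range(count)
    ]
```

The resulting strips could then be passed to an ordering model in place of the original box; how they are mapped back to the original layout element is left open here.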
@Bariskau @mkhalid12 A revised reading order model is currently under development. We will post updates when they are ready.
@Bariskau @mkhalid12 You can track this PR: #811
@cau-git This is great news, looking forward to it.
Bug
The page reading order is incorrect. Especially in single-page documents with images on the left and text content on the right, the elements are not read in the expected order. As a result, content is mapped under the wrong image when converting to Markdown format.
Steps to reproduce
Expected Behavior:
Reading order should be: Page Title => Image 1 => Section-Header 1 => List Items => Image 2
Actual Behavior:
Reading order is incorrect: Page Title => Image 1 => Image 2 => Section-Header 1
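To make the expected order concrete, here is a minimal sketch of a band-based ordering heuristic: elements that overlap vertically are grouped into one horizontal band, and each band is read left to right. This is not how Docling orders elements; `Element`, `band_reading_order`, and all coordinates are hypothetical, chosen only to mimic an image-left, text-right page with a top-left origin.

```python
from typing import List, NamedTuple


class Element(NamedTuple):
    """Hypothetical layout element; top-left origin is an assumption."""
    label: str
    left: float
    top: float
    right: float
    bottom: float


def band_reading_order(elements: List[Element]) -> List[str]:
    """Group vertically overlapping elements into bands, then read
    each band left to right (top to bottom within the same column)."""
    bands: List[List[Element]] = []
    band_bottom = float("-inf")
    for el in sorted(elements, key=lambda e: (e.top, e.left)):
        if not bands or el.top >= band_bottom:
            # Starts below the current band: open a new band.
            bands.append([el])
            band_bottom = el.bottom
        else:
            # Overlaps the current band vertically: join it.
            bands[-1].append(el)
            band_bottom = max(band_bottom, el.bottom)
    ordered: List[Element] = []
    for band in bands:
        ordered.extend(sorted(band, key=lambda e: (e.left, e.top)))
    return [e.label for e in ordered]


# Made-up coordinates for an image-left, text-right page.
page = [
    Element("Page Title", 50, 20, 550, 60),
    Element("Image 1", 50, 100, 250, 300),
    Element("Section-Header 1", 300, 100, 550, 130),
    Element("List Items", 300, 140, 550, 300),
    Element("Image 2", 50, 350, 250, 550),
]
print(band_reading_order(page))
```

A column-first ordering (all left-column items, then all right-column items) would instead place Image 2 before Section-Header 1, which matches the actual behavior reported above.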
Docling version
2.10.0
Python version
3.10
Sample layout