Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is the auto-merge broken text blocks capability active in the last stable version (3.0.5)? #336

Closed
Alb93 opened this issue Apr 3, 2024 · 3 comments

Comments

@Alb93
Copy link

Alb93 commented Apr 3, 2024

I'm experiencing some broken text blocks problems with the latest version of the library.

The pdf looks like this

image

and this is what I'm getting, as an example:
{ "x": 13.214, "y": 5.237, "w": 2.542, "oc": "#00416e", "sw": 0.44271875, "A": "left", "R": [ { "T": "QUA", "S": -1, "TS": [ 0, 13.72, 1, 0 ] } ] }, { "x": 14.743, "y": 5.237, "w": 4.115, "oc": "#00416e", "sw": 0.44271875, "A": "left", "R": [ { "T": "DRO%20RS", "S": 9, "TS": [ 0, 12, 1, 0 ] } ] },

Is there any way to activate the auto-merging step by using the library on a node environment? Is there any way to eventually tune it? Thank you!

@Alb93 Alb93 changed the title Is auto merge text block capability active in the last stable version (3.0.5)? Is the auto-merge broken text block capability active in the last stable version (3.0.5)? Apr 3, 2024
@Alb93 Alb93 changed the title Is the auto-merge broken text block capability active in the last stable version (3.0.5)? Is the auto-merge broken text blocks capability active in the last stable version (3.0.5)? Apr 3, 2024
@modesty
Copy link
Owner

modesty commented Jun 9, 2024

can you upload the PDF?

@Alb93
Copy link
Author

Alb93 commented Jul 3, 2024

From the output it seems that the fontsize is (strangely) different. "QUA" is [ 0, 13.72, 1, 0 ] and "DRO RS" is [ 0, 12, 1, 0 ]. This is something that can be seen also by opening the PDF with a PDF reader. So is it probably a printing problem more than a merging one?
How does the merging work?

@modesty
Copy link
Owner

modesty commented Jul 28, 2024

the -m option in command line will turn on PROCESS_MERGE_BROKEN_TEXT_BLOCKS, tried it?

@modesty modesty closed this as completed Aug 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants