Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix content not correctly parsed at pdf content importer #12338

Merged
merged 2 commits into from
Jan 4, 2025

Conversation

leaf-soba
Copy link
Contributor

@leaf-soba leaf-soba commented Dec 27, 2024

Closes #12278

  1. is far away should compare with the last one in font size, not the former text
  2. fix the abs value issue.
  3. fix the Ygap calculate issue.

Mandatory checks

  • I own the copyright of the code submitted and I licence it under the MIT license
  • Change in CHANGELOG.md described in a way that is understandable for the average user (if change is visible to the user)
  • Tests created for changes (if applicable)
  • Manually tested changed features in running JabRef (always required)
  • Screenshots added in PR description (for UI changes)
  • Checked developer's documentation: Is the information available and up to date? If not, I outlined it in this pull request.
  • Checked documentation: Is the information available and up to date? If not, I created an issue at https://github.com/JabRef/user-documentation/issues or, even better, I submitted a pull request to the documentation repository.

1. is far away should compare with the last one in font size, not the former text
2. fix the abs value issue.
3. fix the Ygap calculate issue.
@Siedlerchr
Copy link
Member

@InAnYan You are currently more into the pdf parsing, can you take a look?

@Siedlerchr Siedlerchr changed the title Fix the isFarAway issue Fix content not correctly parsed at pdf content importer Jan 1, 2025
Siedlerchr
Siedlerchr previously approved these changes Jan 3, 2025
@Siedlerchr
Copy link
Member

Tests are inlcuded so from my point of view this is fine

@Siedlerchr
Copy link
Member

Siedlerchr commented Jan 3, 2025

As we released 6.0 alpha, can you please add a changelog entry like?
We improved the offline pdf content parsing

@Siedlerchr Siedlerchr added the status: ready-for-review Pull Requests that are ready to be reviewed by the maintainers label Jan 3, 2025
@Siedlerchr Siedlerchr enabled auto-merge January 4, 2025 13:11
@Siedlerchr Siedlerchr added this pull request to the merge queue Jan 4, 2025
Merged via the queue into JabRef:main with commit 1e837cd Jan 4, 2025
23 checks passed
@leaf-soba
Copy link
Contributor Author

As we released 6.0 alpha, can you please add a changelog entry like? We improved the offline pdf content parsing

sry I was busy last week didn't do it, next time I'll add changelog if necessary.

Siedlerchr added a commit that referenced this pull request Jan 8, 2025
* upstream/main:
  Update abbrv.jabref.org (#12365)
  Bump buildres/abbrv.jabref.org from `78e1b08` to `c202741` (#12363)
  Bump org.beryx.jlink from 3.1.0-rc-1 to 3.1.1 (#12362)
  Bump tech.units:indriya from 2.2.1 to 2.2.2 (#12361)
  Bump com.dlsc.gemsfx:gemsfx from 2.80.0 to 2.81.0 (#12360)
  Bump org.mariadb.jdbc:mariadb-java-client from 2.7.9 to 2.7.12 (#12359)
  Bump org.mockito:mockito-core from 5.14.2 to 5.15.2 (#12358)
  Fix content not correctly parsed at pdf content importer (#12338)
  Bump de.undercouch:citeproc-java from 3.1.0 to 3.2.0 (#12347)
  Bump org.openrewrite.rewrite from 6.29.0 to 6.29.2 (#12345)
  Improve utf8 detection  (#12342)
  Bump org.fxmisc.richtext:richtextfx from 0.11.3 to 0.11.4 (#12346)
  Bump ai.djl:bom from 0.30.0 to 0.31.1 (#12344)
  Bump com.puppycrawl.tools:checkstyle from 10.21.0 to 10.21.1 (#12343)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
status: ready-for-review Pull Requests that are ready to be reviewed by the maintainers
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Content of "opium.pdf" not correctly parsed
2 participants