cannot find loader for this WMF file <- When extracting a PPTX. #560

Closed
JTCorrin opened this issue Dec 10, 2024 · 5 comments
Labels: pptx (issue related to pptx backend), question (Further information is requested)

Comments

@JTCorrin

Question

...

I seem to get this error when running in a Python container. I installed libwmf, to no avail.

Is this expected and does anyone have any pointers here?

Appreciated!

@JTCorrin JTCorrin added the question Further information is requested label Dec 10, 2024
@dolfim-ibm dolfim-ibm added the pptx issue related to pptx backend label Dec 11, 2024
@maxmnemonic
Contributor

maxmnemonic commented Dec 11, 2024

@JTCorrin, any chance you could share an example of a problematic PPTX with us?
I would expect the error to appear not on every PPTX, but only on those that have WMF images embedded.

It would also be great to know which version of Docling you are running.
Thanks in advance!
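
For anyone trying to check whether a given PPTX falls into the "has WMF embedded" case, here is a minimal sketch using only the standard library. It assumes the usual PPTX layout, where embedded images live under `ppt/media/` inside the ZIP archive; the function name `find_wmf_media` is made up for illustration and is not part of Docling.

```python
import zipfile
from pathlib import PurePosixPath


def find_wmf_media(pptx_path):
    """Return archive member names under ppt/media/ that end in .wmf.

    Hypothetical helper: it only inspects file names inside the PPTX
    (which is a ZIP archive) and does not try to decode any image.
    """
    with zipfile.ZipFile(pptx_path) as archive:
        return [
            name
            for name in archive.namelist()
            if name.startswith("ppt/media/")
            and PurePosixPath(name).suffix.lower() == ".wmf"
        ]
```

An empty list from `find_wmf_media("deck.pptx")` would suggest the file has no WMF media parts; a non-empty list names the parts that are candidates for triggering the error. It can also help when building a shareable "demo" file that contains no client data.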

@maxmnemonic maxmnemonic self-assigned this Dec 11, 2024
@nikhildigde

Facing the same issue.
docling - 2.10.0
docling-core - 2.9.0
docling-ibm-models - 2.0.7
docling-parse - 3.0.0

@JTCorrin
Author

Hey @maxmnemonic, sorry for the delayed response. I can't share that PPTX, I'm afraid, as it's a client's file. This is what I'm running:

docling==2.7.0
docling-core==2.4.0
docling-ibm-models==2.0.6
docling-parse==2.1.0
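
A version list like the one above can be collected from inside the container with a short script. This is a sketch that relies only on the standard library's `importlib.metadata`; the package names are the four mentioned in this thread, and `installed_versions` is a hypothetical helper name.

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names taken from the version lists posted in this thread.
PACKAGES = ("docling", "docling-core", "docling-ibm-models", "docling-parse")


def installed_versions(packages=PACKAGES):
    """Map each distribution name to its installed version, or None if absent."""
    result = {}
    for name in packages:
        try:
            result[name] = version(name)
        except PackageNotFoundError:
            result[name] = None
    return result


if __name__ == "__main__":
    for name, ver in installed_versions().items():
        print(f"{name}=={ver}" if ver else f"{name}: not installed")
```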

@PeterStaar-IBM
Contributor

@JTCorrin Can you reproduce the error with a "demo" file and share it with us?

@cau-git
Contributor

cau-git commented Dec 18, 2024

@JTCorrin We will continue tracking this problem in #594, which has provided a sample that fails.

@cau-git cau-git closed this as completed Dec 18, 2024