-
Notifications
You must be signed in to change notification settings - Fork 893
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Identification of images in docx #273
Comments
@jkindahood Thanks for the feedback! Let us look into it and come back to you. I dont see any handling of pictures in the msword-backend. If we need to add pictures (which we obviously need to do), we need to update |
can you describe me what to do? |
Yes, absolutely, if you follow the link, you see that we are have no |
Thank you for your answer @PeterStaar-IBM. |
@jkindahood, small update, I'm working on PR to resolve image identification in DOCX: #330 |
@maxmnemonic all fine, i try to understand how your codebase works and still want to contribute. |
Hello everyone,
im working on docx document type with docling.
My first test was very poor.
I have a docx file which is only a testfile and this contains one image.
The standard procedure doesnt even recognise the image in the file...
Standard is this:
from docling.document_converter import DocumentConverter
source = "data/example_small.docx"
converter = DocumentConverter()
result = converter.convert(source)
for item in result.document:
print(item)
The example file is contained here!
Am im wrong with the code or is this a bug?
Greetings
example_small.docx
word_sample.docx
The text was updated successfully, but these errors were encountered: