-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature Request: Convert PDF to Markdown inside Chat UI #441
Comments
Hey @julien-blanchon thanks for contributing all of this! I'm a bit busy atm with the agents feature but I have seen your PR #449 and I'll review it once I have a bit more time 😁 Just so you know it hasn't been forgotten haha |
Yes, but this is still a draft. As it don't fit the usage of 99% of the hf-chat user I don't know if we sould merge it or not (or maybe disable the feature by default).
|
Yeah I think in particular your drag&drop feature looks great! Would it be okay if we used it for agents? |
Yes of course, you're talking about #462 ? |
Yeah! We already support in the backend uploading images & audios for agents (stuff like speech transcription, image description), there's just no UI component for it so I think yours could work well when we implement that feature! |
@julien-blanchon I do not see that agent branch end up using the file drop UI. Just any updates on how we want to go from here is nice. Why is this PR dropped? I would love to see this merged to main. Many people want this: #609, #482 thanks! very nice to see that #442. |
Hi, good news! @mishig25 has started working on this feature so it should come soon! |
@mishig25 Feel free to tag me when a PR is out, I would love to help testing 😍. |
Should this mean we could also add Text files as well as HTML files? this would be amazing |
@julien-blanchon please feel free to test #641 |
Closing in favor of #609, right? Which is the one that the referenced PR (641) closes. |
Feature Description:
I'd like to propose the following feature: Add a PDF to Markdown converter and they include this Markdown content directly in the chat. This feature would not only enhance the user experience but also provide a seamless way to discuss and reference content from PDFs, especially research papers.
It's worth noting that similar features exist in platforms like the Anthropic Claude 2 and Perplexity.AI interface. However, their implementations primarily focus on a basic text extraction, which often results in a lossy conversion. Specifically, with mathematical equations, tables, and other complex formatting are not retaine. My proposed implementation aims to address this limitation by ensuring a more comprehensive and lossless conversion using the Mathpix Paid API, and maybe other service in the future.
Design Implementation:
Example
Pull Request
#442
The text was updated successfully, but these errors were encountered: