You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If you create a workflow with a file input and a data output, and try to load a pdf with more than 8 pages, you will that the file output content is truncated at 7 or 8 pages...
Reproduction
Steps
1-create a flow
2-with a file loader
3-put a pdf with more than 8 pages
4-execute the flow
5-output is truncated at 8 pages
Expected behavior
Should load all pages of the pdf
Who can help?
No response
Operating System
Ubuntu
Langflow Version
1.1.2
Python Version
None
Screenshot
No response
Flow File
No response
The text was updated successfully, but these errors were encountered:
The issue you're experiencing with the file widget truncating PDF content at 7 or 8 pages might be related to the PDF parsing method used in Langflow. The library pypdf is used for PDF parsing, specifically the PdfReader class, which extracts text from PDF files [1]. There might be a limitation or bug in how the parse_pdf_to_text function processes the pages, or there could be a configuration setting affecting this behavior.
To address this, you might want to check if there are any updates or patches available for the pypdf library or Langflow itself that address this issue. Additionally, reviewing the implementation of the parse_pdf_to_text function could provide insights into any potential limitations or areas for improvement.
Bug Description
If you create a workflow with a file input and a data output, and try to load a pdf with more than 8 pages, you will that the file output content is truncated at 7 or 8 pages...
Reproduction
Steps
1-create a flow
2-with a file loader
3-put a pdf with more than 8 pages
4-execute the flow
5-output is truncated at 8 pages
Expected behavior
Should load all pages of the pdf
Who can help?
No response
Operating System
Ubuntu
Langflow Version
1.1.2
Python Version
None
Screenshot
No response
Flow File
No response
The text was updated successfully, but these errors were encountered: