Generated application - how do I break up the loading #497

Open
kehh opened this issue Jan 30, 2025 · 0 comments

kehh commented Jan 30, 2025

I'm finding that the ingestion pipeline is not able to deal with very large amounts of data: it appears to load all of the documents into memory before persisting them to a storage context. Is there a way to partition documents by loader, or something similar, so that we don't clear out all documents when we run a single loader? I'm specifically looking at the code in https://github.com/run-llama/create-llama/blob/main/templates/types/streaming/fastapi/app/engine/generate.py#L78
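
For reference, the kind of thing I have in mind is below. This is a minimal sketch, not tested against the template, assuming the LlamaIndex `IngestionPipeline` docstore/upsert API; `ingest_in_batches` and the per-loader callables are hypothetical names, not anything from create-llama. The idea is to run the pipeline once per loader, in fixed-size batches, with an upsert strategy so one loader's run doesn't wipe out documents ingested by the others:

```python
# Sketch only: partition ingestion by loader instead of loading everything
# into memory at once. Assumes the llama_index IngestionPipeline API;
# ingest_in_batches and the loader callables are hypothetical names.
from llama_index.core.ingestion import DocstoreStrategy, IngestionPipeline
from llama_index.core.node_parser import SentenceSplitter
from llama_index.core.storage.docstore import SimpleDocumentStore


def ingest_in_batches(loaders, batch_size: int = 100) -> None:
    docstore = SimpleDocumentStore()
    pipeline = IngestionPipeline(
        transformations=[SentenceSplitter()],
        docstore=docstore,
        # Upsert by document id/hash instead of rebuilding from scratch,
        # so running one loader doesn't clear out the others' documents.
        docstore_strategy=DocstoreStrategy.UPSERTS,
    )
    for load in loaders:
        # Only one loader's documents are held in memory at a time.
        documents = load()
        for start in range(0, len(documents), batch_size):
            pipeline.run(documents=documents[start : start + batch_size])
        # Persist after each loader so partial progress survives a failure.
        docstore.persist("./storage/docstore.json")
```

Each loader would be a zero-argument callable returning its documents, e.g. `lambda: SimpleDirectoryReader("./data").load_data()`, so the caller controls how the corpus is partitioned.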
