
Conversation

cubic-dev-local[bot]

## Description

https://linear.app/danswer/issue/DAN-2573/move-more-imports-onto-lazy-loading

I have a script that I run to check memory usage for Celery workers.

Before this PR it's ~600 MB per worker.

After, it's ~250 MB for some workers.

Docker container: 4.3 GB -> 2.819 GiB.

Diagnosing why is not easy: it's not that all pip dependencies get loaded into memory at worker start (in which case I could just lazy-load any of them); it's specifically the ones that get imported at runtime due to an actual import statement.

This makes it very tricky to track down exactly what causes the 600 MB. I literally had to trial-and-error suspicious imports, tracing from the Celery worker main file.
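For reference, a minimal sketch of that kind of per-worker check (the actual script is not included in this PR, so this is an assumption about its shape), assuming psutil is installed and workers are identifiable by their command line:

```python
# Hypothetical sketch: sum resident memory (RSS) of running Celery workers.
# Assumes `psutil` is installed and workers have "celery" and "worker" in
# their command line; the real script used for this PR is not shown here.
import psutil

total = 0
for proc in psutil.process_iter(["pid", "cmdline", "memory_info"]):
    cmdline = " ".join(proc.info["cmdline"] or [])
    if "celery" in cmdline and "worker" in cmdline:
        rss = proc.info["memory_info"].rss
        total += rss
        print(f"{proc.info['pid']}: {rss / 2**20:.0f} MiB")
print(f"total: {total / 2**20:.0f} MiB")
```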

TBH the existing repo dependency graph is a little scuffed. One example branch that caused the worker to import LLM stuff (there are like a dozen of these I had to sift through to get all the memory offenders down):

app base
-> /Users/edwinluo/onyx/backend/onyx/background/celery/tasks/docprocessing/utils.py
-> redis connector
-> /Users/edwinluo/onyx/backend/onyx/redis/redis_connector_delete.py
-> /Users/edwinluo/onyx/backend/onyx/db/document.py
-> /Users/edwinluo/onyx/backend/onyx/db/feedback.py
-> /Users/edwinluo/onyx/backend/onyx/db/chat.py
-> /Users/edwinluo/onyx/backend/onyx/context/search/utils.py
-> /Users/edwinluo/onyx/backend/onyx/db/search_settings.py
-> /Users/edwinluo/onyx/backend/onyx/db/llm.py OR /Users/edwinluo/onyx/backend/onyx/natural_language_processing/search_nlp_models.py
-> /Users/edwinluo/onyx/backend/onyx/llm/utils.py (langchain, litellm, etc.)
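For tracking down a single offender, one cheap check (not necessarily how it was done for this PR) is to measure the resident-set growth caused by importing one suspected branch; CPython's `python -X importtime` is another option, though it reports time rather than memory. A sketch, assuming psutil:

```python
# Hypothetical check: how much RSS does importing one suspected branch add?
# Assumes psutil is installed; run in a fresh interpreter so the module is
# not already cached in sys.modules.
import psutil

proc = psutil.Process()
before = proc.memory_info().rss

import onyx.db.llm  # one of the heavy branches from the chain above

after = proc.memory_info().rss
print(f"import onyx.db.llm cost: {(after - before) / 2**20:.0f} MiB")
```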

## How Has This Been Tested?

[Describe the tests you ran to verify your changes]

## Backporting (check the box to trigger backport action)

Note: You have to check that the action passes; otherwise, resolve the conflicts manually and tag the patches.

  • This PR should be backported (make sure to check that the backport attempt succeeds)
  • [Optional] Override Linear Check

## Summary by cubic

Moves heavy imports to lazy-loading across indexing, LLM, NLP, and file-processing code to reduce worker memory and speed up startup. Also consolidates search doc conversion into SearchDoc and extracts PromptSnapshot to a shared schema. Addresses Linear DAN-2573 (Reduce Memory usage in Onyx).

  • Refactors

    • Lazy-load litellm, tiktoken, openai, markitdown, read_pdf_file, instantiate_connector, and run_indexing_pipeline within functions (see the sketch after this list).
    • Move chunks_or_sections_to_search_docs into SearchDoc as classmethods (plus from_inference_section/from_inference_chunk); remove the utils version.
    • Extract PromptSnapshot to onyx.chat.prompt_builder.schemas and update imports.
    • Redis connector delete: remove direct DB import to break import chain; per-document enqueue loop is currently disabled.
  • Migration

    • Replace chunks_or_sections_to_search_docs(...) with SearchDoc.chunks_or_sections_to_search_docs(...).
    • Import PromptSnapshot from onyx.chat.prompt_builder.schemas.
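For illustration, the lazy-loading refactor is just moving an import from module scope into the function that needs it. A minimal sketch of the pattern, not the actual diff (`count_tokens` is a hypothetical stand-in):

```python
# Before: a module-level `import tiktoken` would load the dependency into
# every process that transitively imports this module, even if it never
# tokenizes anything.

def count_tokens(text: str) -> int:
    # After: the import happens on first call only. Repeat calls are cheap
    # because Python caches the module in sys.modules.
    import tiktoken

    encoding = tiktoken.get_encoding("cl100k_base")
    return len(encoding.encode(text))
```

The tradeoff is that the first call pays the import cost instead of worker startup, which is what you want for code paths most workers never hit.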
---

Based on: onyx-dot-app/onyx#5478


cubic-dev-ai bot left a comment


1 issue found across 6 files

Prompt for AI agents (1 issue)

Understand the root cause of the following issue and fix it.


<file name="backend/onyx/context/search/models.py">

<violation number="1" location="backend/onyx/context/search/models.py:360">
chunks_or_sections_to_search_docs drops is_relevant and relevance_explanation from InferenceChunk, unlike the specific converters, causing loss of relevance data for downstream consumers.</violation>
</file>


```python
is_internet: bool = False

@classmethod
def chunks_or_sections_to_search_docs(
```

cubic-dev-ai bot commented on Sep 24, 2025


chunks_or_sections_to_search_docs drops is_relevant and relevance_explanation from InferenceChunk, unlike the specific converters, causing loss of relevance data for downstream consumers.


File context:

```diff
@@ -355,6 +356,97 @@ class SearchDoc(BaseModel):
     is_internet: bool = False

+    @classmethod
+    def chunks_or_sections_to_search_docs(
+        cls,
+        items: "Sequence[InferenceChunk | InferenceSection] | None",
```
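A plausible shape for the fix (a sketch only; field names other than the two cited in the comment are assumptions) is for the InferenceChunk branch of the generic classmethod to populate the relevance fields rather than dropping them:

```python
# Hypothetical sketch: in the InferenceChunk branch of
# chunks_or_sections_to_search_docs, carry the relevance metadata through.
# `is_relevant` and `relevance_explanation` come from the review comment;
# the surrounding fields are assumed for illustration.
doc = SearchDoc(
    document_id=chunk.document_id,
    is_relevant=chunk.is_relevant,
    relevance_explanation=chunk.relevance_explanation,
    # ... remaining fields copied as in from_inference_chunk ...
)
```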

