Fix prompt truncation logic #838
Conversation
/gcbrun exp -n dg -ag
/gcbrun exp -n dg -ag
Truncation is looking good now. I reckon we can be even more aggressive about how much text we truncate.
/gcbrun exp -n dg1 -ag
The report is looking alright:
/gcbrun skip
total_tokens = self.estimate_token_num(raw_prompt_text)

# Allow buffer space for potential prompts that will be appended later.
allowed_tokens = self.MAX_INPUT_TOKEN // 10 - extra_tokens
Why // 10? Can you add a comment to explain?
Done.
A bit more context:
This is mainly used when sending the stdout/stderr of the agent's bash commands or compilation requests to the LLM.
Empirically, each LLM response contains up to 10 such commands/requests, so we allocate at most 1/10 of MAX_INPUT_TOKEN to each item to keep the token distribution balanced.
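For illustration, a minimal sketch of that per-item budget, assuming a simple characters-per-token estimate; MAX_INPUT_TOKEN and estimate_token_num mirror names from the diff above, while the class name, the heuristic, and the limit value are hypothetical:

```python
# Sketch only: illustrates the 1/10-per-item budget described above.
class TokenBudgetSketch:
  MAX_INPUT_TOKEN = 1_000_000  # Assumed model input limit, not the real value.

  def estimate_token_num(self, text: str) -> int:
    # Rough ~4-characters-per-token heuristic, not the project's tokenizer.
    return max(1, len(text) // 4)

  def allowed_tokens(self, extra_tokens: int = 0) -> int:
    # Up to ~10 command/compilation outputs may be attached per LLM turn,
    # so each item gets at most 1/10 of the input budget, minus a buffer
    # for prompts appended later.
    return self.MAX_INPUT_TOKEN // 10 - extra_tokens
```

For example, TokenBudgetSketch().allowed_tokens(extra_tokens=500) reserves 500 tokens of headroom for later additions while keeping each item within a tenth of the overall limit.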
/gcbrun skip
Given an overlong prompt, we want to truncate it so that the marker

...(truncated due to exceeding input token limit)...

replaces enough of the prompt text that the final prompt fits within the token limit.
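A minimal sketch of one way such truncation could be implemented, assuming a simple characters-per-token estimate; the marker string is taken from the description above, while keeping the head and tail of the prompt (rather than, say, only the head) and all helper names are assumptions:

```python
# Sketch only: drops the middle of an overlong prompt so the result fits
# within the allowed token budget (estimated via characters per token).
TRUNCATION_MARKER = '\n...(truncated due to exceeding input token limit)...\n'


def truncate_prompt(raw_prompt_text: str,
                    allowed_tokens: int,
                    chars_per_token: int = 4) -> str:
  """Keeps the head and tail of the prompt and replaces the middle."""
  allowed_chars = allowed_tokens * chars_per_token
  if len(raw_prompt_text) <= allowed_chars:
    return raw_prompt_text
  # Reserve room for the marker itself, then split the remaining budget
  # between the beginning and the end of the prompt.
  keep = max(0, allowed_chars - len(TRUNCATION_MARKER))
  head = keep // 2
  tail = keep - head
  suffix = raw_prompt_text[-tail:] if tail else ''
  return raw_prompt_text[:head] + TRUNCATION_MARKER + suffix
```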