
[Critical] - Fix ollama_chat reasoning content#20750

Merged
2 commits merged into BerriAI:litellm_oss_staging_02_09_2026 from DenisStefanAndrei:patch-1
Feb 10, 2026

Conversation

DenisStefanAndrei (Contributor) commented Feb 9, 2026

Relevant issues

For ollama_chat models, reasoning content is ignored after 2 consecutive thinking chunks.
Fixes #20737

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

  • [x] I have added testing in the tests/litellm/ directory. Adding at least 1 test is a hard requirement (see details)
  • My PR passes all unit tests on make test-unit
  • [x] My PR's scope is as isolated as possible; it only solves 1 specific problem

CI (LiteLLM team)

CI status guideline:

  • 50-55 passing tests: main is stable with minor issues.
  • 45-49 passing tests: acceptable but needs attention.
  • <= 40 passing tests: unstable; be careful with your merges and assess the risk.
  • Branch creation CI run
    Link:

  • CI run for the last commit
    Link:

  • Merge / cherry-pick CI run
    Links:

Type

🆕 New Feature
🐛 Bug Fix
🧹 Refactoring
📖 Documentation
🚄 Infrastructure
✅ Test

Changes

For ollama_chat models, reasoning content is ignored after 2 consecutive thinking chunks.
vercel Bot commented Feb 9, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Actions | Updated (UTC) |
| --- | --- | --- | --- |
| litellm | Ready | Preview, Comment | Feb 9, 2026 11:21am |


greptile-apps Bot (Contributor) commented Feb 9, 2026

Greptile Overview

Greptile Summary

Fixed critical bug in Ollama chat streaming where reasoning_content (thinking chunks) was ignored after the first 2 consecutive chunks. The old code incorrectly set finished_reasoning_content = True after processing just 2 thinking chunks, causing all subsequent thinking content to be lost. The fix removes the flawed counter logic and only sets finished_reasoning_content = True when transitioning from thinking chunks to regular content chunks.

Key changes:

  • Removed the buggy conditional logic that limited thinking chunks to 2
  • Now all message.thinking chunks are properly returned as reasoning_content
  • Transition detection moved to the content handling branch to properly mark when reasoning ends
  • Added comprehensive test coverage with 4 test cases covering multiple thinking chunks, transitions, <think> tags, and done chunks
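The corrected branching can be sketched as a minimal, self-contained simulation of the streaming state machine. Note this is illustrative only: the class name `ReasoningChunkParser` and its `parse` method are hypothetical simplifications, not the actual `chunk_parser` code in `litellm/llms/ollama/chat/transformation.py`; only the two flag names mirror the PR description.

```python
class ReasoningChunkParser:
    """Illustrative sketch of the fixed Ollama chat streaming logic.

    The old code counted thinking chunks and flipped
    finished_reasoning_content after just 2 of them; the fix emits
    every thinking chunk and only marks reasoning as finished when a
    regular-content chunk arrives.
    """

    def __init__(self):
        self.started_reasoning_content = False
        self.finished_reasoning_content = False

    def parse(self, thinking=None, content=None):
        """Return a (reasoning_content, content) pair for one chunk."""
        if thinking is not None:
            # Every thinking chunk is surfaced as reasoning_content;
            # there is no counter that cuts it off after two chunks.
            self.started_reasoning_content = True
            return (thinking, None)
        if content is not None:
            # Transition: the first regular-content chunk after
            # thinking marks the end of the reasoning phase.
            if self.started_reasoning_content:
                self.finished_reasoning_content = True
            return (None, content)
        return (None, None)


parser = ReasoningChunkParser()
reasoning = [parser.parse(thinking=f"Chunk {i}")[0] for i in (1, 2, 3)]
answer = parser.parse(content="Answer")
```

Feeding three consecutive thinking chunks followed by a content chunk yields all three reasoning chunks (the third would have been dropped by the old logic) and then flips `finished_reasoning_content`.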

Confidence Score: 5/5

  • This PR is safe to merge with high confidence
  • The fix is straightforward and clearly addresses the reported issue. The old logic had an obvious bug limiting thinking chunks to 2, and the new logic correctly processes all thinking chunks. Comprehensive test coverage validates all edge cases including multiple chunks, transitions, tag parsing, and done chunks. The change is isolated to the streaming response iterator with no impact on other components.
  • No files require special attention

Important Files Changed

Filename Overview
litellm/llms/ollama/chat/transformation.py Fixed bug where reasoning_content was ignored after 2 consecutive thinking chunks - now all thinking chunks are properly captured
tests/test_litellm/llms/ollama/test_ollama_chat_transformation.py Added comprehensive test coverage for reasoning_content streaming with multiple thinking chunks and edge cases

Sequence Diagram

sequenceDiagram
    participant Client
    participant OllamaChatCompletionResponseIterator
    participant chunk_parser
    participant Delta

    Note over OllamaChatCompletionResponseIterator: started_reasoning_content = False<br/>finished_reasoning_content = False

    Client->>OllamaChatCompletionResponseIterator: chunk 1 (thinking: "Chunk 1")
    OllamaChatCompletionResponseIterator->>chunk_parser: Parse chunk
    chunk_parser->>chunk_parser: Check message.thinking != None
    chunk_parser->>chunk_parser: Set reasoning_content = "Chunk 1"
    chunk_parser->>chunk_parser: Set started_reasoning_content = True
    chunk_parser->>Delta: Create delta with reasoning_content
    Delta-->>Client: Return chunk with reasoning_content

    Client->>OllamaChatCompletionResponseIterator: chunk 2 (thinking: "Chunk 2")
    OllamaChatCompletionResponseIterator->>chunk_parser: Parse chunk
    chunk_parser->>chunk_parser: Check message.thinking != None
    chunk_parser->>chunk_parser: Set reasoning_content = "Chunk 2"
    chunk_parser->>Delta: Create delta with reasoning_content
    Delta-->>Client: Return chunk with reasoning_content

    Client->>OllamaChatCompletionResponseIterator: chunk 3 (thinking: "Chunk 3")
    OllamaChatCompletionResponseIterator->>chunk_parser: Parse chunk
    chunk_parser->>chunk_parser: Check message.thinking != None
    chunk_parser->>chunk_parser: Set reasoning_content = "Chunk 3"
    Note right of chunk_parser: OLD BUG: Would skip this<br/>because finished_reasoning_content<br/>was set to True after 2 chunks
    chunk_parser->>Delta: Create delta with reasoning_content
    Delta-->>Client: Return chunk with reasoning_content

    Client->>OllamaChatCompletionResponseIterator: chunk 4 (content: "Answer")
    OllamaChatCompletionResponseIterator->>chunk_parser: Parse chunk
    chunk_parser->>chunk_parser: Check message.content != None
    chunk_parser->>chunk_parser: Set finished_reasoning_content = True
    chunk_parser->>chunk_parser: Set content = "Answer"
    chunk_parser->>Delta: Create delta with content
    Delta-->>Client: Return chunk with content

greptile-apps Bot left a comment


2 files reviewed, no comments


DenisStefanAndrei (Contributor, Author) commented Feb 9, 2026

@Sameerlite, @ishaan-jaff, @krrishdholakia could you please take a look at this small fix? We really need it in our product. 🙏

@ghost ghost changed the base branch from main to litellm_oss_staging_02_09_2026 February 10, 2026 02:53
@ghost ghost merged commit 5adee48 into BerriAI:litellm_oss_staging_02_09_2026 Feb 10, 2026
10 of 13 checks passed
Sameerlite pushed a commit that referenced this pull request Feb 10, 2026
* Fix ollama_chat reasoning_context.

For ollama_chat models, reasoning context is ignored after 2 consecutive thinking chunks.

* add test
Sameerlite (Collaborator) commented Feb 18, 2026 via email

This pull request was closed.

Labels

None yet


2 participants