fix: handle None input_other in token usage to stop cli from crashing #265

ZakWork · 2025-11-13T01:36:23Z

Add defensive handling for cases where result.usage.input_other is None by setting it to 0. This prevents potential issues when provider returns token counts as None and ensures the app does not crash.

Related Issue

#264

Resolve #(issue_number)

Description

Added a check if the input_other field is None it gets replaced by 0

Add defensive handling for cases where result.usage.input_other is None by setting it to 0. This prevents potential issues when provider returns token counts as None and ensures the app does not crash.

src/kimi_cli/soul/kimisoul.py

stdrc · 2025-11-13T08:45:46Z

input_other is defined as int in kosong.chat_provider.TokenUsage, I don't think there should be any chance that it is set to None if pyright check is enabled.

stdrc · 2025-11-13T08:48:41Z

According to the OpenAI SDK definition of CompletionUsage, completion_tokens is also defined as int. Please report to the Chutes vendor that their API is not OpenAI-compatible.

ZakWork · 2025-11-13T09:16:00Z

This is for the Anthropic API endpoint but your point still makes sense as the Anthropic API also requires and integer for input tokens. However, the Kimi CLI should not crash and exit because the usage input_token is missing and instead gracefully deal with the problem. Claude Code deals with this and the Chutes Anthropic API endpoint has no such issue there.

Therefore, to make the Kimi CLI more robust either this change and/or the pull request in the Kosong library (MoonshotAI/kosong#21) should be made

stdrc · 2025-11-16T02:12:40Z

OK. Maybe you can contribute this to https://github.com/MoonshotAI/kosong/blob/main/src/kosong/contrib/chat_provider/anthropic.py, adding a note comment stating that it's specific to Chutes Anthropic API. Thanks!

ZakWork · 2025-11-16T21:14:35Z

Hey @stdrc

Yes, the fix needs to be there, I have looked into this and there's a bit more to it.

The reason for the crash was mainly due to the idea that for stream messages not all message types require input tokens to be included in the usage field, in fact this is only included in the first message "MessageStartEvent" the following events will not include this except for specific conditions. So most MessageDeltaEvent will have the input token field set as None in their MessageDeltaUsage.

You can find more info on this here:
https://docs.claude.com/en/docs/build-with-claude/streaming#full-http-stream-response

https://github.com/anthropics/anthropic-sdk-python/blob/d9aea38e754d55f8f0875fdf19ee44f78ca7b845/src/anthropic/types/raw_message_delta_event.py#L24

https://github.com/anthropics/anthropic-sdk-python/blob/d9aea38e754d55f8f0875fdf19ee44f78ca7b845/src/anthropic/types/message_delta_usage.py#L18

https://github.com/anthropics/anthropic-sdk-python/blob/d9aea38e754d55f8f0875fdf19ee44f78ca7b845/src/anthropic/types/usage.py#L23C4-L23C22

The current code is not set to handle this and therefore will fail not only for Chutes Anthropic API but most likely for all other providers using the Anthropic API given this is how the standard is suppose to work.

This pr should fix the issue above and stop the Kimi CLI from crashing

MoonshotAI/kosong#21

I have also identified a second issue with this and that is the MessageStartEvent is not used to updated the input token usage (this is used for context management and therefore this will also likely lead to an error). Furthermore, the usage update from MessageDeltaEvent is used to replace the usage object instead of updating it and although the data from MessageDeltaUsage is cumulative it does not necessarily have all the fields populated including the input tokens field. So currently the usage info comes in only from the MessageDeltaUsage and therefore the context limit is not calculated correctly (massively underestimated as the initial input tokens are discarded/not counted) which will probably result in an error from the provider for input being too long.

I have a fix for this (make sure to use MessageStartEvent data to set the initial usage then update usage data from MessageDeltaUsage correctly) I can update the current pr above or open a new one?

I am a big fan of the work your team has done with Kimi K2 Thinking and I hope this helps the kimi cli get better support/adoption by the community, keep up the great work!

stdrc · 2025-11-18T05:47:10Z

I have also identified a second issue with this and that is the MessageStartEvent is not used to updated the input token usage (this is used for context management and therefore this will also likely lead to an error). Furthermore, the usage update from MessageDeltaEvent is used to replace the usage object instead of updating it and although the data from MessageDeltaUsage is cumulative it does not necessarily have all the fields populated including the input tokens field. So currently the usage info comes in only from the MessageDeltaUsage and therefore the context limit is not calculated correctly (massively underestimated as the initial input tokens are discarded/not counted) which will probably result in an error from the provider for input being too long.

I have a fix for this (make sure to use MessageStartEvent data to set the initial usage then update usage data from MessageDeltaUsage correctly) I can update the current pr above or open a new one?

Could you please directly push the fix of the second issue to MoonshotAI/kosong#21? I guess it would be better to have them fixed together. Thanks for your time!

ZakWork · 2025-11-18T17:21:19Z

That should be ready for you to review now.

fix: handle None input_other in token usage

22c179a

Add defensive handling for cases where result.usage.input_other is None by setting it to 0. This prevents potential issues when provider returns token counts as None and ensures the app does not crash.

yihong0618 reviewed Nov 13, 2025

View reviewed changes

src/kimi_cli/soul/kimisoul.py Show resolved Hide resolved

stdrc closed this Nov 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: handle None input_other in token usage to stop cli from crashing #265

fix: handle None input_other in token usage to stop cli from crashing #265

ZakWork commented Nov 13, 2025

Uh oh!

Uh oh!

stdrc commented Nov 13, 2025 •

edited

Loading

Uh oh!

stdrc commented Nov 13, 2025

Uh oh!

ZakWork commented Nov 13, 2025

Uh oh!

stdrc commented Nov 16, 2025

Uh oh!

ZakWork commented Nov 16, 2025

Uh oh!

stdrc commented Nov 18, 2025

Uh oh!

ZakWork commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix: handle None input_other in token usage to stop cli from crashing #265

fix: handle None input_other in token usage to stop cli from crashing #265

Conversation

ZakWork commented Nov 13, 2025

Related Issue

Description

Uh oh!

Uh oh!

stdrc commented Nov 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stdrc commented Nov 13, 2025

Uh oh!

ZakWork commented Nov 13, 2025

Uh oh!

stdrc commented Nov 16, 2025

Uh oh!

ZakWork commented Nov 16, 2025

Uh oh!

stdrc commented Nov 18, 2025

Uh oh!

ZakWork commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

stdrc commented Nov 13, 2025 •

edited

Loading