Total Tokens & Total Cost by Model was not being output correctly… #53

Closed

Conversation

daihiraoka

1. Issue Summary

  1. Total Tokens by Model was not being output correctly.
  2. Total Cost by Model was not being output.
  3. AI system names were being identified as "n/a" instead of their correct names.
  4. IntervalTokens was consistently reported as 0 in token usage collection and reporting.

2. Investigation Results

  1. The AI system name inference logic was inadequate, always outputting "n/a".
  2. The MetricsCollectorService class was not correctly processing token usage metrics (gen_ai.client.token.usage).
  3. The token aggregation logic in the LLMDc class's collectData method was problematic.
  4. The token count and cost calculation logic was incomplete.
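
  Context: as the maintainer notes further down in this conversation, traceloop versions above 0.18.2 switch to unified metric names beginning with gen_ai, report the provider name in gen_ai.system, and use a histogram metric type for tokens; a Sum-based processing path would therefore see no token data, which is consistent with IntervalTokens being reported as 0.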

3. Interim Solutions Implemented

3.1 MetricsCollectorService Class Changes

  1. Improved AI system inference logic:

    private String inferAiSystem(String metricName, String aiSystem) {
        if (aiSystem == null || aiSystem.isEmpty() || aiSystem.equals("n/a")) {
            if (metricName.startsWith("gen_ai.")) {
                return "openai";
            } else if (metricName.startsWith("llm.")) {
                String[] parts = metricName.split("\\.", 3);
                if (parts.length > 1) {
                    return parts[1];
                }
            }
            return "unknown";
        }
        return aiSystem;
    }
  2. Enhanced token processing in the processSumMetric method (a sketch of the assumed attribute extraction follows this list):

    if (!modelId.isEmpty()) {
        OtelMetric otelMetric = new OtelMetric();
        otelMetric.setModelId(modelId);
        otelMetric.setAiSystem(aiSystem);
        if (tokenType.compareTo("prompt") == 0 || tokenType.compareTo("input") == 0) {
            otelMetric.setPromptTokens(tokens);
        } else if (tokenType.compareTo("completion") == 0 || tokenType.compareTo("output") == 0) {
            otelMetric.setCompleteTokens(tokens);
        } else {
            otelMetric.setPromptTokens(tokens);
            otelMetric.setCompleteTokens(tokens);
        }
        exportMetrics.add(otelMetric);
    }
  3. Improved logging for better debugging and traceability.
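
  With the inference logic in item 1, a metric named gen_ai.client.token.usage whose ai-system attribute is missing or "n/a" is attributed to "openai", while a metric name beginning with "llm." (for example, a hypothetical llm.watsonx.completion.tokens) is attributed to the segment after "llm.", i.e. "watsonx".

  The token-type branch in item 2 assumes that modelId, aiSystem, and tokenType have already been read from the data point's attributes. The sketch below is only an illustration of that extraction, not the actual processSumMetric code; the attribute keys (gen_ai.response.model, gen_ai.request.model, gen_ai.system, gen_ai.token.type) are assumptions borrowed from the OpenTelemetry GenAI conventions and may not match what a given traceloop version emits:

    import java.util.Map;

    // Hypothetical helper (illustration only): resolve the fields consumed by
    // the token-type branch in item 2 from a data point's attribute map.
    final class TokenUsageAttributes {
        final String modelId;
        final String aiSystem;
        final String tokenType;

        TokenUsageAttributes(Map<String, String> attributes) {
            // Prefer the response model, fall back to the request model.
            this.modelId = attributes.getOrDefault("gen_ai.response.model",
                    attributes.getOrDefault("gen_ai.request.model", ""));
            // Provider name, e.g. "openai" or "watsonx"; "n/a" triggers inferAiSystem.
            this.aiSystem = attributes.getOrDefault("gen_ai.system", "n/a");
            // Token type, e.g. "input"/"output" (older traceloop: "prompt"/"completion").
            this.tokenType = attributes.getOrDefault("gen_ai.token.type", "");
        }
    }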

3.2 LLMDc Class Changes

  1. Enhanced config.yaml settings logging:

    private void logLLMSpecificConfig(Map<String, Object> properties) {
        logger.info("LLM Specific Configuration:");
        logger.info("  OPENAI_PRICE_PROMPT_TOKES_PER_KILO: " + 
                    properties.getOrDefault(OPENAI_PRICE_PROMPT_TOKES_PER_KILO, "Not set"));
        logger.info("  OPENAI_PRICE_COMPLETE_TOKES_PER_KILO: " + 
                    properties.getOrDefault(OPENAI_PRICE_COMPLETE_TOKES_PER_KILO, "Not set"));
        // Log other relevant configuration values
    }
  2. Improved token and cost calculation in the collectData method (a worked numeric example follows this list):

    double intervalPromptTokens = (double)deltaPromptTokens/intervalSeconds;
    double intervalCompleteTokens = (double)deltaCompleteTokens/intervalSeconds;
    double intervalTotalTokens = intervalPromptTokens + intervalCompleteTokens;
    double intervalPromptCost = (intervalPromptTokens/1000) * pricePromptTokens;
    double intervalCompleteCost = (intervalCompleteTokens/1000) * priceCompleteTokens;
    double intervalTotalCost = intervalPromptCost + intervalCompleteCost;
  3. Added separate methods for getting prompt and complete token prices:

    private double getPricePromptTokens(String aiSystem) {
        switch (aiSystem) {
            case "watsonx": return watsonxPricePromptTokens;
            case "openai": return openaiPricePromptTokens;
            case "anthropic": return anthropicPricePromptTokens;
            default: return 0.0;
        }
    }
    
    private double getPriceCompleteTokens(String aiSystem) {
        switch (aiSystem) {
            case "watsonx": return watsonxPriceCompleteTokens;
            case "openai": return openaiPriceCompleteTokens;
            case "anthropic": return anthropicPriceCompleteTokens;
            default: return 0.0;
        }
    }
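
  As a worked numeric example of the interval calculation in item 2 (using the openai prompt price of 0.0005 per 1k tokens shown in section 5): with deltaPromptTokens = 3000 over a 60-second interval, intervalPromptTokens = 3000 / 60 = 50, and intervalPromptCost = (50 / 1000) * 0.0005 = 0.000025. Values of this magnitude are why the cost display issue noted in section 5 appears. Note also that getPricePromptTokens and getPriceCompleteTokens return 0.0 for any unrecognized aiSystem, so costs for unknown providers are reported as zero.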

4. Results After Modifications

  • IntervalTokens is now correctly reported with non-zero values.
  • IntervalCost is calculated and reported (though values are very small).
  • AI system names are correctly identified and reported, no longer showing as "n/a".
  • Different AI systems and models are correctly identified and reported.
  • All config.yaml settings, including pricing, are logged at startup for verification.

5. Remaining Issues and Recommendations

  1. Total Cost by Model Display: UI improvements are needed to handle the very small values that result from per-kilo pricing, e.g.:
    openai.price.prompt.tokens.per.kilo: 0.0005 # 1M tokens = $0.50 ---> 1k tokens = $0.0005
    openai.price.complete.tokens.per.kilo: 0.0015 # 1M tokens = $1.50 ---> 1k tokens = $0.0015

  2. Token Count and Cost Calculation Precision: May need adjustment.

  3. AI System Name Inference: Current logic prioritizes OpenAI; it should be expanded to handle WatsonX and Anthropic equally.

  4. Multiple Price Inputs for Different Models: Support entering different prices for different models, such as GPT-4 and GPT-3.5, to account for pricing variations (see the sketch after this list).
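
A minimal sketch of how the per-model pricing in point 4 could be supported, assuming a hypothetical lookup table keyed by provider and model with a fallback to the existing per-provider price; the class, key format, and prices below are illustrative only and not part of the current code or configuration:

    import java.util.HashMap;
    import java.util.Map;

    // Hypothetical per-model price table with a per-provider fallback.
    final class ModelPriceTable {
        private final Map<String, Double> perModel = new HashMap<>();
        private final Map<String, Double> perProvider = new HashMap<>();

        void putModelPrice(String aiSystem, String modelId, double pricePerKilo) {
            perModel.put(aiSystem + ":" + modelId, pricePerKilo);
        }

        void putProviderPrice(String aiSystem, double pricePerKilo) {
            perProvider.put(aiSystem, pricePerKilo);
        }

        // Fall back to the provider-level price (the current behaviour) when no
        // model-specific price is configured; 0.0 if neither is known.
        double price(String aiSystem, String modelId) {
            return perModel.getOrDefault(aiSystem + ":" + modelId,
                    perProvider.getOrDefault(aiSystem, 0.0));
        }
    }

For example, putModelPrice("openai", "gpt-4", 0.03) together with putProviderPrice("openai", 0.0005) would price gpt-4 prompts differently from other OpenAI models while keeping the existing default.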

These interim modifications have addressed the major issues.

Note:
These changes were implemented by a non-Java expert and are interim solutions.
The accuracy and completeness of the solutions are not guaranteed, and further review and testing by Java experts are recommended.

Contact: [email protected]

@liurui-1 requested review from jinsongo and liyanwei93 on July 15, 2024 04:44
jinsongo (Collaborator) commented Jul 15, 2024

@daihiraoka The DC needs to work with traceloop 0.18.2; I think that's why you encountered many problems. Please reference the document: https://www.ibm.com/docs/en/instana-observability/current?topic=technologies-monitoring-llms

If we need to support traceloop above 0.18.2, we must sync with the related traceloop PRs, such as getting the AI provider name directly from gen_ai.system, using the unified metric names beginning with gen_ai, and using the histogram metric type for tokens, etc.

daihiraoka (Author)

@jinsongo
Regarding the previous issue, as you pointed out, the problem was due to the version of traceloop I was using. I was using a version above 0.18.2, which caused the issues. By switching to version 0.18.2, I was able to get it working properly.

I apologize for any inconvenience caused.

Thanks!

jinsongo (Collaborator)

@daihiraoka As you can see, Traceloop is in a phase of rapid development and iteration, with even some critical aspects still changing. That is why we have had to pin to a specific version of Traceloop for a certain period. Some parts of the current code might seem confusing because Traceloop is still making improvements, which keeps us from settling on a stable implementation. Later, we plan to update and optimize our monitoring code based on a relatively stable version of Traceloop.
Thank you for your contribution. I will reference your improvements and suggestions for the next release.
