Improve Handling of Large JSON Data Arrays #2

bloodlinealpha · 2024-01-27T18:59:23Z

Overview

The NHL GPT model currently experiences issues when processing large JSON data arrays, leading to the hallucination of data. This issue significantly impacts the accuracy and reliability of the model's output, especially in scenarios involving complex or extensive datasets. Noted by user in issue #1.

Problem Description

When the NHL GPT model processes large JSON data arrays, it tends to generate incorrect or fabricated data, which does not accurately represent the original dataset. This problem appears to be more pronounced with increasing size and complexity of the JSON data.

Problem Analysis

token window limitations
python analysis interpreter seems to assume data at times, especially if for larger lists.
we do not have control of the NHL API, so we cannot modify the response arguments in hopes of reducing the response length.

Expected Behavior

Ideally, the model should accurately interpret and utilize large JSON data arrays without altering, omitting, or fabricating information. The ability to handle complex datasets is essential for maintaining the integrity and usefulness of the model in various NHL data analysis scenarios.

Suggested Enhancement

To address this issue, the following enhancements are proposed:

Optimize Data Parsing: Update the GPT configuration logic to put more emphasis on using only necessary data while parsing.
Create API to transform data: Create custom API to transform NHL API data and update the GPT Actions to use these more specific endpoints. This would act as a buffer or transformation layer to restructure the data.

Adds BloodLineAlpha api wrapper for game-logs endpoint and removes the NHL Web API endpoint for GPT Actions. Used to improve accuracy for game-log responses and results. Part of #2

bloodlinealpha added the enhancement New feature or request label Jan 27, 2024

bloodlinealpha self-assigned this Jan 27, 2024

bloodlinealpha mentioned this issue Jan 27, 2024

This GPT is cannot even perform a simple analysis #1

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Handling of Large JSON Data Arrays #2

Improve Handling of Large JSON Data Arrays #2

bloodlinealpha commented Jan 27, 2024

Improve Handling of Large JSON Data Arrays #2

Improve Handling of Large JSON Data Arrays #2

Comments

bloodlinealpha commented Jan 27, 2024

Overview

Problem Description

Problem Analysis

Expected Behavior

Suggested Enhancement