Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Good first issue | Enhance] Add column description info in LLM prompt #142

Open
MooooCat opened this issue Feb 26, 2024 · 0 comments
Open
Labels
difficulty-easy enhancement New feature or request good first issue Good for newcomers help wanted Extra attention is needed
Milestone

Comments

@MooooCat
Copy link
Contributor

🚅Search before asking

I have searched for issues similar to this one.

🚅Description

This GFI allows sdgx to obtain the column description from raw_data or raw_data sampled data, and return a string in a text form that LLM can understand. The information in the text should include but not be limited to:

  • Column type: float type, int type, category type, datetime type, etc.;
  • For numeric type columns: maximum value, minimum value, mean, standard deviation, distribution, etc.;
  • For datetime type: start and end dates, datetime type format, etc.;
  • For category types: specific category value, number of category values, etc.;
  • For ID type: ID category, format, etc.;
  • For other types of necessary information, developers are asked to add it based on their own creativity.

🏕Solution

Implements the _form_columns_description method of sdgx.models.LLM.single_table.base.LLMBaseModel.

def _form_columns_description(self):

This method returns a string.

Developers can refer to the implementation ideas of _form_message_with_offtable_features and _form_dataset_description methods.


@MooooCat MooooCat added enhancement New feature or request good first issue Good for newcomers help wanted Extra attention is needed difficulty-easy labels Feb 26, 2024
@MooooCat MooooCat changed the title [Good first issue | Ehnance] Add column description info in LLM prompt [Good first issue | Enhance] Add column description info in LLM prompt Feb 26, 2024
@MooooCat MooooCat modified the milestones: 0.2.0, 1.0.0 Feb 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
difficulty-easy enhancement New feature or request good first issue Good for newcomers help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

1 participant