Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[feat] preprocessing helper #575

Open
ngupta23 opened this issue Dec 19, 2024 · 0 comments
Open

[feat] preprocessing helper #575

ngupta23 opened this issue Dec 19, 2024 · 0 comments

Comments

@ngupta23
Copy link
Member

ngupta23 commented Dec 19, 2024

Description

Based on various user feedback

Although there are tutorials on how to process the data, it would be beneficial to provide helper functions to check or preprocess the data in the right format

Some examples indicated by users

  • Flag duplicate rows for combination of id_col, time_col - Could indicate a need for aggregation
  • Flag duplicate rows for combination of id_col, time_col and target_col - Could indicate duplicates
  • Flag missing dates for each unique_id. Could also indicate to the user that they can use fill_gaps function from utilsforecast
  • Flag non-numeric columns (categorical, etc.) in the data
  • Flags for short time series (not sure how this will work, need to think about it) - remove fine tuning for short time series, separate data into short and long time series and make 2 calls - short without preprocessing, long with preprocessing.

cc: @cchallu

Link

No response

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant