Health checker for Qlib Data #854
Comments
@you-n-g Looking into this at the moment. Is there any related work on such a checker (papers, repos) to point me in the right direction? Or could you specify some of the key features this checker would have?
Hi @benheckmann, thanks for your interest in Qlib. Here is the motivation for a health checker. Many users try to run models in Qlib on their own data. However, there is no guarantee that they have converted their data correctly or provided all the fields that the related modules need. For example, the following checkers may be helpful for users (although some checkers may only be able to give warnings instead of errors, because they cannot distinguish between an error and a characteristic specific to a certain market).
As Qlib continues to develop, there will be an increasing number of checkers. Therefore, the health checker is expected to be an extensible framework.
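To make the "extensible framework" idea concrete, here is a minimal sketch of a checker registry in plain Python. It is not Qlib's API: the names `Finding`, `register`, `run_health_check`, the field list in `REQUIRED_FIELDS`, and the two example checks are all assumptions chosen for illustration. Each checker reports either an error or a warning, matching the distinction described above.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

# One row of user data: field name -> value (None marks a missing value).
Row = Dict[str, Optional[float]]


@dataclass
class Finding:
    checker: str
    severity: str  # "error" or "warning"
    message: str


# Registry of checkers; new checks are added by decorating a function.
CHECKERS: Dict[str, Callable[[List[Row]], List[Finding]]] = {}


def register(name: str):
    """Register a checker function under the given name."""
    def decorator(func):
        CHECKERS[name] = func
        return func
    return decorator


# Assumption: a hypothetical set of fields that downstream modules need.
REQUIRED_FIELDS = ("open", "high", "low", "close", "volume")


@register("required_fields")
def check_required_fields(rows: List[Row]) -> List[Finding]:
    """Error when a required field never appears in the data."""
    present = set()
    for row in rows:
        present.update(row.keys())
    return [
        Finding("required_fields", "error", f"missing field: {field}")
        for field in REQUIRED_FIELDS
        if field not in present
    ]


@register("missing_values")
def check_missing_values(rows: List[Row]) -> List[Finding]:
    """Warning only: gaps may be legitimate (e.g. a suspended instrument)."""
    findings = []
    for field in REQUIRED_FIELDS:
        count = sum(1 for row in rows if field in row and row[field] is None)
        if count:
            findings.append(
                Finding("missing_values", "warning", f"{field}: {count} missing value(s)")
            )
    return findings


def run_health_check(rows: List[Row]) -> List[Finding]:
    """Run every registered checker and collect the findings."""
    findings: List[Finding] = []
    for checker in CHECKERS.values():
        findings.extend(checker(rows))
    return findings
```

With this shape, adding a checker only means writing one decorated function, and the runner does not need to change.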
Hi @you-n-g, just drafted a first idea, and would really appreciate some feedback. Also, some questions I still have:
Thank you.
Thanks.
…r check, reformatted summary
Hi @benheckmann, @Fivele-Li will help to review this issue.
+1 for this feature. Also, looking at this holistically, data completeness is the easier thing to check. But I've come across scenarios where consistency can cause issues. One example is a difference in how data collected after market close is normalized, which changes the precision. While it is not the end of the world, this results in differences of a few percentage points between data collected at different times. See the highlighted precision difference below in data collected 1 minute apart.
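A consistency check along these lines could compare two snapshots of the same data and flag values that differ by more than a relative tolerance. This is only a rough sketch: the function name `find_inconsistencies`, the `(instrument, date)` key layout, the default tolerance, and the sample tickers are assumptions, not anything Qlib provides.

```python
import math
from typing import Dict, List, Tuple

Key = Tuple[str, str]          # (instrument, date), hypothetical layout
Snapshot = Dict[Key, float]    # one field, e.g. close price


def find_inconsistencies(
    first: Snapshot, second: Snapshot, rel_tol: float = 1e-6
) -> List[Tuple[Key, float, float, float]]:
    """Return (key, first value, second value, relative difference) for each
    key present in both snapshots whose values differ beyond rel_tol."""
    mismatches = []
    for key in sorted(first.keys() & second.keys()):
        a, b = first[key], second[key]
        if not math.isclose(a, b, rel_tol=rel_tol):
            # The denominator is nonzero here: two zeros would have
            # compared as close and been skipped above.
            rel_diff = abs(a - b) / max(abs(a), abs(b))
            mismatches.append((key, a, b, rel_diff))
    return mismatches


# Usage: the same day collected twice, once with reduced precision.
earlier = {("AAA", "2022-01-03"): 10.123456, ("BBB", "2022-01-03"): 5.0}
later = {("AAA", "2022-01-03"): 10.12, ("BBB", "2022-01-03"): 5.0}
for key, a, b, rel_diff in find_inconsistencies(earlier, later):
    print(key, a, b, f"relative difference {rel_diff:.2e}")
```

Because the right tolerance depends on the market and the normalization used, a check like this would probably have to emit warnings rather than errors.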
🌟 Feature Description
A lot of users encounter data errors when integrating their own data.
It would help if Qlib provided a health checker that lets users automatically check the health of their Qlib data.
For example,
Motivation
Alternatives
Additional Notes