-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow time-based data-splitting with multiple datasources as 'split.basis' #261
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for this PR @MaLoefUDS.
Before I start with my actual review, just a general comment: Could you please reference the issue number somewhere in the PR? Either in the description of the PR or inside the commit messages of suitable commits, such that we have a connection between PR and issue? Thanks!
In general, this PR looks good. I just have a few comments.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me now.
I will update the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me. I just have two comments regarding comments.
7e4f665
to
9bf170c
Compare
Allow to explicitly specify 'split.basis' as a vector when using 'split.data.time.based'. The resulting splits will then be constructed from the union of elements from all datasources in 'split.basis'. Signed-off-by: Maximilian Löffler <[email protected]>
Signed-off-by: Maximilian Löffler <[email protected]>
Signed-off-by: Maximilian Löffler <[email protected]>
Signed-off-by: Maximilian Löffler <[email protected]>
* Concretize the roxygen documentation for 'split.data.by.bins' and 'split.data.by.time.or.bins' regarding the 'split.basis' parameter * Reword comments in 'split.data.activity.based' to not include 'split basis' but rather 'activity type' for consistency with the parameter name * Remove unecessary default value for 'split.basis' in 'split.data.by.time.or.bins' since defaults are now handled in all wrapper functions Signed-off-by: Maximilian Löffler <[email protected]>
Signed-off-by: Maximilian Löffler <[email protected]>
Signed-off-by: Maximilian Löffler <[email protected]>
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## dev #261 +/- ##
==========================================
+ Coverage 79.84% 79.89% +0.04%
==========================================
Files 16 16
Lines 4898 4905 +7
==========================================
+ Hits 3911 3919 +8
+ Misses 987 986 -1 ☔ View full report in Codecov by Sentry. |
🔔 Ping! Do not miss this PR :) |
Thanks for the ping. However, this time, the ping was not necessary 😄 Christian and I planned to have a look at this PR today anyway. So, I assume you will either receive comments later today or we will merge it later today. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me thank you @MaLoefUDS !
I have one formatting comment that you have to address before merging but I approve the PR nonetheless. Please fix this and anything @bockthom might still find.
Signed-off-by: Maximilian Löffler <[email protected]>
There is nothing else to find (at least, for me) 😄 |
Prerequisites
showcase.R
with respect to my changes.dev
.Scope
This PR aims to resolve issue #254
Description
Allow time-based data-splitting with multiple datasources as
split.basis
Changelog
Added
Changed
split.basis
insplit.data.by.time.or.bins
andsplit.data.time.based
to allow for splitting by multiple datasources