-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update filtering and subsampling docs #222
Conversation
704f86d
to
b9ec0bb
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice improvement! I only left non-blocking suggestions, but I think all the additions are very helpful.
555a3e1
to
c334b31
Compare
significant changes to "Filtering" section added
Reword some text and add an example for --query.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice additions -- the added "notes" are especially helpful
5962ddb
to
65f8fe4
Compare
65f8fe4
to
b5cb803
Compare
Overview | ||
======== | ||
|
||
``augur filter`` provides the flexibility to choose different subsets of input | ||
data for various types of analysis. There are several options which can be | ||
categorized based on the information source and selection method. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This new overview portion looks more like reference material that should go on the CLI page. But also, the rest of this page relies heavily on the terminology laid out here. Maybe fine to keep it here and leave the CLI page as a reference for what the options are.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks really good! Thanks for humouring my ideas
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
some non-blocking suggestions.
96f68da
to
62e01e5
Compare
Note that I'm introducing new terminology here: "preliminary" vs. "subsampling" vs. "force-inclusive" filtering options. These are clearly distinct in the order of operations, making these labels helpful for explaining that process. For "preliminary", I had considered a term such as "exclusive" to better contrast with "force-inclusive". However, the expression syntax used for options in this category can be both exclusive (--exclude-where region!=Asia) and inclusive (--min-date 2012). This is also why "inclusive" is not a sufficient name for the "force-inclusive" category. Co-authored-by: James Hadfield <[email protected]>
Reword the subsampling introduction with *what* it is, followed by examples on *why* paired with *how*. This also allows future sampling methods such as weighted sampling to be added by simply including a new section.
62e01e5
to
5779a70
Compare
preview
Description of proposed changes
Prepping for upcoming weighted sampling docs. These changes are applicable to current
augur filter
features.Related issue(s)
Checklist