Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Standardized metrics class to compute data set balance #18

Open
bvanessen opened this issue Oct 6, 2016 · 1 comment
Open

Standardized metrics class to compute data set balance #18

bvanessen opened this issue Oct 6, 2016 · 1 comment

Comments

@bvanessen
Copy link
Collaborator

For a labeled data set, can we build a metrics package that calculates how many labels exist, and of which type, and then the system can compute raw accuracy and corrected accuracy.

A secondary question is if we can create a cross validation library that split a single data set into a good cross validation set. Right now we can perform this partitioning based on a random distribution. A more advanced approach would be to make sure that the held-out portion of the data set is a representative sample of the data set.

@bvanessen
Copy link
Collaborator Author

One example of this is in the imbalance python package

@bvanessen bvanessen changed the title Standardized metrics class Standardized metrics class to compute data set balance Mar 14, 2017
benson31 added a commit to benson31/lbann that referenced this issue Nov 27, 2019
* clean up zero,one use in entrywise loss layers

* changes to allow building with datatype = double
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant