Standardized metrics class to compute data set balance #18

bvanessen · 2016-10-06T16:31:50Z

For a labeled data set, can we build a metrics package that calculates how many labels exist, and of which type, and then the system can compute raw accuracy and corrected accuracy.

A secondary question is if we can create a cross validation library that split a single data set into a good cross validation set. Right now we can perform this partitioning based on a random distribution. A more advanced approach would be to make sure that the held-out portion of the data set is a representative sample of the data set.

bvanessen · 2017-03-14T16:18:12Z

One example of this is in the imbalance python package

* clean up zero,one use in entrywise loss layers * changes to allow building with datatype = double

bvanessen added the enhancement label Oct 6, 2016

bvanessen added this to the Improved metrics and output analysis milestone Mar 14, 2017

bvanessen changed the title ~~Standardized metrics class~~ Standardized metrics class to compute data set balance Mar 14, 2017

benson31 added a commit to benson31/lbann that referenced this issue Nov 27, 2019

Last of Tom's changes to refactor add data type layer (LBANN#18)

3d89e04

* clean up zero,one use in entrywise loss layers * changes to allow building with datatype = double

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Standardized metrics class to compute data set balance #18

Standardized metrics class to compute data set balance #18

bvanessen commented Oct 6, 2016

bvanessen commented Mar 14, 2017

Standardized metrics class to compute data set balance #18

Standardized metrics class to compute data set balance #18

Comments

bvanessen commented Oct 6, 2016

bvanessen commented Mar 14, 2017