Explain augment dataset #30

zegermouw · 2024-01-29T10:42:16Z

Augment dataset has flag explain.

Autofeat computes relationships between n number of tables in the {dataset} repository using {coma, jaccard} similarity with a threshold of x.
Autofeat computes n join_trees.

AndraIonescu · 2024-01-29T11:08:15Z

The explain text should be the following:

AutoFeat computes the relationships between N tables from the {dataset} repository, using {coma, jaccard} similarity score with a threshold of X (i.e., all the relationships with a similarity < threshold will be discarded).
AutoFeat creates M join trees: the best performing join tree is {ID_number}
- < print paths with the data quality score >
- < print the selected features with the relevance and redundancy score >
- < print ML result: accuracy, model, feature importance >

zegermouw assigned AndraIonescu Jan 29, 2024

Provide feedback