Explain augment dataset #30

Open
zegermouw opened this issue Jan 29, 2024 · 1 comment
@zegermouw
Collaborator

Augment dataset has an explain flag. Proposed text for its output:

  1. AutoFeat computes relationships between n tables in the {dataset} repository using {coma, jaccard} similarity with a threshold of x.
  2. AutoFeat computes n join trees.
@AndraIonescu
Member

AndraIonescu commented Jan 29, 2024

The explain text should be the following:

  1. AutoFeat computes the relationships between N tables from the {dataset} repository, using {coma, jaccard} similarity score with a threshold of X (i.e., all the relationships with a similarity < threshold will be discarded).
  2. AutoFeat creates M join trees: the best performing join tree is {ID_number}
    • < print paths with the data quality score >
    • < print the selected features with the relevance and redundancy score >
    • < print ML result: accuracy, model, feature importance >
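
A minimal sketch of how the first two lines of this explain text could be assembled. Every name here (`Relationship`, `filter_relationships`, `explain_text`) and the example values are hypothetical and not taken from the AutoFeat codebase; the sketch only illustrates the threshold rule and the template above.

```python
from dataclasses import dataclass


@dataclass
class Relationship:
    # Hypothetical record for one discovered relationship between two tables.
    left_table: str
    right_table: str
    similarity: float


def filter_relationships(relationships, threshold):
    """Discard every relationship whose similarity is below the threshold."""
    return [r for r in relationships if r.similarity >= threshold]


def explain_text(dataset, n_tables, matcher, threshold, n_join_trees, best_tree_id):
    """Fill in the two numbered lines of the explain template."""
    step_1 = (
        f"1. AutoFeat computes the relationships between {n_tables} tables "
        f"from the {dataset} repository, using {matcher} similarity score "
        f"with a threshold of {threshold} (i.e., all the relationships with "
        f"a similarity < threshold will be discarded)."
    )
    step_2 = (
        f"2. AutoFeat creates {n_join_trees} join trees: "
        f"the best performing join tree is {best_tree_id}"
    )
    return "\n".join([step_1, step_2])


if __name__ == "__main__":
    # Made-up example values, for illustration only.
    found = [Relationship("a", "b", 0.9), Relationship("a", "c", 0.4)]
    kept = filter_relationships(found, threshold=0.55)
    print(f"{len(kept)} of {len(found)} relationships kept")
    print(explain_text("school", 3, "jaccard", 0.55, 2, 1))
```

The three sub-bullets (paths, selected features, ML result) are left out because the thread does not specify their format.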
