-
Notifications
You must be signed in to change notification settings - Fork 218
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement feature_importances_ in sksurv.ensemble.RandomSurvivalForest #140
Comments
Feature importances based on node/split statistics are rather flawed (see e.g. this paper). Therefore, I'm hesitant to implement this feature. In particular, you can already compute permutation-based feature importance via ELI5. It is more expensive to compute, but has better properties. |
My vote would be for adding the feature, at the very least for compatibility with scikit-learn. |
sklearn has https://scikit-learn.org/stable/modules/generated/sklearn.inspection.permutation_importance.html#sklearn.inspection.permutation_importance now, which is the much better option. |
Yes, thanks! I understand your point of view, and that there are alternative ways to compute importance. Thanks for the package and thanks for considering! :) |
I also have a use-case where I am only interested in which feature are used or not used. For that, the feature importances based on node/split statistics could do the job and would be quick to calculate. In contrast, the calculation of permutation feature importances takes so much longer. Thanks a lot for this package and your work. |
Feature importances based on split criteria have been requested in the
past. Unfortunately, the way sklearn implemented feature importance in
the tree-growing algorithm doesn't work with the log-rank criteria used
to grow the survival tree. The log-rank criteria measures the quality of
the split, but sklearn assumes feature importance measure the purity of
a node.
|
Implement feature_importances_ in sksurv.ensemble.RandomSurvivalForest.
Examples:
https://cran.r-project.org/web/packages/randomForestSRC/randomForestSRC.pdf
https://square.github.io/pysurvival/models/random_survival_forest.html
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6364686/
The text was updated successfully, but these errors were encountered: