Datatype Support in Quality Control and Impute #865

Draft
wants to merge 6 commits into
base: main

aGuyLearning
Collaborator

@aGuyLearning aGuyLearning commented Feb 5, 2025

PR Checklist

  • This comment contains a description of changes (with reason)
  • Referenced issue is linked
  • If you've fixed a bug or added code that should be tested, add tests!
  • Documentation in docs is updated

Description of changes
As discussed in #861, this refactors the quality control functions to use single dispatch for future datatype support.

To this end, the explicit_impute function has already been reworked. Closes #848.

Technical details
This pull request changes the imputation and quality control functions in the ehrapy preprocessing module. The main changes are support for dask arrays, the use of singledispatch to select an implementation based on the array type, and extended test coverage for different array types.

Possible enhancements
Please let's discuss the changes, as I am not yet fully happy with them.

  • adata.X datatype checker wrapper for the parent function, and more restrictive datatypes
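
One possible shape for such a checker is sketched below, purely as a starting point for discussion. The decorator name `check_x_type` and the function `n_missing` are hypothetical, and a `SimpleNamespace` stands in for an AnnData object so the example runs without anndata installed.

```python
from functools import wraps
from types import SimpleNamespace

import numpy as np


def check_x_type(*allowed_types):
    """Hypothetical decorator: reject adata.X types the function does not support."""

    def decorator(func):
        @wraps(func)
        def wrapper(adata, *args, **kwargs):
            if not isinstance(adata.X, allowed_types):
                raise TypeError(
                    f"{func.__name__} does not support adata.X of type "
                    f"{type(adata.X).__name__}"
                )
            return func(adata, *args, **kwargs)

        return wrapper

    return decorator


@check_x_type(np.ndarray)
def n_missing(adata):
    # Placeholder body standing in for a real quality control function.
    return int(np.isnan(adata.X).sum())


# SimpleNamespace is a stand-in for AnnData; only the .X attribute is used.
fake_adata = SimpleNamespace(X=np.array([1.0, np.nan, 3.0]))
print(n_missing(fake_adata))  # 1
```

Checking the type once in the public function would keep the dispatched helpers free of repeated validation, at the cost of maintaining the allowed-type list in two places.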

@aGuyLearning aGuyLearning requested a review from eroell February 5, 2025 16:32
@aGuyLearning aGuyLearning self-assigned this Feb 5, 2025
@aGuyLearning aGuyLearning linked an issue Feb 5, 2025 that may be closed by this pull request
2 tasks
@aGuyLearning aGuyLearning marked this pull request as draft February 5, 2025 16:38

Successfully merging this pull request may close these issues.

Compatibility of ep.pp.explicit_impute with different datatypes