Skip to content

Speeding Up GENCODE Parsing

Latest
Compare
Choose a tag to compare
@michal-g michal-g released this 02 Mar 22:21
· 19 commits to master since this release

This release includes an improved dryadic.features.cohorts.utils.get_gencode function that should be roughly 2x faster than the old version, mainly due to smarter parsing of the annotation fields in a loaded GENCODE .gtf file.

Also included in this release are a fresh README file, as well as making pheno a required argument for train/test_data cohort methods so that only retrieving the input features requires passing pheno=None explicitly.