Skip to content

Multimodal adding datasets for the GMM for BA9 #2

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 6 commits into
base: multimodal
Choose a base branch
from

Conversation

lesolorzanov
Copy link

Adding the necessary datasets for the BA9 models

add a dataset "ImageAnnDataset" to bring only images and no counts at all.
Adding MMFeatureGridDataset which is needed for the count logits coming from scBERT plus the UNI features to which f_img will be applied to do the final g_mm
to have gene_symbols as kwarg in preprocess to ask for a column name in adata.var storing gene_symbols matching target_genes
using glob instead of a hardcoded locations to find spaceranger files
adding the glob library missing in utils
added parameters atonce_count_limit=None, device="cpu", delay_sending_to_device=True,  to GridNetHexMM on the hopes of delaying sending ton GPU when the counts and images are full size. Recently, f_count is the logits and f_img is a 1024 vector of UNI instead of the image itself.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant