Option to use TCRblosum for distance calculation #591

Open
riederd opened this issue Feb 10, 2025 · 2 comments
@riederd
Collaborator

riederd commented Feb 10, 2025

TCRblosum is a substitution matrix derived specifically from TCR sequences, so it is likely to give a more accurate measure of TCR sequence similarity than a general-purpose matrix. It is not yet used in scirpy.

I propose adding TCRblosum as an alternative substitution matrix for the distance calculation. Ideally, this would be exposed as a parameter of the ir_dist function so that users can select TCRblosum.

Publication:

@grst
Collaborator

grst commented Feb 10, 2025

It's not very well documented in scirpy, but it should be possible to define a substitution matrix in a file as supported by parasail:
https://github.com/jeffdaily/parasail-python?tab=readme-ov-file#substitution-matrices

and then use

ir.pp.ir_dist(adata, metric="alignment", cutoff=10, subst_mat="tcrblosum.txt")

If that works, it would be quite easy to include it natively in scirpy.
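
For illustration, here is a minimal sketch of how such a matrix file could be generated. It assumes the plain-text layout described in the parasail-python README linked above: `#` comment lines, a header row with the alphabet, one row per residue, and a final non-alphabet `*` row and column for characters outside the alphabet. The helper name `format_subst_matrix`, the three-letter alphabet, and all scores are placeholders invented for this example. They are not the published TCRblosum values, which would have to be taken from the publication.

```python
import tempfile
from pathlib import Path


def format_subst_matrix(alphabet, scores, default=-4):
    """Render a symmetric substitution matrix as whitespace-separated text.

    `scores` maps (residue, residue) pairs to integers. Missing pairs, and
    every pair involving the catch-all '*' symbol, get `default`.
    (Hypothetical helper, not part of scirpy or parasail.)
    """
    symbols = list(alphabet) + ["*"]

    def score(a, b):
        if "*" in (a, b):
            return default
        # Look the pair up in either orientation so only one needs to be given.
        return scores.get((a, b), scores.get((b, a), default))

    lines = ["# Toy values for illustration, NOT the published TCRblosum scores."]
    # Header row: the alphabet, leaving the first column empty.
    lines.append(" " + "".join(f"{s:>4}" for s in symbols))
    # One row per symbol: the symbol itself, then its score against each column.
    for a in symbols:
        lines.append(a + "".join(f"{score(a, b):>4}" for b in symbols))
    return "\n".join(lines) + "\n"


# Placeholder scores on a tiny alphabet, chosen only to keep the output short.
toy_scores = {("A", "A"): 4, ("C", "C"): 9, ("S", "S"): 4, ("A", "S"): 1}
text = format_subst_matrix("ACS", toy_scores)

# Written to the temp directory here; any readable path should do.
matrix_path = Path(tempfile.gettempdir()) / "tcrblosum.txt"
matrix_path.write_text(text)
print(text)

# The call suggested above would then point at this file (untested in this thread):
# ir.pp.ir_dist(adata, metric="alignment", cutoff=10, subst_mat=str(matrix_path))
```

A real file would cover the full amino-acid alphabet, and whether scirpy accepts a file path for `subst_mat` is exactly what still needs to be confirmed here.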

@riederd
Collaborator Author

riederd commented Feb 10, 2025

Missed that, will try and report back
Thanks
