Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for spart.xml #35

Open
mvences opened this issue Nov 26, 2021 · 0 comments
Open

Add support for spart.xml #35

mvences opened this issue Nov 26, 2021 · 0 comments
Assignees

Comments

@mvences
Copy link
Contributor

mvences commented Nov 26, 2021

TaxI3, I think, already exports the results of the current simple clustering algorithm in spart format.

When defining the spart format in a recent paper by Miralles et al. 2021, we also defined an XML version of this format which is more flexible and can be extended to include much more information than the classical spart.

We should envisage to include spart.xml in two ways in TaxI3:

  1. Besides regular spart output, any clustering performed in the program should also output a spart.xml file
  2. The program should also accept spart.xml as input file.

For the second point, the format however still needs to be developed (I will try to do this along with colleagues in the next weeks). For instance, we may decide that also DNA sequences can be included along with species partitions in spart.xml, in which case the format could include the same information as a tab file (and maybe more). Alternatively, the file may specify a species partition, and sequence names assigned to each subset (= species), and a user can then upload a spart.xml along with a fasta file, so that the same information as currently in a tab file is included.

This is not an issue to be tackled with priority - we first need to define more precisely the extended version of the spart.xml format. Maybe in late January, we can come back to this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants