You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
TaxI3, I think, already exports the results of the current simple clustering algorithm in spart format.
When defining the spart format in a recent paper by Miralles et al. 2021, we also defined an XML version of this format which is more flexible and can be extended to include much more information than the classical spart.
We should envisage to include spart.xml in two ways in TaxI3:
Besides regular spart output, any clustering performed in the program should also output a spart.xml file
The program should also accept spart.xml as input file.
For the second point, the format however still needs to be developed (I will try to do this along with colleagues in the next weeks). For instance, we may decide that also DNA sequences can be included along with species partitions in spart.xml, in which case the format could include the same information as a tab file (and maybe more). Alternatively, the file may specify a species partition, and sequence names assigned to each subset (= species), and a user can then upload a spart.xml along with a fasta file, so that the same information as currently in a tab file is included.
This is not an issue to be tackled with priority - we first need to define more precisely the extended version of the spart.xml format. Maybe in late January, we can come back to this.
The text was updated successfully, but these errors were encountered:
TaxI3, I think, already exports the results of the current simple clustering algorithm in spart format.
When defining the spart format in a recent paper by Miralles et al. 2021, we also defined an XML version of this format which is more flexible and can be extended to include much more information than the classical spart.
We should envisage to include spart.xml in two ways in TaxI3:
For the second point, the format however still needs to be developed (I will try to do this along with colleagues in the next weeks). For instance, we may decide that also DNA sequences can be included along with species partitions in spart.xml, in which case the format could include the same information as a tab file (and maybe more). Alternatively, the file may specify a species partition, and sequence names assigned to each subset (= species), and a user can then upload a spart.xml along with a fasta file, so that the same information as currently in a tab file is included.
This is not an issue to be tackled with priority - we first need to define more precisely the extended version of the spart.xml format. Maybe in late January, we can come back to this.
The text was updated successfully, but these errors were encountered: