assembly and genbank updates #579

Draft. Wants to merge 19 commits into base: master

dpark01 (Member) commented Feb 3, 2025

Updates:

  • All assembly processes (anything involving assemble_refbased or refine_assembly_with_aligned_reads) now trim ambiguous bases from the sequence edges according to the "3rules" criteria used in NCBI's VADR script "fasta-trim-terminal-ambigs.pl". This satisfies submission requirements enforced by table2asn, which fails sequences with too much ambiguous content at the edges.
  • When downloading reference genomes in scaffold_and_refine_multitaxa, save them to files whose names use dashes instead of colons as delimiters, since colons are incompatible with Singularity.
  • Update task utils.rename_file to be more portable (it was not working with miniwdl's implementation).
  • In genbank_single, name more output files consistently so they are easier to group together.
  • In task align_and_count, fix a bug with empty input BAMs.
  • In task FastqToUBAM, add knobs for RAM and disk size and increase the defaults.
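
To illustrate the first item, here is a rough sketch of terminal ambiguity trimming. It is a deliberate simplification: it only strips leading and trailing runs of non-ACGT characters and does not reproduce the actual "3rules" logic of fasta-trim-terminal-ambigs.pl, which this PR relies on. The function name `trim_terminal_ambigs` is hypothetical and is not part of this repository or of VADR.

```python
# Simplified illustration only: strips terminal runs of ambiguous bases.
# This is NOT the "3rules" criteria from VADR's fasta-trim-terminal-ambigs.pl.

UNAMBIGUOUS = set("ACGTacgt")


def trim_terminal_ambigs(seq: str) -> str:
    """Return seq with leading and trailing non-ACGT characters removed.

    Ambiguous characters in the interior of the sequence are left untouched.
    """
    start = 0
    end = len(seq)
    # Advance past ambiguous characters at the 5' end.
    while start < end and seq[start] not in UNAMBIGUOUS:
        start += 1
    # Back up past ambiguous characters at the 3' end.
    while end > start and seq[end - 1] not in UNAMBIGUOUS:
        end -= 1
    return seq[start:end]


if __name__ == "__main__":
    print(trim_terminal_ambigs("NNNACGTNNACGTNN"))
```

Interior ambiguous bases are kept on purpose, since the change described above targets only the edges of the assembly.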

dpark01 changed the title from "WiP: assembly and genbank updates" to "assembly and genbank updates" on Feb 3, 2025