Skip to content

Metagenomics NF WF Patch update for post-processing WF #153

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 9 commits into
base: DEV_NFMGIllumina_1.0.1_patch
Choose a base branch
from

Conversation

bnovak32
Copy link
Contributor

  • clean-paths updated to fix bug that removes entire line in front of path to be removed and add better regex for white space
  • association table updated
    • remove filtered-read-counts which is not generated by Metagenomics pipeline (specific to Amplicon pipeline)
    • removed unused parameters
    • updated bins/MAGs search to look for zip instead of fasta files
  • metagenomics README generation updated
    • updated default name/email
    • updated to use pathlib instead of os.path, which allows easier directory basename extraction
    • fixed naming of directories to match OSDR published data
      • removed redundant parent directories
      • swapped "_" with " " as in OSDR resource category names
    • removed processing info zip content listing
  • performance improvement for md5sum generation (maintain and use soft links instead of copying files)

- clean-paths updated to fix bug that removes entire line in front of
  path to be removed and add better regex for white space
- association table updated
    - remove filtered-read-counts which is not generated by Metagenomics
      pipeline (specific to Amplicon pipeline)
    - removed unused parameters
    - updated bins/MAGs search to look for zip instead of fasta files
- metagenomics README generation updated
    - updated default name/email
    - updated to use pathlib instead of os.path, which allows easier
      directory basename extraction
    - fixed naming of directories to match OSDR published data
        - removed redundant parent directories
        - swapped "_" with " " as in OSDR resource category names
    - removed processing info zip content listing
@bnovak32 bnovak32 changed the title Patch update for post-processing WF Metagenomics NF WF Patch update for post-processing WF May 14, 2025
@bnovak32
Copy link
Contributor Author

Updated Validation Report

  • Produce list of samples with assemblies but no gene predictions instead of failing validation
  • Improved formatting

bnovak32 added 6 commits May 16, 2025 10:35
- update README to explicitly list more file names
- update Failed-assemblies file to include assay suffix
- update version info pulled for KEGG-Decoder (use conda package
  version)
- update curation table name
- expand name for "bit" in pipeline software table to make it easier to
  trace the version
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant