Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Empty fastq files in delivery for microsalt cases #2983

Closed
seallard opened this issue Feb 26, 2024 · 6 comments
Closed

Empty fastq files in delivery for microsalt cases #2983

seallard opened this issue Feb 26, 2024 · 6 comments
Labels

Comments

@seallard
Copy link
Contributor

seallard commented Feb 26, 2024

Description

At least one microsalt case has been observed to contain two empty fastq files (besides the fastq files containing the concatenated data) in the customers delivery directory on Hasta.

BLAZA-9789:
total 648486
-rw-rw----  1 hiseq.clinical  cust015          0 Feb 26 10:03 240219_BLAZA-9789_1.fastq.gz
-rw-rw----  1 hiseq.clinical  cust015          0 Feb 26 10:03 240219_BLAZA-9789_2.fastq.gz
-rw-rw----  1 hiseq.clinical  cust015  339066091 Feb 26 10:03 BLAZA-9789_R1.fastq.gz
-rw-rw----  1 hiseq.clinical  cust015  324937459 Feb 26 10:03 BLAZA-9789_R2.fastq.gz

The reason the files are not removed is that they do not follow the expected naming pattern for sample fastq files.

Suggested solution

Determine where the empty files are created.
Ensure they are not created, alternatively remove them before delivery.

This can be closed when

The empty fastq files are not present.

@seallard seallard added the Bug label Feb 26, 2024
@eliottBo eliottBo pinned this issue Feb 26, 2024
@eliottBo eliottBo unpinned this issue Feb 26, 2024
@seallard
Copy link
Contributor Author

The empty files are not present in the bundle for the case in housekeeper. So they are not generated by the microsalt pipeline.

Looking at the names, the empty files were generated by the cg deliver ticket command. So the issue is that:

  1. Automation delivered the files, concatenated and removed them.
  2. The manual deliver ticket command was run separately which created the empty files (since automation already concatenated and removed the other ones).

@seallard
Copy link
Contributor Author

Closing this for now, re-open if issue reoccurs.

@eliottBo
Copy link
Contributor

eliottBo commented Mar 14, 2024

I observed this issue again (2024-03-11) when using cg deliver ticket -d fastq -t <ticket_id> --force-all. I removed the additional empty files with find . -type d -exec sh -c 'cd "{}" && rm -f <pattern>*' \; command in the directory.

I used this command because a sample in a microsalt case did not receive enough raw reads but the QCs in the report were ok and I decided to deliver as it was. After setting as completed in TB, I removed the ticket folder on caesar and hasta and used the cg deliver command with --force-all flag to include the sample without enough raw reads.

@eliottBo eliottBo reopened this Mar 14, 2024
@seallard seallard self-assigned this Mar 14, 2024
@seallard
Copy link
Contributor Author

seallard commented Mar 14, 2024

I'm pretty sure this means the sample had already been concatenated and delivered.
I think the QC for microsalt allows delivering failing samples given that the case still passes QC. Is that something you want to change?

See documentation here #2754 (comment)

I'll patch the concatenation logic so that it does not generate the empty files, but you should be aware that the failed sample most likely already had been delivered.

@seallard seallard removed their assignment Aug 5, 2024
@Vince-janv
Copy link
Contributor

Does not seem solved. Could replicate the issue.
If cg.services.fastq_concatenation_service.utils.concatenate gets an empty list as input it will create an empty file

@Vince-janv
Copy link
Contributor

Should be solved by the merge of the new delivery flow

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants