How to read two FASTQ files (paired FASTQ files) in the example/consume.cc #1876

Open
lzhLab opened this issue Aug 17, 2018 · 1 comment

@lzhLab

lzhLab commented Aug 17, 2018

counts.consume_seqfile<FastxReader>(argv[1], seqs_consumed, kmers_consumed);
Assume that I have two FASTQ files, 'seq_1.fastq' and 'seq_2.fastq'. I found that the API "consume_seqfile()" is not useful for reading multiple FASTQ files, since it only accepts one file.

The command I used:
./consume rs_frag_{1,2}.fastq

Is there another API I can use? Any help would be appreciated!

@standage
Member

Hi @lzhLab!

If you’re only going to be storing the reads in a counttable or countgraph, then pairing info is not required. You can simply consume one file and then the next.
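To illustrate why pairing does not matter for pure k-mer counting, here is a minimal standard-library sketch that does not use khmer at all. The `KmerCounts` map is only a stand-in for a counttable, and `consume_fastq` is a hypothetical helper written for this example; it assumes plain 4-line FASTQ records. Feeding it one input and then the next into the same table gives the combined counts.

```cpp
#include <cassert>
#include <cstddef>
#include <istream>
#include <sstream>   // std::istringstream, handy for trying this on in-memory data
#include <string>
#include <unordered_map>

// Stand-in for a khmer counttable (assumption: an exact map, not a sketch
// data structure): maps each k-mer string to the number of times it was seen.
using KmerCounts = std::unordered_map<std::string, unsigned int>;

// Hypothetical helper, not part of khmer: read 4-line FASTQ records from `in`
// and add every k-mer of each sequence line to `counts`.
// Returns the number of reads consumed from this input.
std::size_t consume_fastq(std::istream& in, std::size_t k, KmerCounts& counts) {
    assert(k > 0);
    std::string header, seq, plus, qual;
    std::size_t n_reads = 0;
    while (std::getline(in, header) && std::getline(in, seq) &&
           std::getline(in, plus) && std::getline(in, qual)) {
        // Slide a window of length k across the read.
        for (std::size_t i = 0; i + k <= seq.size(); ++i) {
            ++counts[seq.substr(i, k)];
        }
        ++n_reads;
    }
    return n_reads;
}
```

Calling `consume_fastq` once on a stream opened from 'seq_1.fastq' and once on a stream opened from 'seq_2.fastq', with the same `counts` object, accumulates both files into one table. In consume.cc itself, the analogous change would presumably be to call `counts.consume_seqfile<FastxReader>(...)` once per file (for example with `argv[1]` and then `argv[2]`) instead of only once.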

If there’s another reason you need to retain the pairing information, khmer only supports interleaved Fastq files—the read followed by its paired sequence. There should be a “broken_paired_reader” in both the Python API and the C++/Cython API. I don’t know if either one is well documented, but you should be able to find some example code in the scripts. 🤷‍♂️
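If you do need the interleaved layout, the two files have to be merged first. As a rough sketch of what "interleaved" means, the following standard-library code alternates records from two inputs. `FastqRecord`, `read_record`, `write_record`, and `interleave_fastq` are all hypothetical names made up for this example, not khmer APIs, and the code assumes 4-line FASTQ records with the mates in the same order in both files. I believe the khmer scripts directory also has an interleave-reads.py script for this purpose, but check the repository to be sure.

```cpp
#include <cassert>
#include <cstddef>
#include <istream>
#include <ostream>
#include <sstream>   // std::istringstream / std::ostringstream for in-memory use
#include <string>

// One 4-line FASTQ record (hypothetical type for this sketch).
struct FastqRecord {
    std::string header, seq, plus, qual;
};

// Read one record; returns false at end of input or on a truncated record.
bool read_record(std::istream& in, FastqRecord& r) {
    return std::getline(in, r.header) && std::getline(in, r.seq) &&
           std::getline(in, r.plus) && std::getline(in, r.qual);
}

// Write one record back out in 4-line FASTQ form.
void write_record(std::ostream& out, const FastqRecord& r) {
    out << r.header << '\n' << r.seq << '\n' << r.plus << '\n' << r.qual << '\n';
}

// Write read 1, then its mate from read 2, for each pair in turn.
// Stops as soon as either input runs out, so any unmatched trailing reads
// are dropped. Returns the number of pairs written.
std::size_t interleave_fastq(std::istream& r1, std::istream& r2, std::ostream& out) {
    FastqRecord a, b;
    std::size_t n_pairs = 0;
    while (read_record(r1, a) && read_record(r2, b)) {
        write_record(out, a);
        write_record(out, b);
        ++n_pairs;
    }
    return n_pairs;
}
```

With `std::ifstream` objects for 'seq_1.fastq' and 'seq_2.fastq' and a `std::ofstream` for the output, this would produce a single file in which each read is immediately followed by its mate.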
