Error when annotating big assemblies file. #659

Open
RunJiaJi opened this issue Apr 15, 2023 · 0 comments


RunJiaJi commented Apr 15, 2023

Hi scientists,

I'm trying to annotate metagenomic assemblies that were assembled directly with Megahit. I have a large FASTA file (318 MB) containing 64,096 contigs. After running Prokka, I noticed that some of the protein sequences were not properly annotated.

To illustrate, take one contig (>gnl|Prokka|LOCUSTAG_416) as an example: some of its protein sequences were annotated and translated, while others were neither annotated nor translated, as can easily be seen in the GenBank file (contig_LOCUSTAG_416_firstTimeAnno.gbk).
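For anyone who wants to quantify this rather than eyeball the GenBank file, here is a minimal sketch that flags which CDS features carry a `/translation` qualifier. It assumes the standard GenBank flat-file layout (feature keys starting in column 6, qualifiers indented to column 22); the function name `cds_translation_flags` is made up for this illustration and is not part of Prokka.

```python
import io


def cds_translation_flags(handle):
    """Return one boolean per CDS feature: True if it has a /translation qualifier.

    Assumes standard GenBank flat-file indentation; this is a plain-text scan,
    not a full GenBank parser.
    """
    flags = []
    in_features = False
    in_cds = False
    for line in handle:
        if line.startswith("FEATURES"):
            in_features = True
            in_cds = False
            continue
        # Any other keyword starting in column 1 (ORIGIN, CONTIG, //) ends the table.
        if in_features and line[:1] and not line[0].isspace():
            in_features = False
            in_cds = False
            continue
        if not in_features:
            continue
        # A new feature starts with five spaces followed by the feature key.
        if line.startswith("     ") and len(line) > 5 and not line[5].isspace():
            in_cds = line[5:21].strip() == "CDS"
            if in_cds:
                flags.append(False)
        elif in_cds and line.strip().startswith("/translation="):
            flags[-1] = True
    return flags


if __name__ == "__main__":
    # Hypothetical path: point this at the GenBank file you want to check.
    with open("contig_LOCUSTAG_416_firstTimeAnno.gbk") as fh:
        flags = cds_translation_flags(fh)
    print(f"{sum(flags)} of {len(flags)} CDS features have a translation")
```

Running it on both the first and the second annotation of the same contig gives a direct count of how many CDS features lost their translation in the large run.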

I then extracted this contig's sequence from the large FASTA file (contig_LOCUSTAG_416.fa) and re-annotated it on its own with Prokka. Surprisingly, all proteins were annotated and translated (contig_LOCUSTAG_416_secondTimeAnno.gbk).
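To make the extraction step reproducible, here is a minimal sketch of pulling one record out of a multi-FASTA file by its ID, using only the Python standard library. The function names `read_fasta` and `extract_contig`, the file name `assemblies.fa`, and the contig ID `some_contig_id` are placeholders for this illustration. Note that Prokka renames contigs, so the ID in the original Megahit file will differ from `gnl|Prokka|LOCUSTAG_416`.

```python
import io


def read_fasta(handle):
    """Yield (header, sequence) pairs from a FASTA-formatted text handle."""
    header = None
    chunks = []
    for line in handle:
        line = line.rstrip("\n")
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line.strip())
    if header is not None:
        yield header, "".join(chunks)


def extract_contig(handle, contig_id):
    """Return (header, sequence) for the first record whose ID matches, else None.

    The ID is taken to be the first whitespace-delimited token of the header.
    """
    for header, seq in read_fasta(handle):
        fields = header.split()
        if fields and fields[0] == contig_id:
            return header, seq
    return None


if __name__ == "__main__":
    # Hypothetical input file and contig ID: replace with your own.
    with open("assemblies.fa") as fh:
        record = extract_contig(fh, "some_contig_id")
    if record is not None:
        header, seq = record
        with open("single_contig.fa", "w") as out:
            out.write(f">{header}\n{seq}\n")
```

The resulting single-contig file can then be passed to Prokka on its own, which is the comparison described above.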

Can someone explain why this error appears when annotating large files? Thanks in advance.

contig_LOCUSTAG_416.fa.txt
contig_LOCUSTAG_416_firstTimeAnno.gbk.txt
contig_LOCUSTAG_416_secondTimeAnno.gbk.txt
