-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
riplike: script is too slow compared to LANL tool #12
Comments
Turning off bootstrap sampling makes a big difference: Elzar:poplars artpoon$ time python3 riplike.py -nrep 0 ref_genomes/K03455.fasta test.out
K03455|HIVHXB2CG
real 0m6.893s
user 0m6.578s
sys 0m0.256s |
Replacing Elzar:poplars artpoon$ time python3 riplike.py ref_genomes/K03455.fasta test.out
K03455|HIVHXB2CG
real 0m35.956s
user 0m35.614s
sys 0m0.278s |
riplike is very slow on Windows (possibly due to the MAFFT version). I think
Bootstrapdef bootstrap(s1, s2, reps=100):
...
for rep in range(reps):
result = []
bootstrap = [random.randint(0, seqlen-1) for _ in range(seqlen)]
b1 = ''.join([s1[i] for i in bootstrap])
b2 = ''.join([s2[i] for i in bootstrap])
yield b1, b2 The string joining in NumPyUsing NumPy arrays in
|
I think that the implementation of |
See #22 |
On my Mac at home (admittedly a slow machine):
This same query takes about 7 seconds on the LANL server.
First I'm going to see if the bootstrap step can be made faster.
The text was updated successfully, but these errors were encountered: