Bonjour,

I have assembled a simulated metagenome of 20 genomes. I have 64 pairs each
containing 23,5M pairs. I have been using Ray v2.3.1 using 2048 cores.

For one of the genomes I seem to be getting duplicate contigs.

I have mapped the assembly against itself using MUMmer and found out that
some of the contigs are exactly the same. MUMmer shows these kind of hits:

[S1] [E2] [S2] [E2] [LEN 1] [LEN 2] [% IDY] [LEN R] [LEN Q] [COV R] [COV Q]
[REF] [QRY]

1       825006  825006  1       825006  825006  100.00  825006  825006
100.00  100.00 contig-859      contig-717

For an explanation of the format:
http://mummer.sourceforge.net/manual/#coords

Do you have any idea how this might happen?

Best regards,
Ino
------------------------------------------------------------------------------
"Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE
Instantly run your Selenium tests across 300+ browser/OS combos.
Get unparalleled scalability from the best Selenium testing platform available
Simple to use. Nothing to install. Get started now for free."
http://p.sf.net/sfu/SauceLabs
_______________________________________________
Denovoassembler-users mailing list
Denovoassembler-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/denovoassembler-users

Reply via email to