> From: Ino de Bruijn [ino.debru...@scilifelab.se]
> Sent: Friday, May 16, 2014 7:31 AM
> To: denovoassembler-users@lists.sourceforge.net
> Subject: [Denovoassembler-users] Duplicate contigs?
> 
> 
> 
> 
> Bonjour,
> 
> I have assembled a simulated metagenome of 20 genomes. I have 64 pairs each 
> containing 23,5M pairs. I have been using Ray v2.3.1 using 2048 cores.
> 
> For one of the genomes I seem to be getting duplicate contigs.
> 
> 
> I have mapped the assembly against itself using MUMmer and found out that 
> some of the contigs are exactly the same. MUMmer shows these kind of hits:
> 
> 
> [S1] [E2] [S2] [E2] [LEN 1] [LEN 2] [% IDY] [LEN R] [LEN Q] [COV R] [COV Q] 
> [REF] [QRY]
> 
> 
> 1       825006  825006  1       825006  825006  100.00  825006  825006  
> 100.00  100.00 contig-859      contig-717
> 
> 
> For an explanation of the format:
> http://mummer.sourceforge.net/manual/#coords
> 

I have seen cases where there was a significant overlap between two contigs and 
for which Ray
failed to merge them.

I have not seen the case where there are 2 identical contigs (in your case, one 
is the reverse-complement of
the other).

> 
> Do you have any idea how this might happen?
> 
> 
> Best regards,
> Ino
> 
> 
> 
> 
> 

------------------------------------------------------------------------------
"Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE
Instantly run your Selenium tests across 300+ browser/OS combos.
Get unparalleled scalability from the best Selenium testing platform available
Simple to use. Nothing to install. Get started now for free."
http://p.sf.net/sfu/SauceLabs
_______________________________________________
Denovoassembler-users mailing list
Denovoassembler-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/denovoassembler-users

Reply via email to