On 26.02.2014 15:02, David Eccles (gringer) wrote:
> I have no idea if this peak coverage problem will impact how Ray deals with 
> the assembly. If it does, I'll probably need to do some
> kmer-based error correction prior to assembly with Ray in the hope that it 
> will bring those kmers with coverage 2-20 down to frequencies 
> below the true peak value.

So, I've now used BayesHammer (the error-correction module in SPAdes) for 
error correction, and selected only samples that were from the same strain, 
but the coverage problem still remains. Because the kmer counts at 2X are 
larger than the counts at ~500X, 2 is chosen as the peak coverage:

CoverageDistribution.postEC.txt
CoverageDistributionAnalysis.postEC.txt

[these are from 250bp paired-end Illumina reads]
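
To illustrate why 2 ends up as the peak, here is a minimal Python sketch. It 
is not Ray's actual peak detection; the function names and the histogram 
values are made up for illustration. It contrasts picking the coverage with 
the highest count against first skipping the descending low-coverage error 
slope and only then looking for the maximum.

```python
def naive_peak(hist):
    """Return the coverage value with the highest k-mer count."""
    return max(hist, key=hist.get)


def peak_after_first_minimum(hist):
    """Skip the initial descending (error) slope, then return the coverage
    with the highest count among the remaining values."""
    coverages = sorted(hist)
    i = 0
    # Walk down the low-coverage slope until the counts stop decreasing.
    while i + 1 < len(coverages) and hist[coverages[i + 1]] < hist[coverages[i]]:
        i += 1
    tail = coverages[i:]
    return max(tail, key=hist.get)


# Hypothetical histogram (coverage -> number of distinct k-mers).
# These numbers are invented, not taken from the attached files.
toy_hist = {
    2: 900000, 3: 400000, 4: 150000, 10: 20000, 50: 5000,
    300: 40000, 500: 120000, 700: 30000,
}

print(naive_peak(toy_hist))
print(peak_after_first_minimum(toy_hist))
```

With this toy histogram the first function picks the error peak at 2, while 
the second finds the peak near 500. Whether a rule like this would be 
appropriate for real data is a separate question.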

I'll try again with a larger kmer size. I guess that might cause a few more of 
those reads to become unique.

- David

_______________________________________________
Denovoassembler-users mailing list
Denovoassembler-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/denovoassembler-users
