Q: When using Picard's MarkDuplicates, is it preferable to only mark
duplicates rather than remove them?
A: In our pipelines, we mark the duplicates but keep them in the file, as
downstream tools can then choose whether to count or ignore them in their
analysis.
Q: Does the remove duplicates command remove all duplicated reads or does
it leave one representative copy of the read (leaving all unique reads)?
A: It removes every read that would otherwise have been marked as a
duplicate, leaving a single representative copy from each set of
duplicates.
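To make the difference concrete, here is a minimal sketch in Python. It
does not reproduce how Picard detects duplicates; it assumes the duplicates
are already known (the `duplicate_names` set is hypothetical) and only shows
what ends up in the output. The `Read` class and `apply_duplicate_policy`
function are illustrative names, not part of Picard. The one real detail is
the SAM FLAG bit 0x400, which means "PCR or optical duplicate".

```python
from dataclasses import dataclass, replace

DUPLICATE_FLAG = 0x400  # SAM FLAG bit: PCR or optical duplicate


@dataclass(frozen=True)
class Read:
    """Toy stand-in for an alignment record (illustrative only)."""
    name: str
    flag: int


def apply_duplicate_policy(reads, duplicate_names, remove):
    """Return the output reads when marking (remove=False) or removing (remove=True)."""
    output = []
    for read in reads:
        if read.name in duplicate_names:
            if remove:
                continue  # removal: the duplicate is not written at all
            # marking: the read is kept, with the duplicate bit set
            read = replace(read, flag=read.flag | DUPLICATE_FLAG)
        output.append(read)
    return output


reads = [Read("r1", 0), Read("r2", 0), Read("r3", 0), Read("r4", 16)]
duplicate_names = {"r2", "r3"}  # assumption: r2 and r3 are duplicates of r1

marked = apply_duplicate_policy(reads, duplicate_names, remove=False)
removed = apply_duplicate_policy(reads, duplicate_names, remove=True)

print([(r.name, r.flag) for r in marked])
print([r.name for r in removed])
```

Either way the representative read (r1 here) survives unchanged; marking
simply keeps the other copies around with the flag set.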
Q: If I choose to mark duplicates only, what downstream analyses might this
bias or affect?
A: It depends on the tools. Some are not duplicate aware, meaning they will
include the marked duplicates in their analysis. Other tools in Picard are
duplicate aware, as is the GATK, for example.
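As a rough illustration of what "duplicate aware" means in practice, the
sketch below counts reads from a few made-up SAM lines, with and without
honouring the duplicate bit (0x400) in the FLAG column. The records and the
`count_reads` helper are hypothetical; real tools each have their own
filtering options.

```python
DUPLICATE_FLAG = 0x400  # SAM FLAG bit: PCR or optical duplicate

# Made-up SAM records; the second column is the FLAG field.
SAM_LINES = [
    "@HD\tVN:1.4\tSO:coordinate",
    "r1\t0\tchr1\t100\t60\t5M\t*\t0\t0\tACGTA\tIIIII",
    "r2\t1024\tchr1\t100\t60\t5M\t*\t0\t0\tACGTA\tIIIII",
    "r3\t1024\tchr1\t100\t60\t5M\t*\t0\t0\tACGTA\tIIIII",
    "r4\t0\tchr1\t102\t60\t5M\t*\t0\t0\tGTACC\tIIIII",
]


def count_reads(sam_lines, skip_duplicates):
    """Count alignment records, optionally ignoring those flagged as duplicates."""
    count = 0
    for line in sam_lines:
        if line.startswith("@"):
            continue  # header line, not an alignment
        flag = int(line.split("\t")[1])
        if skip_duplicates and flag & DUPLICATE_FLAG:
            continue  # a duplicate-aware tool skips this read
        count += 1
    return count


unaware = count_reads(SAM_LINES, skip_duplicates=False)
aware = count_reads(SAM_LINES, skip_duplicates=True)
print(unaware, aware)
```

A tool that ignores the flag sees all four reads; a duplicate-aware one sees
two, which is the same result you would get from a file with the duplicates
removed.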
Q: Generated VCF files shouldn't be compromised by the presence of
duplicates, correct?
A: It depends on the tool you are using to generate the VCF, but generally
most variant callers ignore reads that are marked as duplicates.
N
On Wed, Nov 19, 2014 at 12:19 AM, Katharine Walter <
katharine.wal...@yale.edu> wrote:
> Hi,
>
> When using Picard's MarkDuplicates, is it preferable to only mark
> duplicates rather than remove them? Does the remove duplicates command
> remove all duplicated reads or does it leave one representative copy of the
> read (leaving all unique reads)? If I choose to mark duplicates only, what
> downstream analyses might this bias or affect? Generated VCF files
> shouldn't be compromised by the presence of duplicates, correct? Although
> the fold-coverage at variant sites will be affected depending on whether
> you decide to mark or remove.
>
> Thank you very much for your help!
>
> Best,
>
>
>
> _______________________________________________
> Samtools-help mailing list
> Samtools-help@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/samtools-help
>
>