Hi João, Robert and everyone,

First, thank you for your responses.

It was easy to predict that the question about discarding duplicates would get 
responses faster as it is the one that you can address quickly and accurately 
with common sense.

Briefly, I fully agree, we need to remove the PCR bias whenever possible. I was 
actually wondering about the false negative error introduced by this filter. 
Now that I think about it a little better, the benefit of the the decrease in 
false positives must comfortably outweigh the increase of false negative unless 
the ChIP peaks are very sharp. Good.

Question one still remains:

I read in the data with ShortRead. Now, how do I filter it and export it to a 
fomat that will allow me to follow along the example workflow? Class matters, 
doesn't it?

> load("alignedLocs.rda")
> ls()
[1] "alignedLocs"
> class(alignedLocs)
[1] "AlignedList"
attr(,"package")
[1] "chipseq"

Thank you,

Ivan






----- Original Message ----
From: João Fadista <[email protected]>
To: [email protected]; [email protected]
Sent: Thursday, 19 March, 2009 10:31:24
Subject: RE: [Bioc-sig-seq] Reducing Solexa's export.txt in preparation for 
aChIP-seq analysis.


Hi,

Removing duplicates is a step that you can do in order to minimize the possible 
bias due to the amplification in sample preparation. 

Best,
João

-----Original Message-----
From: [email protected] 
[mailto:[email protected]] On Behalf Of 
[email protected]
Sent: Thursday, March 19, 2009 3:24 PM
To: [email protected]
Subject: [Bioc-sig-seq] Reducing Solexa's export.txt in preparation for 
aChIP-seq analysis.


Hello,

In preparation to analyse my own ChIP-seq data, I am trying to follow the steps 
described in this sample workflow:

http://www.bioconductor.org/workshops/2008/SeattleNov08/ChIP-seq/workflow.pdf

The document starts by loading data that has been "reduced to a set of 
alignment start positions (including orientation)".

Can somebody elaborate on that a little bit or, ideally, show it with one 
example?

Also, as part of the reduction, the procedure "removed all duplicate reads and 
applied a quality score cutoff". The score cutoff is fine but how is removing 
duplicates justified?

Thank you,

Ivan




_______________________________________________
Bioc-sig-sequencing mailing list
[email protected]
https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing





_______________________________________________
Bioc-sig-sequencing mailing list
[email protected]
https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing

Reply via email to