Hello Everyone,
We are developing a new package for ChIP-Seq analysis. We use the ShortRead
package to import the short read data, e.g.
data <- readAligned("s_1_sequence.maq.map", type="MAQMap")
However, the resulting AlignedRead object holds all of the alignment data
(sequences, start, stop, strand, etc.), whereas for ChIP-Seq we do not really
need the short read sequences, only their position and strand information. Is
there a more direct/efficient way to do this? We know that we can convert/coerce
the AlignedRead object into a RangedData or similar object, but this is not very
efficient. Our experience is that the readAligned function can be very
demanding in both memory and time when reading large data files, even with 32 GB
of RAM (on a 64-bit version of R), and we think that this is mostly due to the
sequences being read.
We are wondering whether there are any filters available to import only the
start, the end and the strand, or perhaps a different R object or R function
that we should use?
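
To make the question concrete, here is a minimal sketch of the kind of
"positions only" import we have in mind. It is not a ShortRead function: the
name read_positions is hypothetical, and it assumes the alignments have first
been exported to tab-delimited text with `maq mapview`, with an assumed
16-column layout (read name, chromosome, position, strand, ..., read length in
column 14, sequence, quality). It relies only on base R's read.table, where a
colClasses entry of "NULL" skips that column, so the sequences and qualities
are never stored in the R session.

```r
# Hypothetical helper (not part of ShortRead): keep only chromosome, start,
# end and strand from a tab-delimited text alignment file.
# ASSUMPTION: 16 columns per line, as printed by `maq mapview`.
read_positions <- function(file) {
  col_classes <- rep("NULL", 16)         # "NULL" makes read.table skip a column
  col_classes[c(2, 4)] <- "character"    # chromosome, strand
  col_classes[c(3, 14)] <- "integer"     # start position, read length
  aln <- read.table(file, sep = "\t", quote = "", comment.char = "",
                    colClasses = col_classes,
                    col.names = c("name", "chr", "start", "strand",
                                  paste0("V", 5:13), "width", "seq", "qual"),
                    stringsAsFactors = FALSE)
  # End coordinate derived from start and read length (inclusive)
  aln$end <- aln$start + aln$width - 1L
  aln[, c("chr", "start", "end", "strand")]
}

# Tiny made-up input (two fabricated reads), for illustration only
tmp <- tempfile()
writeLines(c(
  paste(c("r1", "chr1", 100, "+", 0, 0, 30, 30, 0, 0, 0, 1, 0, 36,
          strrep("A", 36), strrep("I", 36)), collapse = "\t"),
  paste(c("r2", "chr2", 250, "-", 0, 0, 30, 30, 0, 0, 0, 1, 0, 36,
          strrep("C", 36), strrep("I", 36)), collapse = "\t")
), tmp)
pos <- read_positions(tmp)
```

The file is still scanned in full, so this would save memory rather than
parsing time, and it bypasses the AlignedRead class entirely, which is why we
would prefer an existing filter or object if one is available.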
Thanks
Arnaud.
_______________________________________________
Bioc-sig-sequencing mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing