Re: [Bioc-sig-seq] Rle vs RangedData

Patrick Aboyoun Mon, 29 Jun 2009 11:25:46 -0700

Simon,

Could you provide timings for Rle element extraction because I have beentrying to provide speedups for bottlenecks. If the need is to performmultiple element extraction, as Wolfgang suggests, then "[" for Rleshould be performant since it only calculates the start values once:


## from the internals of "[" for Rle
output <- runValue(x)[findInterval(i, start(x))]
if (!drop) output <- Rle(output)



Patrick



Wolfgang Huber wrote:

Hi Simon
just to be sure - what is n? Number of segments, or length of the(expanded) sequence?
And rather than looking at the time needed to access a single value ata certain position, shouldn't you be looking at the time needed toaccess the values on a complete equi-spaced grid from begin to end ofthe sequence?
    bw Wolfgang


Simon Anders ha scritto:
Hi Michael

Michael Lawrence wrote:
An Rle object, even if it only stores the widths, would be betterthan RangedData. Just getting the starts out of a RangedData is anO(n) operation, and there is in general a lot of overhead forfunctionality that is not useful in your case.
Thanks.

But wait a second: Isn't there a slot "starts" in a RangedData object?
So why would it be O(n) if this information is already there?

My concern was that getting the starts (or even just getting a value at
a given position) from an Rle object would be O(n) because the Rle
object does not contain the starts, only the lengths of the intervals.

So, what information is now stored where?

Cheers
  Simon
Best wishes
     Wolfgang

------------------------------------------------
Wolfgang Huber, EMBL, http://www.ebi.ac.uk/huber


_______________________________________________
Bioc-sig-sequencing mailing list
[email protected]
https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing

Re: [Bioc-sig-seq] Rle vs RangedData

Reply via email to