[Bioc-sig-seq] Rle vs RangedData

Simon Anders Fri, 26 Jun 2009 03:33:45 -0700

Dear Michael and Patrick

As you may have noticed, my HilbertVis package requires the input datato be presented as ordinary vector. Obviously, it would be much betterfor performance to use a run-length-encoded vector (and the stand-aloneversion of HilbertVis already does that).

So, I wanted to add the functionality to use either Rle objects orRangedData objects as input to the hilbertDisplay function and gotconfused about the two classes.

Rle seems to be simple and lightweight, but I cannot see how I couldperform fast random access. If I want to access an element of the vectorwith a given position somewhere in the middle, I suppose I cannot avoidhaving to add up all the lengths in order to find the right value. Isthere any reason why you store lengths of the constant intervals in theRle object rather than their start points? In the latter case one couldachieve random access in time O(log n) as opposed to O(n). Or are thestart points cached somewhere internally?

RangedData does seem to store the data in the start/value scheme thatseems more advantageous to me. However, it has a rather heavyweight slotstructure. Do I understand correctly that the canonical way to get thestart and data vectors from a RangedData object 'rd' would be'start(rd)' and 'rd$score' (or maybe better 'rd[[1]]')?

As the most likely input for hilbertDisplay is the output of the'coverage' function, which is an Rle object, it seems to make sense tochange hilbertDisplay to accept this. However, for performance reasons,I then better convert to RangedData.


Would you agree?

Can you shed some lights about what you intended on when to use "Rle"and when "RangedData"?


Thanks.

  Simon


+---
| Dr. Simon Anders, Dipl. Phys.
| European Bioinformatics Institute (EMBL-EBI)
| Hinxton, Cambridgeshire, UK
| office phone +44-1223-492680, mobile phone +44-7505-841692
| preferred (permanent) e-mail: [email protected]

_______________________________________________
Bioc-sig-sequencing mailing list
[email protected]
https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing

[Bioc-sig-seq] Rle vs RangedData

Reply via email to