yes, and they've developed some interesting high-performance algorithms,
which i've scanned, but need to take a good look at.

the computational bio guys love it because they have long strings of
base pairs that they want to index. and suffix arrays are the ticket
for that. the reason they love suffix arrays is that there is no
natural "word".
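
to make that concrete, here is a minimal sketch of a suffix array over a
string of bases. it is only an illustration, not one of the
high-performance algorithms mentioned above: the construction is the naive
sort of all suffixes, and the names build_suffix_array and find_all are
made up for this example.

```python
def build_suffix_array(text):
    # naive construction: sort the start positions of all suffixes by
    # the suffix each one begins. fine for a demo, far too slow for
    # genome-sized input.
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_all(text, sa, pattern):
    # every occurrence of pattern is a prefix of some suffix, and those
    # suffixes sit next to each other in sorted order. two binary
    # searches find the ends of that block.
    m = len(pattern)

    # first suffix whose m-character prefix is >= pattern
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # first suffix whose m-character prefix is > pattern
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    # start positions of the matches, in text order
    return sorted(sa[start:lo])


dna = "GATTACAGATTACA"
sa = build_suffix_array(dna)
print(find_all(dna, sa, "TTA"))
```

nothing here needs a word boundary: any substring can be looked up. the
cost is that sa holds one integer per character of the text, which is
where the index-bigger-than-the-data complaint comes from.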

text searching would be the opposite. words are the natural unit (in
speech there are no letters) and words are often repeated.
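
for contrast, a rough sketch of a word-based index, assuming a word is
simply a run of \w characters and that case doesn't matter. the name
build_word_index is made up for this example.

```python
import re
from collections import defaultdict


def build_word_index(text):
    # map each distinct word to the list of offsets where it starts.
    # a repeated word gets one key and a longer offset list.
    index = defaultdict(list)
    for m in re.finditer(r"\w+", text.lower()):
        index[m.group()].append(m.start())
    return index


index = build_word_index("the cat saw the other cat")
print(index["the"], index["cat"])
```

the index has one key per distinct word rather than one entry per
character, so repeated words share a key and only add an offset.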

- erik

Russ Cox <[EMAIL PROTECTED]> writes

| 
| > suffix arrays create an index that is bigger than the
| > original data. regardless of the theoretical O(1) mumble,
| > the size of the index is a major drawback.
| 
| That's true, but it depends a lot on the app.
| The computational biology guys seem to love them
| for indexing large amounts of DNA.
| 
| Russ
