Many thanks, Doug. 

A quick question, which class implements the following
logic? 

  org.apache.lucene.search.IndexSearcher? 

> For access, Lucene is equivalent to a B-Tree 
> with all but the leaves cached in memory, so
> that accesses require only a single disk access.


thanks,
Kan



--- Doug Cutting <[EMAIL PROTECTED]> wrote:

> B-Tree's are best for random, incremental updates. 
> They require 
> log_b(N) disk accesses for inserts, deletes and
> accesses, where b is the 
> number of entries per page, and N is the total
> number of entries in the 
> tree.  But that's too slow for text indexing. 
> Rather Lucene uses a 
> combination of file sorting and merging to update
> indexes, which is much 
> faster than a B-tree would be.  For access, Lucene
> is equivalent to a 
> B-Tree with all but the leaves cached in memory, so
> that accesses 
> require only a single disk access.
> 
> Slides 5 and 6 of the following presentation discuss
> this a bit:
> 
>
http://www.research.ibm.com/haifa/Workshops/ir2005/papers/DougCutting-Haifa05.pdf
> 
> Doug
> 
>
---------------------------------------------------------------------
> To unsubscribe, e-mail:
> [EMAIL PROTECTED]
> For additional commands, e-mail:
> [EMAIL PROTECTED]
> 
> 


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to