As I understand,

Lucene will store the files in following way per-segment

TIM file
     Field1 ---> Some byte[]
     Field2 ---> Some byte[]

TIP file
     Field1 ---> Some byte[]
     Field2 ---> Some byte[]


Blur will "sample" this lucene-file in the following way

Field1 --> <TIM, start-offset>, <TIP, start-offset>, ...

Field 2 --> <TIM, start-offset>, <TIP, start-offset>, ...

Is my understanding correct?

How does Blur warm-up the fields, when it does not know the "end-offset" or
the "length" for each field to warm.

Will it by default read all Terms of a field?

--
Ravi

Reply via email to