Daniel,
Yes, this is correct if you happen to be doing a radius search and sorting
by mileage.
Peter

On 11/3/06, Daniel Rosher <[EMAIL PROTECTED]> wrote:

Hi Peter,

Does this mean you are calculating the euclidean distance twice ... once
for
the HitCollecter to filter
'out of range' documents, and then again for the custom Comparator to sort
the returned documents?
especially since the filtering is done outside Lucene?

Regards,
Dan


>Joe,
>
>Fields with numeric values are stored in a separate file as binary values
in
>an internal format. Lucene is unaware of this file and unaware of the
range
>expression in the query. The range expression is parsed outside of Lucene
>and used in a custom HitCollector to filter out documents that aren't in
the
>requested range(s). A goal was to do this without having to modify
Lucene.
>Our scheme is pretty efficient, but not very general purpose in its
current
>form, though.
>
>Peter
>
>
>On 10/30/06, Joe Shaw <[EMAIL PROTECTED]> wrote:
>>
>> Hi Peter,
>>
>> On Fri, 2006-10-27 at 15:29 -0400, Peter Keegan wrote:
>> > Numeric range search is one of Lucene's weak points
(performance-wise)
>> so we
>> > have implemented this with a custom HitCollector and an extension to
the
>> > Lucene index files that stores the numeric field values for all
>> documents.
>> >
>> > It is important to point out that this has all been implemented with
the
>> > stock Lucene 2.0 library. No code changes were made to the Lucene
core.
>>
>> Can you give some technical details on the extension to the Lucene
index
>> files?  How did you do it without making any changes to the Lucene
core?
>>
>> Thanks,
>> Joe
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [EMAIL PROTECTED]
>> For additional commands, e-mail: [EMAIL PROTECTED]
>>
>>


Reply via email to