On Fri, May 14, 2010 at 11:21 AM, Michael McCandless
<[email protected]> wrote:
> On Fri, May 14, 2010 at 10:59 AM, Yonik Seeley
> <[email protected]> wrote:
>> On Fri, May 14, 2010 at 7:29 AM, Robert Muir <[email protected]> wrote:
>>> On Fri, May 14, 2010 at 5:14 AM, Michael McCandless
>>> <[email protected]> wrote:
>>>> Or just cutover to UTF8 order for trunk.
>>>
>>> I would really prefer we go this route, instead of trying to do any
>>> hacks at this point!
>>
>> Sounds good...
>> So it seems like the biggest issue we might have in cutting over would
>> be the field cache and sorting?  Instead of using String.compareTo we
>> need one that compares as UTF-32 (or longer term, don't even create
>> strings of course...)
>
> Actually, I think on changing to unicode codepoint order, the
> StringIndex returned by FieldCache would in fact be sorted in
> codepoint order (even though it's still a String[]), because it just
> enums the terms from TermsEnum.

Right... the FIeldCache will be ordered correctly... but when the sort
code compares values across segments?

-Yonik
Apache Lucene Eurocon 2010
18-21 May 2010 | Prague



> Mike
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to