On Fri, May 14, 2010 at 11:21 AM, Michael McCandless <[email protected]> wrote: > On Fri, May 14, 2010 at 10:59 AM, Yonik Seeley > <[email protected]> wrote: >> On Fri, May 14, 2010 at 7:29 AM, Robert Muir <[email protected]> wrote: >>> On Fri, May 14, 2010 at 5:14 AM, Michael McCandless >>> <[email protected]> wrote: >>>> Or just cutover to UTF8 order for trunk. >>> >>> I would really prefer we go this route, instead of trying to do any >>> hacks at this point! >> >> Sounds good... >> So it seems like the biggest issue we might have in cutting over would >> be the field cache and sorting? Instead of using String.compareTo we >> need one that compares as UTF-32 (or longer term, don't even create >> strings of course...) > > Actually, I think on changing to unicode codepoint order, the > StringIndex returned by FieldCache would in fact be sorted in > codepoint order (even though it's still a String[]), because it just > enums the terms from TermsEnum.
Right... the FIeldCache will be ordered correctly... but when the sort code compares values across segments? -Yonik Apache Lucene Eurocon 2010 18-21 May 2010 | Prague > Mike > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] > For additional commands, e-mail: [email protected] > > --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
