RE: Problem sort on TermEnum

Steven A Rowe Tue, 07 Apr 2009 10:56:45 -0700

On 4/7/2009 at 1:19 PM, Michael McCandless wrote:
> I think the new contrib/collation package may address this use case?
> It converts each term to its CollationKey, outside of Lucene.


Since AFAIK CollationKey creation is a one-way process, CollationKeyFilter may 
not be useful for Federica.

Federica, what use do you make of the terms returned by reader.terms()?  I ask 
because the new CollationKeyFilter would produce terms that would not be 
suitable for human consumption, but might be useful for other purposes.

Steve

> On Tue, Apr 7, 2009 at 7:36 AM, Federica Falini Data Management S.p.A
> <ffal...@datamanagement.it> wrote:
> > Good morning,
> > In Lucene 2.2 I modified Term.java and TermBuffer.java
> > (see below) in order to have Term enumerations sorted case-insensitively
> > (when a field is not tokenized):
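> > TermEnum terms = reader.terms(new Term("myFieldNotTokenized", ""));
> > // terms.term() can be null if there is no term at or after the start term
> > while (terms.term() != null
> >     && "myFieldNotTokenized".equals(terms.term().field())) {
> >   System.out.println("     " + terms.term());
> >   if (!terms.next()) break;
> > }
> > terms.close();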
> >
> > For example, instead of obtaining this sort order from TermEnum:
> >
> > Annales
> > Cafè
> > Zucche
> > cafe
> >
> > I need to obtain this:
> >
> > Annales
> > cafe
> > Cafè
> > Zucche
> >
> > Now in Lucene 2.4 I find this difficult because the "index" package has
> > changed a lot; can I have some pointers on how to keep my sort order?
> > Thanks in advance
> > Federica

