I checked the I post before
http://nagoya.apache.org/eyebrowse/SearchList?listId=&[EMAIL PROTECTED]&searchText=Che&defaultField=sender&Search=Search
mainly in two fields:
1. custom sorting beside default score sorting: make docID alias one field you need
output sorting
solved by sort data before indexing(example sorted by field PostDate), so docID can
be an alias to the sort field. if we make hitCollector
sort with docID or 1/docID or even complex stragety (docID * score)...
http://nagoya.apache.org/eyebrowse/ReadMsg?[EMAIL PROTECTED]&msgId=115469
IndexOrderSearcher: sort data before indexing and use 1/docID instead of score
2. CJK support:
2.1 sigram based(no word segment just use one character as a token): modified
from StandardTokenizer.java
http://nagoya.apache.org/eyebrowse/ReadMsg?[EMAIL PROTECTED]&msgId=330905
CJKTokenizer for Asia language(Chinese Japanese Korean) Word Segment
http://nagoya.apache.org/eyebrowse/ReadMsg?[EMAIL PROTECTED]&msgId=450266
StandardTokenizer with sigram based CJK Support
2.2 bigram based word segment: modified from SimpleTokenizer to CJKTokenizer.java
http://www.mail-archive.com/[email protected]/msg01220.html
Thank you
I also have some advise and working on lucene structure(Document Field Index) => XML
binding. If we Make a standard lucene.dtd as a default lucene input format maight be
use for applacation intergration with lucene.
Che, Dong
----- Original Message -----
From: "Peter Carlson" <[EMAIL PROTECTED]>
To: "Lucene Developers List" <[EMAIL PROTECTED]>
Sent: Sunday, September 08, 2002 2:08 PM
Subject: Re: fixed url and How to contribute code to lucene sandbox?
> I will add this to the contributions page.
>
> --Peter
> On Saturday, September 7, 2002, at 10:48 PM, Che Dong wrote:
>
> > http://www.chedong.com/tech/lucene.html
> >
> > fixed reference url with:
> > http://jakarta.apache.org/lucene/
> >
> > BTW:
> > How to contribute code to lucene sandbox?
> >
> >
> > Che, Dong
> >
> > ----- Original Message -----
> > From: "Otis Gospodnetic" <[EMAIL PROTECTED]>
> > To: "Lucene Developers List" <[EMAIL PROTECTED]>
> > Sent: Sunday, September 08, 2002 12:01 AM
> > Subject: Re: Lucene introduction in Chinese
> >
> >
> >> Thank you for this.
> >> I think we should add this to the contribution page or some other
> >> place
> >> on the Lucene site (I'll take a look in a bit).
> >> I would like to just add a link to it.
> >>
> >> Note: the link to Lucene's home page at the bottom of the page is
> >> wrong: http://jakarta.apache.org/Lucene/
> >> should be
> >> http://jakarta.apache.org/lucene/
> >>
> >> Thanks,
> >> Otis
> >>
> >>
> >
>
>
> --
> To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]>
> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>
>