Hello,
> I use Lucene 1.2 and I index a text document whose size is about 500 KB.
> (I use the Field.UnStored method.)
> It seems that only the beginning of this document is indexed!
> If I search for a term that is at the end of this document, I
> don't find it (but I do find terms at the beginning).
> So I split my document into 2 parts and indexed them, and now it
> works fine.
> 
> Is there a size limit for indexing a document?
You are right. There is a limit on the number of terms indexed for each
field, but you can change it. Look at maxFieldLength in
org.apache.lucene.index.IndexWriter. The default limit is 10000 terms.
A 500 KB document usually contains more terms than that; the exact count
depends on stop words and the amount of white space. That is why the end
of your document was ignored.
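
To make the effect concrete, here is a minimal sketch in plain Java that
imitates the truncation. It is not Lucene code: the class FieldLimitDemo
and its methods indexTerms and buildDocument are hypothetical names made
up for this illustration, and the whitespace tokenizing is a simplification
of what a real analyzer does. It only shows that a term beyond the limit
never reaches the index, while a term near the start does.

```java
import java.util.HashSet;
import java.util.Set;
import java.util.StringTokenizer;

public class FieldLimitDemo {

    // Hypothetical stand-in for an indexer: collects the terms of one field
    // but stops after maxFieldLength terms. This is an illustration only,
    // not the Lucene implementation.
    public static Set<String> indexTerms(String text, int maxFieldLength) {
        Set<String> indexed = new HashSet<>();
        StringTokenizer tokens = new StringTokenizer(text);
        int termCount = 0;
        while (tokens.hasMoreTokens() && termCount < maxFieldLength) {
            indexed.add(tokens.nextToken().toLowerCase());
            termCount++;
        }
        return indexed;
    }

    // Builds a test document: "firstterm", then fillerCount filler words,
    // then "lastterm" at the very end.
    public static String buildDocument(int fillerCount) {
        StringBuilder doc = new StringBuilder("firstterm ");
        for (int i = 0; i < fillerCount; i++) {
            doc.append("filler ");
        }
        doc.append("lastterm");
        return doc.toString();
    }

    public static void main(String[] args) {
        String doc = buildDocument(12000); // 12002 terms in total

        Set<String> withDefault = indexTerms(doc, 10000);
        Set<String> withRaised = indexTerms(doc, 100000);

        System.out.println("limit 10000, firstterm indexed: "
                + withDefault.contains("firstterm"));
        System.out.println("limit 10000, lastterm indexed:  "
                + withDefault.contains("lastterm"));
        System.out.println("limit 100000, lastterm indexed: "
                + withRaised.contains("lastterm"));
    }
}
```

With the limit at 10000 the last term is dropped; with a larger limit it is
kept. In Lucene itself, my understanding is that the 1.x IndexWriter exposes
maxFieldLength as a public int field that you can assign before adding the
document, but please treat that as an assumption and check the Javadoc of
your version.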
Regards,

-- 
Wolf-Dietrich Materna
Development
 
empolis GmbH -  arvato knowledge management 
Kekuléstr. 7
12489 Berlin, Germany
 
phone :  +49-30-6780-6510
fax :    +49-30-6780-6549
 
<mailto:[EMAIL PROTECTED]> <http://www.empolis.com>
