Thank you for your help, it solved my problem.

-----
Christophe

----- Original Message -----
From: "Materna, Wolf-Dietrich (empolis B)" <[EMAIL PROTECTED]>
To: "'Lucene Users List'" <[EMAIL PROTECTED]>
Sent: Wednesday, October 9, 2002 10:33
Subject: RE: Size limit for indexing ?


Hello,
> I use Lucene 1.2 and I am indexing a text document whose size is about 500 KB.
> (I use the Field.UnStored method.)
> It seems that only the beginning of this document is indexed!
> If I search for a term that is at the end of this document, I don't find it
> (but I do find terms at the beginning).
> So I split my document into 2 parts and indexed them, and now it works fine.
>
> Is there a size limit for indexing a document?
You are right. There is a limit on the number of terms for each field, but
you can change it. Look at org.apache.lucene.index.IndexWriter for
maxFieldLength. The default limit is 10000 terms. A 500 KB document contains
more terms than that; the exact count depends on stopwords and the amount of
white space. That is why the end of your document was ignored.
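
To illustrate why the tail of a long field goes missing, here is a minimal
sketch of a per-field term cap. It is a toy model, not Lucene code: the class
FieldLimitDemo and the method indexedTerms are hypothetical names, and plain
whitespace splitting stands in for a real analyzer. In Lucene itself the
setting is maxFieldLength on IndexWriter, as mentioned above; how it is
assigned should be checked against the Javadoc of your version.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.StringTokenizer;

// Toy model of a per-field term limit. This is NOT Lucene code; it only
// mimics the effect of a cap such as IndexWriter's maxFieldLength.
class FieldLimitDemo {

    // Returns the terms that would be kept if at most maxFieldLength
    // whitespace-separated terms of the field were indexed.
    static List<String> indexedTerms(String text, int maxFieldLength) {
        List<String> kept = new ArrayList<String>();
        StringTokenizer tokens = new StringTokenizer(text);
        while (tokens.hasMoreTokens() && kept.size() < maxFieldLength) {
            kept.add(tokens.nextToken());
        }
        return kept; // everything after the cap is silently dropped
    }

    public static void main(String[] args) {
        // Build a document with 12000 distinct terms: word0 ... word11999.
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < 12000; i++) {
            sb.append("word").append(i).append(' ');
        }
        String text = sb.toString();

        // With a cap of 10000 terms, the end of the document is not searchable.
        List<String> capped = indexedTerms(text, 10000);
        System.out.println(capped.contains("word0"));
        System.out.println(capped.contains("word11999"));

        // With a raised cap, the last term is kept as well.
        List<String> raised = indexedTerms(text, Integer.MAX_VALUE);
        System.out.println(raised.contains("word11999"));
    }
}
```

Raising the cap trades memory during indexing for completeness, so splitting
very large documents, as you did, remains a reasonable alternative.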
Regards,

--
Wolf-Dietrich Materna
Development

empolis GmbH -  arvato knowledge management
Kekuléstr. 7
12489 Berlin, Germany

phone :  +49-30-6780-6510
fax :    +49-30-6780-6549

<mailto:[EMAIL PROTECTED]> <http://www.empolis.com>

--
To unsubscribe, e-mail:
<mailto:[EMAIL PROTECTED]>
For additional commands, e-mail:
<mailto:[EMAIL PROTECTED]>

