Thank you for your help, it solved my problem. ----- Christophe
----- Message d'origine ----- De : "Materna, Wolf-Dietrich (empolis B)" <[EMAIL PROTECTED]> � : "'Lucene Users List'" <[EMAIL PROTECTED]> Envoy� : mercredi 9 octobre 2002 10:33 Objet : RE: Size limit for indexing ? Hello, > I use lucene 1.2 and I index a text document wich size is near 500 ko. > (I use Field.UnStored method) > It seems that only the beginning of this document is indexing ! > If I search a term that is at the end of this document, I > don't find it (but > If find term at the beginning). > So, I split my document in 2 parts and index them, and now it > works fine. > > Is there a limit size for indexing a document ? You are right. There is a limit for the number of terms for each field, but you can change it. Look at org.apache.lucene.index.IndexWriter for maxFieldLength. The default limit is set to 10000 terms. A 500k document contains more terms depending on stopwords and number of white spaces. That why the end of your document was ignored. Regards, -- Wolf-Dietrich Materna Development empolis GmbH - arvato knowledge management Kekul�str. 7 12489 Berlin, Germany phone : +49-30-6780-6510 fax : +49-30-6780-6549 < <mailto:[EMAIL PROTECTED]>> < <http://www.empolis.com>> -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]> -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>
