On Monday 21 November 2005 02:16, [EMAIL PROTECTED] wrote:
> Hi. I was wondering if anyone else has seen this
> before. I'm using lucene 1.4.3 and have indexed
> about 3000 text documents using the statement:
>
> doc.add(Field.Text("contents", new FileReader(f),
> true));
>
> When I go and retrieve the term frequency vectors, for
> any document under about 90k, everything looks as
> expected. However for larger documents (I haven't
> found the exact point, but I know that those over 128k
> qualify) the sum of the term frequencies in the vector
> seems to max out at 10001. ...

That's expected: IndexWriter.maxFieldLength limits the number of terms
indexed per field, and its default is 10,000. Have a look here:
http://wiki.apache.org/jakarta-lucene/LuceneFAQ#head-3558e5121806fb4fce80fc022d889484a9248b71

Regards,
Paul Elschot
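
To illustrate why the term frequencies stop growing, here is a minimal sketch of a per-field term cap. It is a simplified model and not Lucene's own code: the class FieldLengthCapDemo and its methods are hypothetical names, tokens are split on whitespace only, and the exact boundary in Lucene may differ by one (the 10001 reported above suggests it does). In Lucene 1.4.x the limit is, as far as I know, a public field on IndexWriter, so raising it before adding documents should lift the ceiling.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical demo class: models a per-field term cap, not Lucene's implementation.
public class FieldLengthCapDemo {

    // Counts term frequencies, but stops after maxFieldLength tokens,
    // mimicking an indexer that ignores the rest of a long field.
    public static Map<String, Integer> termFrequencies(String text, int maxFieldLength) {
        Map<String, Integer> freqs = new HashMap<>();
        int length = 0;
        for (String token : text.trim().split("\\s+")) {
            if (token.isEmpty()) {
                continue; // empty input yields one empty token; skip it
            }
            if (length >= maxFieldLength) {
                break; // cap reached: remaining tokens are never counted
            }
            freqs.merge(token, 1, Integer::sum);
            length++;
        }
        return freqs;
    }

    // Sums all frequencies in the vector, as done in the question above.
    public static int sumOfFrequencies(Map<String, Integer> freqs) {
        int sum = 0;
        for (int f : freqs.values()) {
            sum += f;
        }
        return sum;
    }

    public static void main(String[] args) {
        // Build a synthetic document of 25,000 tokens drawn from 500 distinct terms.
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < 25000; i++) {
            sb.append("term").append(i % 500).append(' ');
        }
        String doc = sb.toString();

        // With the cap, the sum stops at the cap regardless of document size.
        System.out.println(sumOfFrequencies(termFrequencies(doc, 10000)));             // prints 10000
        // With the cap effectively removed, every token is counted.
        System.out.println(sumOfFrequencies(termFrequencies(doc, Integer.MAX_VALUE))); // prints 25000
    }
}
```

Documents shorter than the cap are unaffected, which matches the observation that files under about 90k looked fine.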