> I would also like to identify the problematic document. I have 10000, so
> what would be the best way of identifying the one that is making zzBuffer
> grow without control?
Don't index your documents. Instead, pass each one directly to the analyzer and consume the TokenStream manually, then check TermAttribute.termLength() for each token. A document that yields an unusually long term is the likely cause of the zzBuffer growth.
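As a rough sketch of the scan itself (not the Lucene code), something like the following would report which documents contain an oversized token. Everything here is an assumption for illustration: the class name LongTokenScan, the 255-character threshold, the document ids, and the whitespace splitter that stands in for the analyzer. With Lucene you would get the tokens by consuming the analyzer's TokenStream and read each length from TermAttribute.termLength().

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Hypothetical helper: finds documents whose longest token exceeds a threshold.
class LongTokenScan {

    // Stand-in for the analyzer (assumption): splits on whitespace.
    // With Lucene, consume the analyzer's TokenStream here instead and
    // take each length from TermAttribute.termLength().
    static final Function<String, List<String>> WHITESPACE =
            text -> Arrays.asList(text.split("\\s+"));

    // Length of the longest token the tokenizer produces for one document.
    static int longestToken(String text, Function<String, List<String>> tokenizer) {
        int max = 0;
        for (String token : tokenizer.apply(text)) {
            max = Math.max(max, token.length());
        }
        return max;
    }

    // Ids of all documents that contain a token longer than the threshold.
    static List<String> suspects(Map<String, String> docs, int threshold,
                                 Function<String, List<String>> tokenizer) {
        List<String> ids = new ArrayList<>();
        for (Map.Entry<String, String> entry : docs.entrySet()) {
            if (longestToken(entry.getValue(), tokenizer) > threshold) {
                ids.add(entry.getKey());
            }
        }
        return ids;
    }

    public static void main(String[] args) {
        // One ordinary document and one with a 5000-character run.
        String hugeToken = new String(new char[5000]).replace('\0', 'x');
        Map<String, String> docs = new LinkedHashMap<>();
        docs.put("doc-1", "a short ordinary sentence");
        docs.put("doc-2", "normal text " + hugeToken + " more text");
        System.out.println(suspects(docs, 255, WHITESPACE)); // prints [doc-2]
    }
}
```

Running the scan outside the indexer means a bad document only costs you one pass over its text, and you can log the id and the offending length without touching the index.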