You cannot modify the term dictionary of an index in place; see my other eMail. You have to filter it by copying to a new index or by reindexing. In-place document modifications are not supported in Lucene or other inverted indexes.
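
To illustrate the "filter by copying to a new index" idea, here is a minimal sketch in plain Java. It is a toy model only and does not use Lucene's API: the class name FilteredIndexCopy, its methods, and the document-frequency threshold are all hypothetical names chosen for this example. The point it tries to show is that the source index is never edited; a second index is built that simply leaves out the unwanted terms.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;

// Toy model, NOT Lucene: all names here are hypothetical and for illustration only.
class FilteredIndexCopy {

    // Build a tiny inverted index: term -> ids of the documents containing it.
    static Map<String, SortedSet<Integer>> buildIndex(List<String> docs) {
        Map<String, SortedSet<Integer>> index = new TreeMap<>();
        for (int docId = 0; docId < docs.size(); docId++) {
            for (String token : docs.get(docId).toLowerCase().split("\\s+")) {
                if (token.isEmpty()) {
                    continue; // skip the empty token produced by leading whitespace
                }
                index.computeIfAbsent(token, t -> new TreeSet<>()).add(docId);
            }
        }
        return index;
    }

    // Collect the terms that occur in at least minDocFreq documents.
    static Set<String> highFreqTerms(Map<String, SortedSet<Integer>> index, int minDocFreq) {
        Set<String> result = new TreeSet<>();
        for (Map.Entry<String, SortedSet<Integer>> e : index.entrySet()) {
            if (e.getValue().size() >= minDocFreq) {
                result.add(e.getKey());
            }
        }
        return result;
    }

    // Copy the index into a NEW one, leaving out the given terms.
    // The source index is only read, never modified.
    static Map<String, SortedSet<Integer>> copyWithout(
            Map<String, SortedSet<Integer>> source, Set<String> termsToDrop) {
        Map<String, SortedSet<Integer>> target = new TreeMap<>();
        for (Map.Entry<String, SortedSet<Integer>> e : source.entrySet()) {
            if (!termsToDrop.contains(e.getKey())) {
                target.put(e.getKey(), new TreeSet<>(e.getValue()));
            }
        }
        return target;
    }

    public static void main(String[] args) {
        List<String> docs = new ArrayList<>(
                Arrays.asList("the quick fox", "the lazy dog", "the fox sleeps"));
        Map<String, SortedSet<Integer>> index = buildIndex(docs);
        Set<String> common = highFreqTerms(index, 3);
        Map<String, SortedSet<Integer>> filtered = copyWithout(index, common);
        System.out.println("dropped terms: " + common);
        System.out.println("remaining terms: " + filtered.keySet());
    }
}
```

In a real Lucene setup the same shape would apply, but the copy step would go through Lucene's own reader and writer classes rather than a map, and stored fields, norms and positions would need handling too.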
-----
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de

> -----Original Message-----
> From: Shaya Potter [mailto:spot...@gmail.com]
> Sent: Wednesday, August 15, 2012 8:44 PM
> To: java-user@lucene.apache.org
> Subject: Re: easy way to figure out most common tokens?
>
> On 08/15/2012 02:34 PM, Ahmet Arslan wrote:
> >> Is there an easy way to figure out
> >> the most common tokens and then remove those tokens from the
> >> documents.
> >
> > Probably this :
> > http://lucene.apache.org/core/3_6_1/api/all/org/apache/lucene/misc/HighFreqTerms.html
>
> ah, that's a good part 1. Then the Q would then be, how to modify the index
> without reindexing all documents.
>
> my gut is that it should be possible (it seems luke does it), but never went deep
> into the document object besides for adding fields.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> For additional commands, e-mail: java-user-h...@lucene.apache.org