[EMAIL PROTECTED] wrote:
I would like suggest a tipp:
- Download Luke from http://www.getopt.org/luke.
- Open a segment index in it.
- Select overview
- use 'top ranking terms' in the common-terms.utf8

Yes, this is a good idea.

Instead of Luke, one can use the following command to generate this file:

bin/nutch org.apache.nutch.indexer.HighFreqTerms -count 10 -nofreqs index

Doug


------------------------------------------------------- SF email is sponsored by - The IT Product Guide Read honest & candid reviews on hundreds of IT Products from real users. Discover which products truly live up to the hype. Start reading now. http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to