[EMAIL PROTECTED] wrote:
I would like suggest a tipp:
- Download Luke from http://www.getopt.org/luke.
- Open a segment index in it.
- Select overview
- use 'top ranking terms' in the common-terms.utf8
Yes, this is a good idea.
Instead of Luke, one can use the following command to generate this file:
bin/nutch org.apache.nutch.indexer.HighFreqTerms -count 10 -nofreqs index
Doug
-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now.
http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general