Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by DavidCary: http://wiki.apache.org/nutch/FAQ The comment on the change is: HITS algorithm ------------------------------------------------------------------------------ <property> <name>http.content.limit</name> <value>'''150000'''</value> - </property> + </property> If you do not want to limit the size of downloaded documents, set http.content.limit to a negative value. ---- @@ -111, +111 @@ bin/nutch org.apache.nutch.indexer.HighFreqTerms -count 10 -nofreqs index + Q: What ranking algorithm is used in searches ? Does Nutch use the [http://en.wikipedia.org/wiki/HITS_algorithm HITS algorithm] ? - ---- + ---- == Crawling == - ---- + ---- == Discussion == ------------------------------------------------------- This SF.Net email is sponsored by: NEC IT Guy Games. How far can you shotput a projector? How fast can you ride your desk chair down the office luge track? If you want to score the big prize, get to know the little guy. Play to win an NEC 61" plasma display: http://www.necitguy.com/?r=20 _______________________________________________ Nutch-cvs mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-cvs
