Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The following page has been changed by DavidCary:
http://wiki.apache.org/nutch/FAQ

The comment on the change is:
HITS algorithm

------------------------------------------------------------------------------
  <property>
    <name>http.content.limit</name>
    <value>'''150000'''</value>
- </property> 
+ </property>
  
  If you do not want to limit the size of downloaded documents, set http.content.limit to a negative value.
  ----
@@ -111, +111 @@

  bin/nutch org.apache.nutch.indexer.HighFreqTerms -count 10 -nofreqs index
  
  
+ Q: What ranking algorithm is used in searches ? Does Nutch use the [http://en.wikipedia.org/wiki/HITS_algorithm HITS algorithm] ?
  
- ----  
+ ----
  == Crawling ==
  
- ---- 
+ ----
  
  == Discussion ==
  


_______________________________________________
Nutch-cvs mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-cvs
