Hi, I know the properties in nutch-0.9/conf/nutch-default.xml boost the weight of certain elements on a page when that page is getting ranked in the index.
I need to understand all the factors in how a page is ranking in the nutch index. currently this is what I know. url --> 40 % anchor --> 20 % title --> 15 % host --> 20 % phrase --> 10 % How each of these fields influences the index rating? Is there a way to change the algorythm of field's importance(nu just change the boost, but change the whole logic) Regards, Dimitri -- View this message in context: http://www.nabble.com/How-does-the-nutch-index-work-tp17088222p17088222.html Sent from the Nutch - Agent mailing list archive at Nabble.com.