I know the properties in nutch-0.9/conf/nutch-default.xml boost the weight
of certain elements on a page when that page is ranked in the index.

I need to understand all the factors in how a page is ranked in Nutch.

Currently, this is what I know:

url --> 40 %
anchor --> 20 %
title --> 15 %
host --> 20 %
phrase --> 10 %
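
For reference, here is a sketch of where I believe these numbers come from.
The property names and values below are as I recall them from
nutch-0.9/conf/nutch-default.xml, so treat them as assumptions and check them
against your own copy. The percentages above look like these boost values
multiplied by ten, which would make them relative weights rather than shares
that add up to 100. As far as I understand, overrides belong in
conf/nutch-site.xml rather than in nutch-default.xml itself:

```xml
<?xml version="1.0"?>
<!-- conf/nutch-site.xml: overrides for the query-time field boosts.
     Names and values are recalled from nutch-default.xml (assumption);
     verify them against your installation before relying on them. -->
<configuration>
  <property>
    <name>query.url.boost</name>
    <value>4.0</value>
  </property>
  <property>
    <name>query.anchor.boost</name>
    <value>2.0</value>
  </property>
  <property>
    <name>query.title.boost</name>
    <value>1.5</value>
  </property>
  <property>
    <name>query.host.boost</name>
    <value>2.0</value>
  </property>
  <property>
    <name>query.phrase.boost</name>
    <value>1.0</value>
  </property>
</configuration>
```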

How does each of these fields influence the ranking in the index? Is there a
way to change the algorithm that determines a field's importance (not just
change the boost, but change the whole logic)?

Sent from the Nutch - Agent mailing list archive at Nabble.com.
