My exact problem is the following: I want to write a scoring function that
increases a URL's score by 10 for every .jpg image the page contains. In the
distributeScoreToOutlinks method I added the following:
    for (ScoreDatum free : scoreData) {
        try {
            String aleos = free.getUrl();
            if (aleos.contains(".jpg")) {
                adjust += 10.0f;
            }
        } catch (Exception e) {}
    }
    float aleks = row.getScore();
    row.setScore(aleks + adjust);
For example, http://www.uefa.com/ contains ~25 .jpg images and gets a score of
~251 with my scoring plugin. At depth 2 that score goes to 502, at depth 3 to
1004, etc.
I want that page's score to stay at 251, and I do not want the page to be
refetched and its score updated again. I think my problem is that, at the
beginning of each crawl cycle, Nutch updates http://www.uefa.com/ again, which
is my starting (seed) URL.
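To illustrate the behaviour I am after, here is a minimal stand-alone sketch in plain Java. It does not use the Nutch API: the class JpgBonusSketch, the method applyBonusOnce, and the in-memory map and set are all hypothetical names of mine. In a real plugin the "already adjusted" flag would presumably have to be stored with the page itself, which is only an assumption on my part.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical stand-alone model of the desired behaviour; not Nutch code.
public class JpgBonusSketch {
    static final float BONUS_PER_JPG = 10.0f;

    // Current score per page URL (stands in for row.getScore()/setScore()).
    private final Map<String, Float> scores = new HashMap<>();
    // Pages whose .jpg bonus has already been applied in an earlier cycle.
    private final Set<String> alreadyAdjusted = new HashSet<>();

    public void setScore(String url, float score) {
        scores.put(url, score);
    }

    public float getScore(String url) {
        return scores.getOrDefault(url, 0.0f);
    }

    // Adds 10 per .jpg outlink, but only the first time a page is processed.
    public void applyBonusOnce(String pageUrl, List<String> outlinks) {
        if (!alreadyAdjusted.add(pageUrl)) {
            return; // bonus was applied before, leave the score untouched
        }
        float adjust = 0.0f;
        for (String link : outlinks) {
            if (link != null && link.contains(".jpg")) {
                adjust += BONUS_PER_JPG;
            }
        }
        scores.put(pageUrl, getScore(pageUrl) + adjust);
    }

    public static void main(String[] args) {
        JpgBonusSketch sketch = new JpgBonusSketch();
        String page = "http://www.uefa.com/";
        // Made-up outlinks for illustration only.
        List<String> outlinks = Arrays.asList(
                "http://www.uefa.com/a.jpg",
                "http://www.uefa.com/b.jpg",
                "http://www.uefa.com/index.html");
        sketch.setScore(page, 1.0f);
        // Simulate three crawl cycles that all process the same page.
        for (int depth = 1; depth <= 3; depth++) {
            sketch.applyBonusOnce(page, outlinks);
            System.out.println("depth " + depth + ": " + sketch.getScore(page));
        }
    }
}
```

In this sketch the score is raised once (by 20 for the two made-up .jpg links) and then stays the same in every later cycle, which is what I would like to happen to http://www.uefa.com/ in Nutch.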
Any ideas?
Thank you in advance!