On 09.07.2011 19:20, Markus Jelsma wrote:
Score and fetch time if i'm not mistaken.
BTW: I am still not too familiar with the nutch scoring. It is a little
bit confusing, that the word "scoring" seems to be used for two things.
First, I think there is a score witch nutch calculates for each page,
based on the inlinks from other pages to this site.
Second, there is a scoring in Lucene for search request with is
calculated while performing a request.
Can somebody please confirm or correct me, if I am right with these thesis?
Furthermore, can someone explain or give some resources of "the crawl
time scoring" is performed in nutch?
That would really help me
when we see: "Generator: Selecting best-scoring urls due for fetch"
What is the criteria for best scoring urls ?