I want to use nutch as an environment to test my proposed algorithm for web
mining
1- Where exactly does the nutch score take place ? in which packages or files?
2- Can the LinkAnalysisTool be run at the intranet level?, some documents
mentioned that it can take place only at the whole web
Hi,
The scoring in Nutch-08 is done in a plugin: scoring-opic. It is called from
Indexr.java
HTH
-Original Message-
From: ahmed ghouzia [mailto:[EMAIL PROTECTED]
Sent: Friday, May 26, 2006 3:16 PM
To: nutch-user@lucene.apache.org; nutch-dev@incubator.apache.org
Subject: Where exactly
[
http://issues.apache.org/jira/browse/NUTCH-273?page=comments#action_12413528 ]
Doug Cutting commented on NUTCH-273:
Redirects should really not be followed immediately anyway. We should instead
note that it was redirected and to which URL in the
CrawlDatum should store IP address
--
Key: NUTCH-289
URL: http://issues.apache.org/jira/browse/NUTCH-289
Project: Nutch
Type: Bug
Components: fetcher
Versions: 0.8-dev
Reporter: Doug Cutting
If the CrawlDatum stored