Matt Kangas wrote:
> #2 should be a pluggable/hookable parameter. "high-scoring" sounds like
> a reasonable default basis for choosing recrawl intervals, but I'm sure
> that nearly everyone will think of a way to improve upon that for their
> particular system.
>
> e.g. "high-scoring" ain't gonna cut it for my needs. (0.5 wink ;)

In NUTCH-61, Andrzej has a pluggable FetchSchedule. That looks like a
good idea.
http://issues.apache.org/jira/browse/NUTCH-61
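
To make the "pluggable" idea concrete, here is a rough sketch of what such a hook could look like. This is not the FetchSchedule API from NUTCH-61; the names RecrawlPolicy, ScoreBasedPolicy, ChangeBasedPolicy and FetchScheduleSketch, and the choice of inputs (a score and a changed flag), are assumptions made up for illustration only.

```java
import java.time.Duration;

// Hypothetical plug-in point: decides how long to wait before recrawling a page.
// Illustrative only; not the interface proposed in NUTCH-61.
interface RecrawlPolicy {
    Duration nextInterval(double score, boolean changedSinceLastFetch);
}

// Assumed default: high-scoring pages are recrawled more often.
final class ScoreBasedPolicy implements RecrawlPolicy {
    private final Duration base;

    ScoreBasedPolicy(Duration base) {
        this.base = base;
    }

    @Override
    public Duration nextInterval(double score, boolean changedSinceLastFetch) {
        // Scores at or below 1.0 keep the base interval; higher scores shorten it.
        double divisor = Math.max(1.0, score);
        return Duration.ofMillis((long) (base.toMillis() / divisor));
    }
}

// An alternative a site might plug in: ignore the score and react to observed changes.
final class ChangeBasedPolicy implements RecrawlPolicy {
    private final Duration base;

    ChangeBasedPolicy(Duration base) {
        this.base = base;
    }

    @Override
    public Duration nextInterval(double score, boolean changedSinceLastFetch) {
        // Halve the interval for pages that changed, double it for pages that did not.
        return changedSinceLastFetch ? base.dividedBy(2) : base.multipliedBy(2);
    }
}

class FetchScheduleSketch {
    public static void main(String[] args) {
        // The caller depends only on the interface, so the policy can be swapped freely.
        RecrawlPolicy policy = new ScoreBasedPolicy(Duration.ofDays(30));
        System.out.println(policy.nextInterval(3.0, false));

        policy = new ChangeBasedPolicy(Duration.ofDays(30));
        System.out.println(policy.nextInterval(3.0, true));
    }
}
```

The point of the sketch is only that the score-based rule becomes one implementation among several, so a site whose needs "high-scoring" does not cover can supply its own without touching the crawler.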
Doug
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers