Hi.
Check the mail archive, some of theses things was already discussed
and I guess people already have some code / plans but it is not yet
part of the sources.
In any cases such contributions are very welcome from my point of view.
Stefan
Am 24.01.2006 um 11:08 schrieb Guenter, Matthias:
Hi
Would it be of interest for the project to have an extension of
crawl that allows:
- shaping the bandwidth used (inbound)
- keeping the number of request per second in a certain limit
- is able to schedule that with a difference between working hours
and night
And an extension that crawls only file: /http: requests which have
changed after a given date.
Sort of sh ./nutch crawl -changedafter="2006-01-04"?
The code could be delivered end of April as part of a student project.
Kind regards
Matthias Günter