Hi.
Check the mail archive, some of theses things was already discussed and I guess people already have some code / plans but it is not yet part of the sources.
In any cases such contributions are very welcome from my point of view.

Stefan


Am 24.01.2006 um 11:08 schrieb Guenter, Matthias:

Hi
Would it be of interest for the project to have an extension of crawl that allows:
- shaping the bandwidth used (inbound)
- keeping the number of request per second in a certain limit
- is able to schedule that with a difference between working hours and night

And an extension that crawls only file: /http: requests which have changed after a given date.
Sort of  sh ./nutch crawl -changedafter="2006-01-04"?

The code could be delivered end of April as part of a student project.

Kind regards

Matthias Günter



Reply via email to