Does such a property really exist? It doesn't in my Nutch trunk. I only see these:

$ grep generate\\. conf/*xml
conf/nutch-default.xml: <name>generate.max.per.host</name>
conf/nutch-default.xml: <name>generate.max.per.host.by.ip</name>
conf/nutch-default.xml: <name>generate.update.crawldb</name>
conf/nutch-default.xml: "generate.max.per.host.by.ip" - default settings are different only for

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

----- Original Message ----
> From: Euan Clark <[EMAIL PROTECTED]>
> To: [email protected]
> Sent: Sunday, April 20, 2008 8:33:47 PM
> Subject: generate.maxurls.per.domain.default exceptions file?
>
> Hi All,
>
> Does anyone know what the expected file format for
> 'generate.maxurls.per.domain.exceptions.file ' is? (nutch-site.xml para)
>
> Is it (e.g.): http://netloc 999 - somthing more like XML?
>
> Thanks, Euan
