Yes, it makes sense to provide a working setup. But since the http.agent.* properties depend on the user, what values would be sensible? At the very least, they should not suggest that nutch.apache.org operates the crawler.
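
For context, this is roughly the per-install override in conf/nutch-site.xml that each user writes today. It is only a sketch: the values are placeholders invented for illustration (example.com is not a real operator) and are not proposed defaults.

```xml
<?xml version="1.0"?>
<!-- conf/nutch-site.xml: site-specific overrides of conf/nutch-default.xml -->
<configuration>
  <property>
    <name>http.agent.name</name>
    <!-- Placeholder: should identify the crawler's operator, not the Nutch project -->
    <value>ExampleCrawler</value>
  </property>
  <property>
    <name>http.agent.description</name>
    <!-- Placeholder description -->
    <value>Test crawl run by Example Org</value>
  </property>
  <property>
    <name>http.agent.url</name>
    <!-- Placeholder URL explaining the crawl to webmasters -->
    <value>http://www.example.com/crawler.html</value>
  </property>
  <property>
    <name>http.agent.email</name>
    <!-- Placeholder contact address -->
    <value>crawler@example.com</value>
  </property>
</configuration>
```

Every one of these values names the person or organisation running the crawl, which is why a shipped default is hard to choose.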

On Tuesday 26 April 2011 19:24:33 Susam Pal wrote:
> I would suggest that these properties are set to sensible values in
> 'conf/nutch-default.xml' itself. I have found it inconvenient to
> override these properties every time I have installed Nutch. IMHO it
> would be good to have a working configuration available with the
> source code and distribution.
>
> Regards,
> Susam Pal

--
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350

