I would like one chapter on how to configure Nutch for focus crawling.. best practices and strategies... especially to avoid host-blocking.
On Mon, May 17, 2010 at 6:57 AM, Dennis Kubes <[email protected]> wrote: > Hi Everyone, > > It has been a long time coming but I have finally started to write a book > on Nutch. It will be self published and should be available in PDF / > paperback form in less than a month hopefully. > > A while back we discussed a Nutch training seminar on the list. I am not > ready to do a full on seminar yet but I will be putting up some training and > tutorial videos in the next few weeks. I will update the list as those > become available. > > I already have a general outline but it would help me to know the > following: > > 1) What types of things you would want explained in a book / videos on > Nutch? > 2) What are the biggest problems you face using Nutch? > 3) Anything special you would like answered or explained? > > Thanks in advance for any responses. > > Dennis > >

