There are very few books about Nutch. I have "Web Crawling and Data Mining with Apache Nutch" but suggest you'd do better reading on-line documentation and discussions or looking at the code. I agree with Chris MacNaughton's review https://chrismacnaughton.com/blog/2014/01/22/web_crawling_and_data_mining_with_apache_nutch/

Steven Hayles
Systems Analyst

IT Services, University of Leicester,
Propsect House, 94 Regent Rd, Leicester, LE1 7DA, UK

t: +44 (0)116 229 7950
e: [email protected]

Follow us on Twitter http://twitter.com/uniofleicester or
visit our Facebook page https://facebook.com/UniofLeicester


On Wed, 18 Jan 2017, Fengtan wrote:

Hi,

I am trying to list all books about Nutch -- here are the ones I have found:

  - Big data made easy : a working guide to the complete Hadoop toolset
  (Chapter 3) http://www.apress.com/us/book/9781484200957
  - Hadoop: The Definitive Guide, 2nd Edition (Chapter 16)
  http://shop.oreilly.com/product/0636920010388.do
  - Web Crawling and Data Mining with Apache Nutch
  https://www.amazon.ca/Crawling-Data-Mining-Apache-Nutch/dp/1783286857

Does anyone know about any other book ?

Also is it possible to have access to the wiki ? I may contribute a few
things... My id: FengTan

Reply via email to