Hi guys.. Just a quick question about the tutorial:
the line bin/nutch org.apache.nutch.crawl.DmozParser content.rdf.u8 -subset 5000 > dmoz/urls shouldn't be bin/nutch org.apache.nutch.tools.DmozParser content.rdf.u8 -subset 5000 > dmoz/urls -- Dr. Fabrizio SILVESTRI High Performance Computing Laboratory Information Science and Technologies Institute (ISTI), Italian National Research Council (CNR) Via G. Moruzzi, 1, 56126 Pisa, Italy Phone: +39 050 315-3011 (Direct) Mobile: +39 328-9552152 FAX: +39 050 3138091 (G3) WWW: http://miles.isti.cnr.it/~silvestr
