The tutorial on the Web site was updated for the unreleased 0.8 version by mistake. If you use Nutch 0.7.1, the tutorial should be somewhere in the nutch-0.7.1 bundle (I hope, but I am not sure). I will try to fix the tutorial on the Nutch Web site so that it provides information for 0.7.1, as it should.
Regards
Piotr
On 3/1/06, Patrice Neff <[EMAIL PROTECTED]> wrote:
>
> Hi Fabrizio
>
> > Now another problem, It all goes through smoothly as far as I reach
> > the nutch invertlinks crawl/linkdb crawl/segments command to issue...
> > In this case I receive the following error output:
> > [....]
>
> > java.io.IOException: No input directories specified in:
> > Configuration: defaults: hadoop-default.xml , mapred-default.xml , /tmp/hadoop/mapred/local/localRunner/job_mmx151.xmlfinal: hadoop-site.xml
>
> You didn't specify what command line you used. But probably you
> specified the path to the URL file instead of a directory. Follow the
> tutorial word by word and you'll see, that they put the DMOZ file
> into a directory (the file `urls' in the directory `dmoz' in the
> example). Then to inject you specify the directory name and not the
> file name.
>
> Cheers
> Patrice
>
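
For illustration, the directory-versus-file point in the quoted reply can be sketched roughly as follows. This is only a sketch under assumptions: the names `dmoz` and `urls` come from the example mentioned above, the seed URL is a placeholder, and the commented-out inject line (including the `crawl/crawldb` path) is an assumed 0.8-style invocation that should be checked against the tutorial for your version.

```shell
# Layout from the quoted example: a file `urls` inside a directory `dmoz`.
mkdir -p dmoz
printf '%s\n' 'http://example.org/' > dmoz/urls   # placeholder URL, not from the thread

# Pass the directory to inject, not the file inside it.
seed_dir=dmoz
if [ -d "$seed_dir" ]; then
  echo "ok: $seed_dir is a directory"
  # Assumed invocation, left commented out; verify against your version's tutorial:
  # bin/nutch inject crawl/crawldb "$seed_dir"
else
  echo "error: $seed_dir is not a directory" >&2
fi
```

Pointing `seed_dir` at `dmoz/urls` instead takes the error branch, which is the mistake the quoted reply suspects.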
