Hi Fabrizio
Now another problem, It all goes through smoothly as far as I reach
the nutch invertlinks crawl/linkdb crawl/segments command to issue...
In this case I receive the following error output:
[....]
java.io.IOException: No input directories specified in:
Configuration: defaults: hadoop-default.xml , mapred-default.xml , /
tmp/hadoop/mapred/local/localRunner/job_mmx151.xmlfinal: hadoop-
site.xml
You didn't specify what command line you used. But probably you
specified the path to the URL file instead of a directory. Follow the
tutorial word by word and you'll see, that they put the DMOZ file
into a directory (the file `urls' in the directory `dmoz' in the
example). Then to inject you specify the directory name and not the
file name.
Cheers
Patrice
-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general