Hi, On Fri, Nov 2, 2012 at 5:36 PM, cocofan <[email protected]> wrote:
> 2012-11-01 14:46:52,027 ERROR security.UserGroupInformation - > PriviledgedActionException as:cocofan I've never seen this Exception before...honestly. > cause:org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input > path does not exist: > file:/home/cocofan/Dropbox/project/apache-nutch-2.1/runtime/local/bin/urls > 2012-11-01 14:46:52,027 ERROR crawl.InjectorJob - InjectorJob: > org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does > not exist: The rest seems to be pretty straight forward. You appear to be running nutch from $NUTCH_HOME/runtime/local/bin with the following command ./nutch XYZ Unless you urls directory is located in the ./bin directory (which I doubt it is) then you should come up one directory and run the command from $NUTCH_HOME/runtime/local e.g. ./bin/nutch XYZ Does this make sense? Please read the tutorial carefully and thoroughly and it will work perfectly. hth Lewis

