Well I'm just using nutch in local mode, no hdfs (as far as I know)...  My
latest thing is trying to determine if there is a filesystem issue.  It's
not really clear what file is not found.  I have about 10 different
configs, this is just one of them and they all have the urls folder.  The
script worked for quite a while before this just started happening on it's
own.  That's why I'm suspecting a filesystem error.


On Tue, Jun 24, 2014 at 6:53 PM, kaveh minooie <[email protected]> wrote:

> you might want to check to see if
>
> > Injector: urlDir: di/urls
>
> still exist in your hdfs.
>
>
>
>
> On 06/24/2014 12:30 AM, John Lafitte wrote:
>
>> Using Nutch 1.7
>>
>> Out of the blue all of my crawl jobs started failing a few days ago.  I
>> checked the user logs and nobody logged into the server and there were no
>> reboots or any other obvious issues.  There is plenty of disk space.  Here
>> is the error I'm getting, any help is appreciated:
>>
>> Injector: starting at 2014-06-24 07:26:54
>> Injector: crawlDb: di/crawl/crawldb
>> Injector: urlDir: di/urls
>> Injector: Converting injected urls to crawl db entries.
>> Injector: ENOENT: No such file or directory
>> at org.apache.hadoop.io.nativeio.NativeIO.chmod(Native Method)
>> at org.apache.hadoop.fs.FileUtil.execSetPermission(FileUtil.java:701)
>>   at org.apache.hadoop.fs.FileUtil.setPermission(FileUtil.java:656)
>> at
>> org.apache.hadoop.fs.RawLocalFileSystem.setPermission(
>> RawLocalFileSystem.java:514)
>>   at
>> org.apache.hadoop.fs.RawLocalFileSystem.mkdirs(
>> RawLocalFileSystem.java:349)
>> at org.apache.hadoop.fs.FilterFileSystem.mkdirs(
>> FilterFileSystem.java:193)
>>   at
>> org.apache.hadoop.mapreduce.JobSubmissionFiles.getStagingDir(
>> JobSubmissionFiles.java:126)
>> at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:942)
>>   at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:936)
>> at java.security.AccessController.doPrivileged(Native Method)
>>   at javax.security.auth.Subject.doAs(Subject.java:416)
>> at
>> org.apache.hadoop.security.UserGroupInformation.doAs(
>> UserGroupInformation.java:1190)
>>   at org.apache.hadoop.mapred.JobClient.submitJobInternal(
>> JobClient.java:936)
>> at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:910)
>>   at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1353)
>> at org.apache.nutch.crawl.Injector.inject(Injector.java:281)
>>   at org.apache.nutch.crawl.Injector.run(Injector.java:318)
>> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>>   at org.apache.nutch.crawl.Injector.main(Injector.java:308)
>>
>>
> --
> Kaveh Minooie
>

Reply via email to