Okay, I got it working again.  Not sure exactly what happened, but fsck
didn't help.  I noticed the last line showed "native method" so moved the
native binaries out of the /lib folder.  Lo and behold, the next time I ran
it, it used the java libs and displayed the filename it was having a
problem with.  It
was /tmp/hadoop-root/mapred/staging/root850517656/.staging so given that I
just went and moved the /tmp/hadoop-root directory and then it started
working again.  Permissions looked fine, so it might have just been corrupt.

Thanks for the help!


On Tue, Jun 24, 2014 at 9:03 PM, John Lafitte <[email protected]>
wrote:

> Well I'm just using nutch in local mode, no hdfs (as far as I know)...  My
> latest thing is trying to determine if there is a filesystem issue.  It's
> not really clear what file is not found.  I have about 10 different
> configs, this is just one of them and they all have the urls folder.  The
> script worked for quite a while before this just started happening on it's
> own.  That's why I'm suspecting a filesystem error.
>
>
> On Tue, Jun 24, 2014 at 6:53 PM, kaveh minooie <[email protected]> wrote:
>
>> you might want to check to see if
>>
>> > Injector: urlDir: di/urls
>>
>> still exist in your hdfs.
>>
>>
>>
>>
>> On 06/24/2014 12:30 AM, John Lafitte wrote:
>>
>>> Using Nutch 1.7
>>>
>>> Out of the blue all of my crawl jobs started failing a few days ago.  I
>>> checked the user logs and nobody logged into the server and there were no
>>> reboots or any other obvious issues.  There is plenty of disk space.
>>>  Here
>>> is the error I'm getting, any help is appreciated:
>>>
>>> Injector: starting at 2014-06-24 07:26:54
>>> Injector: crawlDb: di/crawl/crawldb
>>> Injector: urlDir: di/urls
>>> Injector: Converting injected urls to crawl db entries.
>>> Injector: ENOENT: No such file or directory
>>> at org.apache.hadoop.io.nativeio.NativeIO.chmod(Native Method)
>>> at org.apache.hadoop.fs.FileUtil.execSetPermission(FileUtil.java:701)
>>>   at org.apache.hadoop.fs.FileUtil.setPermission(FileUtil.java:656)
>>> at
>>> org.apache.hadoop.fs.RawLocalFileSystem.setPermission(
>>> RawLocalFileSystem.java:514)
>>>   at
>>> org.apache.hadoop.fs.RawLocalFileSystem.mkdirs(
>>> RawLocalFileSystem.java:349)
>>> at org.apache.hadoop.fs.FilterFileSystem.mkdirs(
>>> FilterFileSystem.java:193)
>>>   at
>>> org.apache.hadoop.mapreduce.JobSubmissionFiles.getStagingDir(
>>> JobSubmissionFiles.java:126)
>>> at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:942)
>>>   at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:936)
>>> at java.security.AccessController.doPrivileged(Native Method)
>>>   at javax.security.auth.Subject.doAs(Subject.java:416)
>>> at
>>> org.apache.hadoop.security.UserGroupInformation.doAs(
>>> UserGroupInformation.java:1190)
>>>   at org.apache.hadoop.mapred.JobClient.submitJobInternal(
>>> JobClient.java:936)
>>> at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:910)
>>>   at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1353)
>>> at org.apache.nutch.crawl.Injector.inject(Injector.java:281)
>>>   at org.apache.nutch.crawl.Injector.run(Injector.java:318)
>>> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>>>   at org.apache.nutch.crawl.Injector.main(Injector.java:308)
>>>
>>>
>> --
>> Kaveh Minooie
>>
>
>

Reply via email to