Hi there,
I have Nutch 2.3.1 deployment.
I run in parallel 7 threads. Each of the threads  runs sequentially generate, 
fetch, parse, updatedb in circles and  using   REST API calls. Each of the 
threads uses its own  crawlId for REST API calls .   So I didn't expected 
exceptions,  but from time to time I receive an exception during  generate 
step. The problem is random.
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
java.lang.Exception: java.lang.RuntimeException: java.io.EOFException
        at 
org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:462)
        at 
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:529)
Caused by: java.lang.RuntimeException: java.io.EOFException
        at 
org.apache.hadoop.io.WritableComparator.compare(WritableComparator.java:164)
        at 
org.apache.hadoop.mapreduce.task.ReduceContextImpl.nextKeyValue(ReduceContextImpl.java:158)
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
Everything   work fine when I run  one thread. 
Is it bug or Nutch server supports only sequential calls: generate, fetch, 
parse, updated? 

Regards,
Vladimir Loubenski.

Reply via email to