Hi there,
I have Nutch 2.3.1 deployment.
I run in parallel 7 threads. Each of the threads runs sequentially generate,
fetch, parse, updatedb in circles and using REST API calls. Each of the
threads uses its own crawlId for REST API calls . So I didn't expected
exceptions, but from time to time I receive an exception during generate
step. The problem is random.
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
java.lang.Exception: java.lang.RuntimeException: java.io.EOFException
at
org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:462)
at
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:529)
Caused by: java.lang.RuntimeException: java.io.EOFException
at
org.apache.hadoop.io.WritableComparator.compare(WritableComparator.java:164)
at
org.apache.hadoop.mapreduce.task.ReduceContextImpl.nextKeyValue(ReduceContextImpl.java:158)
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
Everything work fine when I run one thread.
Is it bug or Nutch server supports only sequential calls: generate, fetch,
parse, updated?
Regards,
Vladimir Loubenski.