Amazon EMR returned the two logs below in the MAP task logs. All MAP tasks had either of the two logs below.
I'll try and farm the syslog directly from the machines this time. Do you know how to get more detailed logs from Amazon EMR? I'm running it again with the same configuration. attempt_201206200559_0032_m_000313_0 task_201206200559_0032_m_000313 10.76.89.196 FAILED Error: Java heap space attempt_201206200559_0032_m_000322_1 task_201206200559_0032_m_000322 10.242.110.38 FAILED java.lang.Throwable: Child Error at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:271) Caused by: java.io.IOException: Task process exit with nonzero status of 255. at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:258) -- View this message in context: http://lucene.472066.n3.nabble.com/Nutch-1-5-Error-Java-heap-space-during-MAP-step-of-CrawlDb-update-tp3990448p3990623.html Sent from the Nutch - User mailing list archive at Nabble.com.

