Hi, do you have a testcase that we can run to reproduce this? Thanks!
> -----Original Message-----
> From: jkupferman [mailto:[EMAIL PROTECTED]
> Sent: Monday, June 02, 2008 9:22 AM
> To: [email protected]
> Subject: Stack Overflow When Running Job
>
>
> Hi everyone,
> I have a job that keeps failing with stack overflows,
> and I really don't see how that is happening.
> The job runs for about 20-30 minutes before one task errors,
> then a few more error out and the job fails.
> I am running hadoop-17 and I've tried lowering these settings,
> to no avail:
> io.sort.factor 50
> io.seqfile.sorter.recordlimit 500000
>
> java.io.IOException: Spill failed
>     at org.apache.hadoop.mapred.MapTask$MapOutputBuffer$Buffer.write(MapTask.java:594)
>     at org.apache.hadoop.mapred.MapTask$MapOutputBuffer$Buffer.write(MapTask.java:576)
>     at java.io.DataOutputStream.writeInt(DataOutputStream.java:180)
>     at Group.write(Group.java:68)
>     at GroupPair.write(GroupPair.java:67)
>     at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:90)
>     at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:77)
>     at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.collect(MapTask.java:434)
>     at MyMapper.map(MyMapper.java:27)
>     at MyMapper.map(MyMapper.java:10)
>     at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:47)
>     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:219)
>     at org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2124)
> Caused by: java.lang.StackOverflowError
>     at java.io.DataInputStream.readInt(DataInputStream.java:370)
>     at Group.readFields(Group.java:62)
>     at GroupPair.readFields(GroupPair.java:60)
>     at org.apache.hadoop.io.WritableComparator.compare(WritableComparator.java:91)
>     at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.compare(MapTask.java:494)
>     at org.apache.hadoop.util.QuickSort.fix(QuickSort.java:29)
>     at org.apache.hadoop.util.QuickSort.sort(QuickSort.java:58)
>     at org.apache.hadoop.util.QuickSort.sort(QuickSort.java:58)
>     ... (the above line repeated 200x)
>
> I defined a WritableComparable called GroupPair, which simply
> holds two Group objects, each of which contains two integers.
> I fail to see how QuickSort could recurse 200+ times, since
> that would require an insanely large number of entries, far
> more than the 500 million that had been output at that point.
>
> How is this even possible? And what can be done to fix this?
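
On how a quicksort can recurse this deep: the recursion depth is set by how evenly each partition splits, not only by the number of records. With well-mixed keys the depth grows roughly with log n, but when the splits are lopsided it can grow linearly with n. The sketch below is a plain textbook quicksort, not Hadoop's org.apache.hadoop.util.QuickSort, and the class and method names (QuickSortDepthDemo, depthFor) are made up for illustration. Whether your keys trigger similar behaviour in Hadoop's sorter is an assumption that a testcase would have to confirm.

```java
import java.util.Random;

public class QuickSortDepthDemo {
    // Deepest recursion level reached during the most recent sort.
    static int maxDepth;

    // Textbook quicksort (Lomuto partition, last element as pivot).
    // Illustration only: this is NOT Hadoop's QuickSort implementation.
    static void sort(int[] a, int lo, int hi, int depth) {
        if (lo >= hi) {
            return; // zero or one element: nothing to sort
        }
        maxDepth = Math.max(maxDepth, depth);
        int pivot = a[hi];
        int i = lo;
        for (int j = lo; j < hi; j++) {
            if (a[j] < pivot) {
                int t = a[i]; a[i] = a[j]; a[j] = t;
                i++;
            }
        }
        // Move the pivot into its final position.
        int t = a[i]; a[i] = a[hi]; a[hi] = t;
        sort(a, lo, i - 1, depth + 1);
        sort(a, i + 1, hi, depth + 1);
    }

    // Sorts the array in place and returns the deepest recursion level seen.
    static int depthFor(int[] a) {
        maxDepth = 0;
        sort(a, 0, a.length - 1, 1);
        return maxDepth;
    }

    public static void main(String[] args) {
        int n = 2000;
        int[] random = new Random(42).ints(n, 0, 1_000_000).toArray();
        int[] equal = new int[n]; // every key identical (all zeros)
        System.out.println("random keys: depth " + depthFor(random));
        System.out.println("equal keys:  depth " + depthFor(equal));
    }
}
```

In this sketch, identical keys make every partition peel off a single element, so the depth tracks the input size instead of its logarithm. A few hundred nested sort() frames therefore do not require an enormous number of records, only a run of keys that partitions badly.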
> --
> View this message in context:
> http://www.nabble.com/Stack-Overflow-When-Running-Job-tp17593594p17593594.html
> Sent from the Hadoop core-user mailing list archive at Nabble.com.
>
>