Gopal. That was exactly it. As always, a succinct, accurate answer.
Thanks, -Kris On Mon, Feb 26, 2018 at 8:06 PM, Gopal Vijayaraghavan <gop...@apache.org> wrote: > Hi, > > > Caused by: java.lang.ArrayIndexOutOfBoundsException > > at org.apache.hadoop.mapred.MapTask$MapOutputBuffer$ > Buffer.write(MapTask.java:1453) > > In general HDP specific issues tend to get more attention on HCC, but this > is a pretty old issue stemming from MapReduce being designed for fairly > low-memory JVMs. > > The io.sort.mb size is the reason for this crash, it has a wrap-around > case where sort buffers which are > 1Gb trigger a corner case. > > As odd as this might sound, if you have fewer splits the sort buffer > wouldn't wrap around enough times to generate a -ve offset. > > You can lower the mapreduce.task.io.sort.mb to 1024Mb or lower as a slower > workaround. > > I ran into this issue in 2013 and started working on optimizing sort for > larger buffers for MapReduce (MAPREDUCE-4755), but ended up rewriting the > entire thing & then added it to Tez. > > Cheers, > Gopal > > >