Gopal. That was exactly it.

As always, a succinct, accurate answer.

Thanks,
-Kris

On Mon, Feb 26, 2018 at 8:06 PM, Gopal Vijayaraghavan <gop...@apache.org>
wrote:

> Hi,
>
> > Caused by: java.lang.ArrayIndexOutOfBoundsException
> > at org.apache.hadoop.mapred.MapTask$MapOutputBuffer$
> Buffer.write(MapTask.java:1453)
>
> In general HDP specific issues tend to get more attention on HCC, but this
> is a pretty old issue stemming from MapReduce being designed for fairly
> low-memory JVMs.
>
> The io.sort.mb size is the reason for this crash, it has a wrap-around
> case where sort buffers which are > 1Gb trigger a corner case.
>
> As odd as this might sound, if you have fewer splits the sort buffer
> wouldn't wrap around enough times to generate a -ve offset.
>
> You can lower the mapreduce.task.io.sort.mb to 1024Mb or lower as a slower
> workaround.
>
> I ran into this issue in 2013 and started working on optimizing sort for
> larger buffers for MapReduce (MAPREDUCE-4755), but ended up rewriting the
> entire thing & then added it to Tez.
>
> Cheers,
> Gopal
>
>
>

Reply via email to