[
https://issues.apache.org/jira/browse/HADOOP-7154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14505800#comment-14505800
]
Lari Hotari commented on HADOOP-7154:
-------------------------------------
A note about MALLOC_ARENA_MAX:
MALLOC_ARENA_MAX is broken on glibc < 2.15 (like Ubuntu 10.04) . The fix was
made for 2.16 and backported to 2.15 . MALLOC_ARENA_MAX doesn't work on Ubuntu
10.04 because of [this
bug|https://sourceware.org/bugzilla/show_bug.cgi?id=13071].
The same bug seems to be reported to Redhat as
https://bugzilla.redhat.com/show_bug.cgi?id=799327 . Other reports:
https://sourceware.org/bugzilla/show_bug.cgi?id=13137 ,
https://sourceware.org/bugzilla/show_bug.cgi?id=13754 ,
https://sourceware.org/bugzilla/show_bug.cgi?id=11261 .
This is the commit to glibc fixing the bug:
https://github.com/bminor/glibc/commit/41b81892f11fe1353123e892158b53de73863d62
(backport for 2.15 is
https://github.com/bminor/glibc/commit/7cf8e20d03a43b1375e90d381a16caa2686e4fdf
).
> Should set MALLOC_ARENA_MAX in hadoop-config.sh
> -----------------------------------------------
>
> Key: HADOOP-7154
> URL: https://issues.apache.org/jira/browse/HADOOP-7154
> Project: Hadoop Common
> Issue Type: Improvement
> Components: scripts
> Affects Versions: 0.22.0
> Reporter: Todd Lipcon
> Assignee: Todd Lipcon
> Priority: Minor
> Fix For: 1.0.4, 0.22.0
>
> Attachments: hadoop-7154.txt
>
>
> New versions of glibc present in RHEL6 include a new arena allocator design.
> In several clusters we've seen this new allocator cause huge amounts of
> virtual memory to be used, since when multiple threads perform allocations,
> they each get their own memory arena. On a 64-bit system, these arenas are
> 64M mappings, and the maximum number of arenas is 8 times the number of
> cores. We've observed a DN process using 14GB of vmem for only 300M of
> resident set. This causes all kinds of nasty issues for obvious reasons.
> Setting MALLOC_ARENA_MAX to a low number will restrict the number of memory
> arenas and bound the virtual memory, with no noticeable downside in
> performance - we've been recommending MALLOC_ARENA_MAX=4. We should set this
> in hadoop-env.sh to avoid this issue as RHEL6 becomes more and more common.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)