[jira] [Commented] (MAPREDUCE-7150) Optimize collections used by MR JHS to reduce its memory

Misha Dmitriev (JIRA) Fri, 12 Oct 2018 16:41:57 -0700


    [ 
https://issues.apache.org/jira/browse/MAPREDUCE-7150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16648589#comment-16648589
 ]


Misha Dmitriev commented on MAPREDUCE-7150:
-------------------------------------------

In Java, if some variable X is always updated in a {{synchronized}} block, the 
code that reads X doesn't need a read barrier to read the value that X is set 
to once the above {{synchronized}} block completes. What happens under the 
covers is that in the end of the synchronized block all the memory caches are 
flushed, and thus the latest value of X becomes visible to all CPUs/threads 
other than the one that updated X.

In our case, where {{map}} (I mean the {{.map}} field, not the contents of the 
{{ConcurrentSkipListMap}} that it points to) is updated only once, from null to 
its final value, it means that all other code will see it as either null or 
fully constructed (i.e. not in some inconsistent half-constructed state). 
Subsequently, the code that wants to read/update {{map}} contents, doesn't need 
to be synchronized. But both before and after my change, the unsynchronized 
code in {{write()}} that iterates {{map}} contents can do it in parallel with 
the code that updates {{map}} in {{findCounter()}}.

Anyway, it's unclear whether the way it worked before is considered correct/by 
design. So I'll make {{write()}} synchronized.

> Optimize collections used by MR JHS to reduce its memory
> --------------------------------------------------------
>
>                 Key: MAPREDUCE-7150
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7150
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: jobhistoryserver, mrv2
>            Reporter: Misha Dmitriev
>            Assignee: Misha Dmitriev
>            Priority: Major
>         Attachments: YARN-8872.01.patch, YARN-8872.02.patch, 
> jhs-bad-collections.png
>
>
> We analyzed, using jxray (www.jxray.com) a heap dump of JHS running with big 
> heap in a large clusters, handling large MapReduce jobs. The heap is large 
> (over 32GB) and 21.4% of it is wasted due to various suboptimal Java 
> collections, mostly maps and lists that are either empty or contain only one 
> element. In such under-populated collections considerable amount of memory is 
> still used by just the internal implementation objects. See the attached 
> excerpt from the jxray report for the details. If certain collections are 
> almost always empty, they should be initialized lazily. If others almost 
> always have just 1 or 2 elements, they should be initialized with the 
> appropriate initial capacity of 1 or 2 (the default capacity is 16 for 
> HashMap and 10 for ArrayList).
> Based on the attached report, we should do the following:
>  # {{FileSystemCounterGroup.map}} - initialize lazily
>  # {{CompletedTask.attempts}} - initialize with  capacity 2, given most tasks 
> only have one or two attempts
>  # {{JobHistoryParser$TaskInfo.attemptsMap}} - initialize with capacity
>  # {{CompletedTaskAttempt.diagnostics}} - initialize with capacity 1 since it 
> contains one diagnostic message most of the time
>  # {{CompletedTask.reportDiagnostics}} - switch to ArrayList (no reason to 
> use the more wasteful LinkedList here) and initialize with capacity 1.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (MAPREDUCE-7150) Optimize collections used by MR JHS to reduce its memory

Reply via email to