[ 
https://issues.apache.org/jira/browse/TEZ-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14106448#comment-14106448
 ] 

Tsuyoshi OZAWA commented on TEZ-1473:
-------------------------------------

The workload is wordcount against 20 GB of data generated by RandomTextWriter. 
YARN's container killer killed some containers when I used the following 
configuration:

{quote}
<property>
  <name>tez.task.launch.cmd-opts</name>
  <value>-server -Xmx4096m -Djava.net.preferIPv4Stack=true -XX:+UseNUMA 
-XX:+UseParallelGC -Dhadoop.metrics.log.level=WARN</value>
</property>
<property>
  <name>tez.task.resource.memory.mb</name>
  <value>4096</value>
</property>
<property>
  <name>tez.runtime.shuffle.buffersize</name>
  <value>8 * 1024</value>
</property>
{quote}

I can avoid the kills by reducing TEZ_RUNTIME_SHUFFLE_BUFFER:

{quote}
<property>
  <name>tez.runtime.shuffle.buffersize</name>
  <value>3072</value>
</property>
{quote}

Alternatively, I can avoid the kills by reducing the JVM heap size for tasks as follows:

{quote}
<property>
  <name>tez.task.launch.cmd-opts</name>
  <value>-server -Xmx3800m -Djava.net.preferIPv4Stack=true -XX:+UseNUMA 
-XX:+UseParallelGC -Dhadoop.metrics.log.level=WARN</value>
</property>
<property>
  <name>tez.task.resource.memory.mb</name>
  <value>4096</value>
</property>
<property>
  <name>tez.runtime.shuffle.buffersize</name>
  <value>8 * 1024</value>
</property>
{quote}
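
The heap-size workaround can be sanity-checked with simple arithmetic. YARN enforces the container memory limit on the whole task process, not only on the Java heap, so -Xmx probably needs to leave some room for non-heap memory (thread stacks, direct buffers, class metadata). The following is only a rough sketch in Python, not part of Tez or YARN: the helper names parse_xmx_mb and heap_headroom_mb are hypothetical, and the parser assumes -Xmx is given with an m or g suffix.

```python
import re


def parse_xmx_mb(cmd_opts):
    """Return the -Xmx value in MB, or None if the option is absent.

    Assumption: only the 'm' and 'g' suffixes are handled.
    """
    match = re.search(r"-Xmx(\d+)([mMgG])", cmd_opts)
    if match is None:
        return None
    size = int(match.group(1))
    return size * 1024 if match.group(2).lower() == "g" else size


def heap_headroom_mb(cmd_opts, container_mb):
    """Return container memory minus max heap, i.e. what is left for non-heap use."""
    xmx_mb = parse_xmx_mb(cmd_opts)
    if xmx_mb is None:
        raise ValueError("no -Xmx option found in cmd-opts")
    return container_mb - xmx_mb


# Values taken from the two configurations quoted in this comment.
failing_opts = "-server -Xmx4096m -Djava.net.preferIPv4Stack=true"
working_opts = "-server -Xmx3800m -Djava.net.preferIPv4Stack=true"

print(heap_headroom_mb(failing_opts, 4096))  # 0 MB left for non-heap memory
print(heap_headroom_mb(working_opts, 4096))  # 296 MB left for non-heap memory
```

With -Xmx4096m in a 4096 MB container there is no headroom at all, which is consistent with the kills; with -Xmx3800m there are 296 MB left. How much headroom is actually needed depends on the workload.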



> TEZ_RUNTIME_SHUFFLE_BUFFER is too large by default
> --------------------------------------------------
>
>                 Key: TEZ-1473
>                 URL: https://issues.apache.org/jira/browse/TEZ-1473
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Tsuyoshi OZAWA
>            Assignee: Tsuyoshi OZAWA
>         Attachments: TEZ-1473.1.patch
>
>
> TEZ_RUNTIME_SHUFFLE_BUFFER is 8GB by default, while 
> TEZ_TASK_RESOURCE_MEMORY_MB_DEFAULT is 1GB. This leads to OOM errors or to
> containers being killed by the container killer.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
