[
https://issues.apache.org/jira/browse/TAJO-743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13965056#comment-13965056
]
Hudson commented on TAJO-743:
-----------------------------
SUCCESS: Integrated in Tajo-master-build #164 (See
[https://builds.apache.org/job/Tajo-master-build/164/])
TAJO-743: Change the default resource allocation policy of leaf tasks. (jinho)
(jinossy: rev 743c52650d34ed67b777705816a127b7b5f1b0fa)
* tajo-storage/src/main/java/org/apache/tajo/storage/v2/DiskUtil.java
*
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/worker/WorkerHeartbeatService.java
*
tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/worker/TajoResourceAllocator.java
* CHANGES.txt
* tajo-common/src/main/java/org/apache/tajo/conf/TajoConf.java
> Change the default resource allocation policy of leaf tasks
> -----------------------------------------------------------
>
> Key: TAJO-743
> URL: https://issues.apache.org/jira/browse/TAJO-743
> Project: Tajo
> Issue Type: Improvement
> Components: resource manager
> Affects Versions: 0.8-incubating, 1.0-incubating
> Reporter: Jinho Kim
> Assignee: Jinho Kim
> Fix For: 0.8-incubating, 1.0-incubating
>
> Attachments: TAJO-743.patch, TAJO-743_branch-0.8.0.patch
>
>
> Currently, resource allocation is calculated by memory base. If a machine
> have a large memory, in default settings, heavy disk IO per disk is usually
> caused by high task concurrency. However, it is likely to seem to be
> problematic.
> When i tested the leaf task scan by 2(concurrency of SATA disk), the
> performance was better. if you have SAS Storage or SSD, you can increase the
> disk concurrency. This patch changes the default resource allocation policy
> to use disk resource.
> The following configs have been available so far:
> * tajo.worker.resource.disks - available disk resource of each worker
> * tajo.task.disk-slot.default - how many disk resource is consumed per task
> Below config is newly introduced in this patch
> * tajo.worker.resource.dfs-dir-aware - it can be true/false. If it is true,
> each worker uses the number of HDFS datanode's data dirs in the worker as the
> disk resource. So, tajo.worker.resource.disks is ignored.
--
This message was sent by Atlassian JIRA
(v6.2#6252)