[ 
https://issues.apache.org/jira/browse/TAJO-743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13965056#comment-13965056
 ] 

Hudson commented on TAJO-743:
-----------------------------

SUCCESS: Integrated in Tajo-master-build #164 (See 
[https://builds.apache.org/job/Tajo-master-build/164/])
TAJO-743: Change the default resource allocation policy of leaf tasks. (jinho) 
(jinossy: rev 743c52650d34ed67b777705816a127b7b5f1b0fa)
* tajo-storage/src/main/java/org/apache/tajo/storage/v2/DiskUtil.java
* tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/worker/WorkerHeartbeatService.java
* tajo-core/tajo-core-backend/src/main/java/org/apache/tajo/worker/TajoResourceAllocator.java
* CHANGES.txt
* tajo-common/src/main/java/org/apache/tajo/conf/TajoConf.java


> Change the default resource allocation policy of leaf tasks
> -----------------------------------------------------------
>
>                 Key: TAJO-743
>                 URL: https://issues.apache.org/jira/browse/TAJO-743
>             Project: Tajo
>          Issue Type: Improvement
>          Components: resource manager
>    Affects Versions: 0.8-incubating, 1.0-incubating
>            Reporter: Jinho Kim
>            Assignee: Jinho Kim
>             Fix For: 0.8-incubating, 1.0-incubating
>
>         Attachments: TAJO-743.patch, TAJO-743_branch-0.8.0.patch
>
>
> Currently, resource allocation is calculated based on memory. If a machine 
> has a large amount of memory, the default settings allow high task 
> concurrency, which usually causes heavy disk I/O on each disk. This is 
> likely to be problematic.
> When I tested leaf task scans with a concurrency of 2 per SATA disk, 
> performance was better. If you have SAS storage or SSDs, you can increase the 
> disk concurrency. This patch changes the default resource allocation policy 
> to use disk resources.
> The following configs have been available so far:
>  * tajo.worker.resource.disks - the disk resources available on each worker
>  * tajo.task.disk-slot.default - how much disk resource each task consumes
> The config below is newly introduced in this patch:
>  * tajo.worker.resource.dfs-dir-aware - can be true or false. If it is true, 
> each worker uses the number of HDFS datanode data dirs on that worker as its 
> disk resource, so tajo.worker.resource.disks is ignored.
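
To make the arithmetic behind the three configs concrete, here is a minimal sketch in Python. It is not the code in TajoResourceAllocator, which this message does not show. The function names, the example values (4 disks, 0.5 disk slot per task), and the data dir paths are all assumptions chosen for illustration. The sketch assumes that a worker's leaf task concurrency is its disk resource divided by the disk slot each task consumes, so a slot of 0.5 corresponds to the concurrency of 2 per disk mentioned in the description.

```python
import math


def effective_disks(configured_disks, dfs_dir_aware, datanode_data_dirs):
    """Disk resource a worker would advertise (hypothetical helper).

    configured_disks stands in for tajo.worker.resource.disks,
    dfs_dir_aware for tajo.worker.resource.dfs-dir-aware.
    """
    if dfs_dir_aware:
        # The configured value is ignored; count the datanode data dirs instead.
        return float(len(datanode_data_dirs))
    return float(configured_disks)


def leaf_task_slots(worker_disks, disk_slot_per_task):
    """Concurrent leaf tasks under a disk-based policy (assumed formula).

    disk_slot_per_task stands in for tajo.task.disk-slot.default.
    """
    if worker_disks <= 0 or disk_slot_per_task <= 0:
        raise ValueError("disk values must be positive")
    # The small epsilon guards against floating-point rounding before flooring.
    return math.floor(worker_disks / disk_slot_per_task + 1e-9)


# Hypothetical worker: 4 configured disks, each task consumes half a disk.
print(leaf_task_slots(effective_disks(4, False, []), 0.5))

# Same worker with dfs-dir-aware enabled and 6 (made-up) datanode data dirs.
dirs = ["/data%d/dfs" % i for i in range(1, 7)]
print(leaf_task_slots(effective_disks(4, True, dirs), 0.5))
```

Under these assumptions, raising the per-task disk slot lowers concurrency per disk, and lowering it (for example on SAS storage or SSDs) raises it.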



--
This message was sent by Atlassian JIRA
(v6.2#6252)
