[jira] [Commented] (PIG-3365) Run as uber job if there is only one input split

Lorand Bendig (JIRA) Tue, 24 Jun 2014 15:25:19 -0700

    [ 
https://issues.apache.org/jira/browse/PIG-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14042783#comment-14042783
 ]


Lorand Bendig commented on PIG-3365:
------------------------------------

[~chitnis], thanks for the suggestions. I'll modify the patch so that 
properties will be extracted into static variables. The property 
{{dfs.block.size}} is deprecated in Hadoop2, {{dfs.blocksize}} is used instead. 
Initially I used this property to retrieve the block size but I remarked that 
MAPREDUCE-3610 changed the way how blocksize is determined and also checked 
Hadoop's {{JobImpl#makeUberDecision}} and followed that. I'm not quite sure 
whether dfs.blocksize returns the current blocksize if the file system isn't 
hdfs (E.g: for s3n there's {{fs.s3n.block.size}}). I think if we get blocksize 
from the current fs implementation that might be safer. What do you think?

> Run as uber job if there is only one input split
> ------------------------------------------------
>
>                 Key: PIG-3365
>                 URL: https://issues.apache.org/jira/browse/PIG-3365
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Rohini Palaniswamy
>            Assignee: Lorand Bendig
>              Labels: Performance
>         Attachments: PIG-3365.patch
>
>
> Hadoop 2 has support for uber mode (mapreduce.job.ubertask.enable=true) which 
> runs the map and reduce on Application Master itself and reduces the overhead 
> of launching a separate map/reduce task. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (PIG-3365) Run as uber job if there is only one input split

Reply via email to