[ 
https://issues.apache.org/jira/browse/TEZ-4137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17072456#comment-17072456
 ] 

Mustafa Iman commented on TEZ-4137:
-----------------------------------

cc: [~gopalv], [~ashutoshc], [~rzhappy]

> Input/Output processors should merge payload to local conf
> ----------------------------------------------------------
>
>                 Key: TEZ-4137
>                 URL: https://issues.apache.org/jira/browse/TEZ-4137
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Mustafa Iman
>            Assignee: Mustafa Iman
>            Priority: Major
>         Attachments: TEZ-4137.1.patch
>
>
> This patch introduces config merging to various Input and Output processors. 
> As described in https://issues.apache.org/jira/browse/TEZ-4073 , we need to 
> reduce the size of the configuration objects transferred over the wire. There 
> are two improvements we are planning to do regarding to that:
>  # Skip sending default configs and configuration coming from xml files in 
> payload
>  # Send dag, vertex and session configurations in layers instead of sending 
> dag + vertex + session configs all together three times.
> In order to achieve these,
>  * We need to expose local config coming from configuration files to 
> TaskContext.
>  * Input/Output processors must merge the config from user payload to local 
> config in their TaskContext
> This is the configuration merging part. After this is merged, corresponding 
> changes should be made on Hive side to prevent sending redundant configs. 
> Until Hive side is updated, changes here are only overhead because all the 
> config objects are the same and they have all the config options anyway.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to