[
https://issues.apache.org/jira/browse/TEZ-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339396#comment-14339396
]
Rohini Palaniswamy commented on TEZ-2144:
-----------------------------------------
bq. Is there a reason to not run everything in non-session mode ?
Currently the code does not differentiate between grunt mode and script
mode. Also a single script can launch more than 1 DAG based on things like
merge join, embedded Pig in parallel mode, presence of exec, shell or fs
commands. Except for the parallel mode others will be serial. Launching
different applications for them would then be same as mapreduce putting them
back in queue and waiting for resources to be available to be launched.
> Compressing MRInput Split Distributor payload
> ---------------------------------------------
>
> Key: TEZ-2144
> URL: https://issues.apache.org/jira/browse/TEZ-2144
> Project: Apache Tez
> Issue Type: Improvement
> Reporter: Rohini Palaniswamy
>
> Pig sets the input split information in user payload and when running against
> a table with 10s of 1000s of partitions, DAG submission fails with
> java.io.IOException: Requested data length 305844060 is longer than maximum
> configured RPC length 67108864
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)