[
https://issues.apache.org/jira/browse/PIG-3743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948109#comment-13948109
]
Cheolsoo Park commented on PIG-3743:
------------------------------------
# I see 10% slower in the current head + PIG-3743-fix-skew.patch compared to
before. I don't have any specific analysis on what's contributed to it.
# {quote}Are you asking for committing PIG-3743-fix-skew.patch and then address
POValueOuputTez in a separate jira? I don't have a problem with that.{quote}
Good. I'll do that. Thanks!
# {quote}It might be overhead, but still should be better than having
duplicating value in key and doubling the memory requirements.{quote}
A simple alternative is to set value to null in POLR and let POGroupInputTez
retrieve key. Do you think this is fine? Here is the implementation-
https://github.com/piaozhexiu/apache-pig/commit/00f671d563ada56ece50493dd1d24aa5587c8171
> Use VertexGroup and Alias vertex for union
> ------------------------------------------
>
> Key: PIG-3743
> URL: https://issues.apache.org/jira/browse/PIG-3743
> Project: Pig
> Issue Type: Sub-task
> Components: tez
> Reporter: Rohini Palaniswamy
> Assignee: Cheolsoo Park
> Fix For: tez-branch
>
> Attachments: PIG-3743-1.patch, PIG-3743-2.patch,
> PIG-3743-fix-skew.patch
>
>
--
This message was sent by Atlassian JIRA
(v6.2#6252)