[
https://issues.apache.org/jira/browse/PIG-4874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15279132#comment-15279132
]
Daniel Dai commented on PIG-4874:
---------------------------------
One question I have initially is why we change replicates from array to List.
[~rohini] clarified with me offline this is because Java array does not work
with generics. By changing to List, we can make better use of generic types.
+1.
> Remove schema tuple reference overhead for replicate join hashmap
> -----------------------------------------------------------------
>
> Key: PIG-4874
> URL: https://issues.apache.org/jira/browse/PIG-4874
> Project: Pig
> Issue Type: Improvement
> Reporter: Rohini Palaniswamy
> Assignee: Rohini Palaniswamy
> Fix For: 0.16.0
>
> Attachments: PIG-4874-1.patch, PIG-4874-2.patch
>
>
> Currently even if pig.schematuple is set to false which is the default, the
> usage of TupleToMapKey and TuplesToSchemaTupleList instead of plain
> HashMap<Object, ArrayList<Tuple>> costs a lot of memory. Also key is
> currently converted to a tuple which is unnecessary.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)