[
https://issues.apache.org/jira/browse/PIG-2163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Zhijie Shen updated PIG-2163:
-----------------------------
Attachment: PIG-2163.patch
Attached is the patch for this issue. Assume there are n bags as input. POCross
now creates only n - 1 temporary bags.
The general logic is to iterate over the tuples of the first bag (the left-most
one in the bag list) and merge each tuple with every combination of the tuples
stored in the n - 1 temporary bags. The first bag is the one iterated separately
so that the cross product comes out in the same order as it did with n
temporary bags.
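
The logic described above can be sketched roughly as follows. This is only an
illustration, not the code in the patch: StreamingCrossSketch and cross() are
hypothetical names, plain Java lists stand in for Pig bags and tuples, and the
output is collected into a list only to keep the example small. It also assumes
that "the same order" means the left-most bag varies slowest.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

// Hypothetical sketch, not Pig's POCross: streams the first input and
// keeps only the remaining n - 1 inputs materialized in memory.
final class StreamingCrossSketch {

    static <T> List<List<T>> cross(Iterator<T> streamed, List<List<T>> materialized) {
        List<List<T>> out = new ArrayList<>();
        // An empty bag on any side makes the whole cross product empty.
        for (List<T> bag : materialized) {
            if (bag.isEmpty()) {
                return out;
            }
        }
        // Outer loop: one pass over the streamed (left-most) bag.
        while (streamed.hasNext()) {
            T head = streamed.next();
            // One index per materialized bag, advanced like an odometer.
            int[] idx = new int[materialized.size()];
            boolean more = true;
            while (more) {
                List<T> row = new ArrayList<>(materialized.size() + 1);
                row.add(head);
                for (int i = 0; i < idx.length; i++) {
                    row.add(materialized.get(i).get(idx[i]));
                }
                // A real operator would emit the row here instead of buffering it.
                out.add(row);
                // Advance the right-most index first, carrying to the left.
                int pos = idx.length - 1;
                while (pos >= 0 && ++idx[pos] == materialized.get(pos).size()) {
                    idx[pos] = 0;
                    pos--;
                }
                more = pos >= 0;
            }
        }
        return out;
    }
}
```

Because the streamed input is the outermost loop, the rows appear in the same
order as nested loops over all n bags would produce, while the first bag is
never copied.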
> Improve nested cross to stream one relation
> -------------------------------------------
>
> Key: PIG-2163
> URL: https://issues.apache.org/jira/browse/PIG-2163
> Project: Pig
> Issue Type: Improvement
> Components: impl
> Affects Versions: 0.10
> Reporter: Daniel Dai
> Assignee: Zhijie Shen
> Fix For: 0.10
>
> Attachments: PIG-2163.patch
>
>
> PIG-1916 added nested cross support for Pig. One optimization is that, instead
> of materializing all bags before producing the result, we can stream one of
> the inputs to save memory.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira