[ https://issues.apache.org/jira/browse/PIG-4876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15247024#comment-15247024 ]
Xianda Ke commented on PIG-4876:
--------------------------------
The expected result of D:
{code}
((1,1),{((1,1),{(1,1)},{(1,1)})})
((1,2),{((1,2),{(1,2)},{(1,2)})})
((1,3),{((1,3),{(1,3)},{(1,3)})})
((2,1),{((2,1),{(2,1)},{(2,1)})})
((2,2),{((2,2),{(2,2)},{(2,2)})})
((2,3),{((2,3),{(2,3)},{(2,3)})})
((3,1),{((3,1),{(3,1)},{(3,1)})})
((3,2),{((3,2),{(3,2)},{(3,2)})})
((3,3),{((3,3),{(3,3)},{(3,3)})})
{code}
> OutputConsumeIterator can't handle the last buffered tuples for some Operators
> ------------------------------------------------------------------------------
>
> Key: PIG-4876
> URL: https://issues.apache.org/jira/browse/PIG-4876
> Project: Pig
> Issue Type: Sub-task
> Components: spark
> Reporter: Xianda Ke
> Assignee: Xianda Ke
> Fix For: spark-branch
>
>
> Some operators, such as MergeCogroup, Stream, and CollectedGroup, buffer some
> input records in order to construct their result tuples, so the last result
> tuples remain buffered in the operator. These operators need a flag that
> indicates the end of input so that they can flush the buffer and emit their
> last tuples.
> Currently, the flag 'parentPlan.endOfAllInput' is used to flush the
> buffered tuples in MR mode, but it does not work with OutputConsumeIterator
> in Spark mode.
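
To illustrate the problem described above, here is a minimal sketch of the flush-on-end-of-input idea. It is not Pig's actual code: CollectedGroupSketch, FlushingIterator and EndOfInputSketch are hypothetical names, and the operator is reduced to grouping consecutive equal integer keys. The point is only that the consuming iterator has to signal end of input exactly once, otherwise the last buffered group is lost.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;

// Hypothetical stand-in for a buffering operator (not a Pig class).
// It groups consecutive equal keys and only emits a group once it
// sees a different key, so the last group stays in the buffer.
class CollectedGroupSketch {
    private List<Integer> buffer = new ArrayList<>();

    // Returns a completed group, or null if the input was only buffered.
    List<Integer> process(int key) {
        if (!buffer.isEmpty() && buffer.get(0) != key) {
            List<Integer> done = buffer;
            buffer = new ArrayList<>();
            buffer.add(key);
            return done;
        }
        buffer.add(key);
        return null;
    }

    // Called once at end of input; returns the last buffered group, if any.
    List<Integer> flush() {
        if (buffer.isEmpty()) {
            return null;
        }
        List<Integer> done = buffer;
        buffer = new ArrayList<>();
        return done;
    }
}

// Hypothetical iterator that drives the operator and flushes it
// exactly once when the upstream input is exhausted.
class FlushingIterator implements Iterator<List<Integer>> {
    private final Iterator<Integer> input;
    private final CollectedGroupSketch op;
    private List<Integer> next;
    private boolean flushed = false;

    FlushingIterator(Iterator<Integer> input, CollectedGroupSketch op) {
        this.input = input;
        this.op = op;
    }

    @Override
    public boolean hasNext() {
        while (next == null) {
            if (input.hasNext()) {
                next = op.process(input.next());
            } else if (!flushed) {
                flushed = true;      // plays the role of the end-of-input flag
                next = op.flush();
            } else {
                return false;
            }
        }
        return true;
    }

    @Override
    public List<Integer> next() {
        if (!hasNext()) {
            throw new NoSuchElementException();
        }
        List<Integer> result = next;
        next = null;
        return result;
    }
}

class EndOfInputSketch {
    // Runs the operator over the whole input and collects every group.
    static List<List<Integer>> drain(List<Integer> input) {
        List<List<Integer>> out = new ArrayList<>();
        Iterator<List<Integer>> it =
                new FlushingIterator(input.iterator(), new CollectedGroupSketch());
        while (it.hasNext()) {
            out.add(it.next());
        }
        return out;
    }
}
```

Without the flush branch in hasNext(), draining the keys 1, 1, 2, 2, 3 would yield only the groups for 1 and 2, and the group for 3 would never leave the buffer.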