Travis Woodruff created PIG-4001:
------------------------------------
Summary: POPartialAgg aggregates too aggressively when multiple
values aggregated
Key: PIG-4001
URL: https://issues.apache.org/jira/browse/PIG-4001
Project: Pig
Issue Type: Bug
Affects Versions: 0.13.0
Reporter: Travis Woodruff
Assignee: Travis Woodruff
Priority: Minor
{{POPartialAgg.aggregateRawRow()}} is counting values in
{{numEntriesInTarget}}, but all the values that {{numEntriesInTarget}} are
compared to count tuples.
This is similar to PIG-3649 but has less of a performance impact, as it
generally does not cause map-side aggregation to be disable entirely (since
{{aggregateRawRow()}} is unlikely to be called before {{checkSizeReduction()}};
however, it can cause second-level aggregation to be run too often.
--
This message was sent by Atlassian JIRA
(v6.2#6252)