Travis Woodruff created PIG-4001:
------------------------------------

             Summary: POPartialAgg aggregates too aggressively when multiple 
values aggregated
                 Key: PIG-4001
                 URL: https://issues.apache.org/jira/browse/PIG-4001
             Project: Pig
          Issue Type: Bug
    Affects Versions: 0.13.0
            Reporter: Travis Woodruff
            Assignee: Travis Woodruff
            Priority: Minor


{{POPartialAgg.aggregateRawRow()}} counts values in 
{{numEntriesInTarget}}, but every quantity that {{numEntriesInTarget}} is 
compared against counts tuples.

This is similar to PIG-3649 but has less of a performance impact: it 
generally does not cause map-side aggregation to be disabled entirely, since 
{{aggregateRawRow()}} is unlikely to be called before {{checkSizeReduction()}}. 
However, it can cause second-level aggregation to run too often.





--
This message was sent by Atlassian JIRA
(v6.2#6252)
