PERFORMANCE: Distinct should make use of combiner to remove duplicate values
from keys.
----------------------------------------------------------------------------------------
Key: PIG-450
URL: https://issues.apache.org/jira/browse/PIG-450
Project: Pig
Issue Type: Bug
Components: impl
Affects Versions: types_branch
Reporter: Alan Gates
Assignee: Alan Gates
Fix For: types_branch
In 2.0 distinct was improved by removing values in the map and just passing an
empty tuple along with the key. This can be further improved by adding a
combiner step that passes along only the first empty tuple instead of all of
them.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.