PERFORMANCE: removing keys from the value
-----------------------------------------
Key: PIG-465
URL: https://issues.apache.org/jira/browse/PIG-465
Project: Pig
Issue Type: Improvement
Affects Versions: types_branch
Reporter: Olga Natkovich
Fix For: types_branch
Currently, reducers receive the key data twice: once in the key and once in the
value. If the grouping key makes up a large part of the value, this causes
significant data duplication and performance loss.
The key should not be sent as part of the value. Instead, metadata should be
used to assist in reconstructing the row from the key and the remaining data.
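
A minimal sketch of the proposal, in plain Python rather than Pig's actual
map/reduce code. It assumes the metadata is simply the list of column
positions that make up the grouping key; the names split_row, rebuild_row and
key_positions are hypothetical and are not taken from the Pig code base.

```python
from typing import Sequence, Tuple


def split_row(row: Sequence, key_positions: Sequence[int]) -> Tuple[tuple, tuple]:
    """Map side (illustrative): separate the key fields from the rest of the row.

    key_positions is the assumed metadata: distinct column indexes of the
    grouping key, in key order.
    """
    key_set = set(key_positions)
    key = tuple(row[i] for i in key_positions)
    # The value carries only the columns that are NOT part of the key.
    rest = tuple(v for i, v in enumerate(row) if i not in key_set)
    return key, rest


def rebuild_row(key: Sequence, rest: Sequence, key_positions: Sequence[int]) -> tuple:
    """Reduce side (illustrative): put the key fields back at their original positions."""
    width = len(key) + len(rest)
    row = [None] * width
    # Place each key field at the column recorded in the metadata.
    for field, pos in zip(key, key_positions):
        row[pos] = field
    # Fill the remaining columns, in order, from the stripped value.
    key_set = set(key_positions)
    rest_iter = iter(rest)
    for i in range(width):
        if i not in key_set:
            row[i] = next(rest_iter)
    return tuple(row)


if __name__ == "__main__":
    original = ("alice", 30, "NYC", 5)
    positions = [2, 0]  # group by (city, name)
    key, rest = split_row(original, positions)
    print(key, rest)
    print(rebuild_row(key, rest, positions) == original)
```

With this layout the key fields travel only once (in the key), and the
reducer needs just the small, per-job position list to restore each row.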