[
https://issues.apache.org/jira/browse/HADOOP-3684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris Douglas updated HADOOP-3684:
----------------------------------
Description:
Currently, the framework uses serialization/deserialization to clone the values
passed to the resuce function.
This amounts to a very heavy weight deep copy of the value objects.
That is way too expensive. Although that may be a generic way to work for all
possible value classes, thus good as a default way,
the framework should allow the user to implemet an application specific yet
efficient cloning function.
was:
Currently, the framework uses serialization/deserialization to clone the values
passed to the resuce function.
This amounts to a very heavy weight deep copy of the value objects.
That is way too expensive. Although that may be a generic way to work for all
possible value classes, thus good as a default way,
the framework should allow the user to implemet an application specific yet
efficient cloning function.
Assignee: Runping Qi
Hadoop Flags: [Reviewed]
+1
> The data_join should allow the user to implement a customer cloning function
> ----------------------------------------------------------------------------
>
> Key: HADOOP-3684
> URL: https://issues.apache.org/jira/browse/HADOOP-3684
> Project: Hadoop Core
> Issue Type: Bug
> Components: mapred
> Reporter: Runping Qi
> Assignee: Runping Qi
> Fix For: 0.19.0
>
> Attachments: H-3684.txt
>
>
> Currently, the framework uses serialization/deserialization to clone the
> values passed to the resuce function.
> This amounts to a very heavy weight deep copy of the value objects.
> That is way too expensive. Although that may be a generic way to work for all
> possible value classes, thus good as a default way,
> the framework should allow the user to implemet an application specific yet
> efficient cloning function.
>
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.