[
https://issues.apache.org/jira/browse/CRUNCH-90?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Josh Wills updated CRUNCH-90:
-----------------------------
Attachment: CRUNCH-90-reflect.patch
Attached a patch that fixes the PageRankClassTest. In order to do it, I had to
besmirch the really elegant DeepCopier infrastructure (like, seriously
elegant-- I haven't looked at it that closely before and it was a joy to read)
by passing Configuration objects all over the place so that the Avro deepCopy's
will use the correct instance of ReflectData-- the Scrunch stuff has to use a
different version than the Java stuff.
> Object reuse is not accounted for in mapper fusion
> --------------------------------------------------
>
> Key: CRUNCH-90
> URL: https://issues.apache.org/jira/browse/CRUNCH-90
> Project: Crunch
> Issue Type: Bug
> Reporter: Gabriel Reid
> Assignee: Gabriel Reid
> Fix For: 0.4.0
>
> Attachments: CRUNCH-90.patch, CRUNCH-90-reflect.patch
>
>
> When multiple DoFns are run over the same output (i.e. in the case of mapper
> fusion), the same value object is passed to multiple underlying DoFns. If the
> state of that value object is changed by one DoFn, other DoFns are called
> with the updated object.
> This is a situation that can happen quite easily when the input of a DoFn is
> simply updated and then emitted. In general, this bug will only affect values
> whose type is the same as the underlying serialization type.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira