[ 
https://issues.apache.org/jira/browse/CRUNCH-501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14334557#comment-14334557
 ] 

Gabriel Reid commented on CRUNCH-501:
-------------------------------------

+1 to the patch. It seems unfortunate that Aggregators.toCombineFn is public 
(and so necessitates deprecation), but I guess there's no easy way around that.

About providing the PType for every DoFn, one problem I can see is that for 
shared instances of DoFns (Identity.getInstance for example), there would be an 
issue because multiple PTypes are used by the same instance. I'm not sure how 
often that would come up, but at least in terms of IdentityFn I believe it 
would break.

[~aj987] is the use case for having access to the PType only related to 
detaching values, or are there other reasons to need access to a PType in a 
DoFn?

> Object reuse issue in combineValues(Aggregator)
> -----------------------------------------------
>
>                 Key: CRUNCH-501
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-501
>             Project: Crunch
>          Issue Type: Bug
>            Reporter: Brandon Davis
>         Attachments: CRUNCH-501.patch
>
>
> I'm trying to use combineValues on a PGroupedTable. I am using 
> Aggregators.FIRST_N. If I have 20 keys in my PGroupedTable, then I only get 
> 20 distinct values because the AggregatorCombineFn and FirstNAggregator don't 
> detach the values from the incoming iterator.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to