[ https://issues.apache.org/jira/browse/CRUNCH-501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14334557#comment-14334557 ]
Gabriel Reid commented on CRUNCH-501: ------------------------------------- +1 to the patch. It seems unfortunate that Aggregators.toCombineFn is public (and so necessitates deprecation), but I guess there's no easy way around that. About providing the PType for every DoFn, one problem I can see is that for shared instances of DoFns (Identity.getInstance for example), there would be an issue because multiple PTypes are used by the same instance. I'm not sure how often that would come up, but at least in terms of IdentityFn I believe it would break. [~aj987] is the use case for having access to the PType only related to detaching values, or are there other reasons to need access to a PType in a DoFn? > Object reuse issue in combineValues(Aggregator) > ----------------------------------------------- > > Key: CRUNCH-501 > URL: https://issues.apache.org/jira/browse/CRUNCH-501 > Project: Crunch > Issue Type: Bug > Reporter: Brandon Davis > Attachments: CRUNCH-501.patch > > > I'm trying to use combineValues on a PGroupedTable. I am using > Aggregators.FIRST_N. If I have 20 keys in my PGroupedTable, then I only get > 20 distinct values because the AggregatorCombineFn and FirstNAggregator don't > detach the values from the incoming iterator. -- This message was sent by Atlassian JIRA (v6.3.4#6332)