[ 
https://issues.apache.org/jira/browse/FLINK-6388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15986851#comment-15986851
 ] 

ASF GitHub Bot commented on FLINK-6388:
---------------------------------------

Github user fhueske commented on the issue:

    https://github.com/apache/flink/pull/3783
  
    Thanks for this PR @huawei-flink! 
    
    I think I made a mistake when I suggested using the code-gen'd functions 
with registered `MapState` to compute distinct window aggregations. Originally, 
I thought it would be possible to register state (i.e., the `MapState` for the 
distinct values) in an `AggregateFunction` (which is used for the grouped 
window aggregates). However, as I learned today, that is unfortunately not 
possible: all state of an `AggregateFunction` must be contained in the accumulator.
    
    What does this mean? We cannot use the current approach of registering 
`MapState` in the code-gen'd function for grouped window aggregates, so we 
would need another approach for that.
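    
    To illustrate the constraint, here is a minimal sketch of a distinct count 
whose entire state lives in the accumulator. It does not use the Flink API: 
`SimpleAggregateFunction` is a hypothetical stand-in that only mirrors the 
general shape of an aggregate function (create, add, get result, merge), and 
`DistinctCount` is an illustrative name. It shows one possible direction, not 
necessarily the approach this PR should take.

```java
import java.util.HashSet;
import java.util.Set;

public class DistinctCountSketch {

    // Hypothetical stand-in interface (an assumption for this sketch, not Flink's
    // actual AggregateFunction). There is no hook for registering external state,
    // so everything the function needs must be carried by the accumulator.
    interface SimpleAggregateFunction<IN, ACC, OUT> {
        ACC createAccumulator();
        ACC add(IN value, ACC accumulator);
        OUT getResult(ACC accumulator);
        ACC merge(ACC a, ACC b);
    }

    // Distinct count: the set of values seen so far is the accumulator itself.
    static final class DistinctCount
            implements SimpleAggregateFunction<String, Set<String>, Long> {

        @Override
        public Set<String> createAccumulator() {
            return new HashSet<>();
        }

        @Override
        public Set<String> add(String value, Set<String> accumulator) {
            accumulator.add(value); // duplicates are ignored by the set
            return accumulator;
        }

        @Override
        public Long getResult(Set<String> accumulator) {
            return (long) accumulator.size();
        }

        @Override
        public Set<String> merge(Set<String> a, Set<String> b) {
            a.addAll(b); // the union keeps the distinct semantics across merges
            return a;
        }
    }

    public static void main(String[] args) {
        DistinctCount fn = new DistinctCount();
        Set<String> acc = fn.createAccumulator();
        for (String v : new String[] {"a", "b", "a", "c", "b"}) {
            acc = fn.add(v, acc);
        }
        System.out.println(fn.getResult(acc)); // prints 3
    }
}
```

    The trade-off is that the accumulator grows with the number of distinct 
values in the window, since the whole set is carried inside it.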
    
    However, we can still use your code for distinct over windows 
(`ProcessFunction` can obviously register state) once the API supports 
defining DISTINCT aggregates.
    
    I'll try to have a closer look at this PR soon.
    
    Best, Fabian


> Add support for DISTINCT into Code Generated Aggregations
> ---------------------------------------------------------
>
>                 Key: FLINK-6388
>                 URL: https://issues.apache.org/jira/browse/FLINK-6388
>             Project: Flink
>          Issue Type: Sub-task
>          Components: DataStream API
>    Affects Versions: 1.3.0
>            Reporter: Stefano Bortoli
>            Assignee: Stefano Bortoli
>             Fix For: 1.3.0
>
>
> We should support DISTINCT in Code Generated aggregation functions.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
