[jira] [Commented] (FLINK-1085) Unnecessary failing of GroupReduceCombineDriver

Fabian Hueske (JIRA) Tue, 02 Sep 2014 04:44:09 -0700

    [ 
https://issues.apache.org/jira/browse/FLINK-1085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14118108#comment-14118108
 ]


Fabian Hueske commented on FLINK-1085:
--------------------------------------

The same applies to the {{SynchronousChainedCombineDriver}}.

> Unnecessary failing of GroupReduceCombineDriver
> -----------------------------------------------
>
>                 Key: FLINK-1085
>                 URL: https://issues.apache.org/jira/browse/FLINK-1085
>             Project: Flink
>          Issue Type: Bug
>          Components: Local Runtime
>    Affects Versions: 0.7-incubating, 0.6.1-incubating
>            Reporter: Fabian Hueske
>              Labels: starter
>
> With a recent update (commit cbbcf7820885a8a9734ffeba637b0182a6637939) the 
> GroupReduceCombineDriver was changed to not use an asynchronous partial 
> sorter. Instead, the driver fills a sort buffer with records, sorts it, 
> combines them, clears the buffer, and continues to fill it again.
> The GroupReduceCombineDriver fails if a record cannot be serialized into an 
> empty sort buffer, i.e., if the record is too large for the buffer.
> Alternatively, we should emit a WARN message for the first record that is too 
> large and just forward all records which do not fit into the empty sort 
> buffer (maybe continue to count how many records were simply forwarded and 
> give a second WARN message with this statistic).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (FLINK-1085) Unnecessary failing of GroupReduceCombineDriver

Reply via email to