[ 
https://issues.apache.org/jira/browse/FLINK-795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stephan Ewen resolved FLINK-795.
--------------------------------

       Resolution: Fixed
    Fix Version/s:     (was: pre-apache)
                   0.6-incubating

Implemented in 99c888c7b775e1c3c910b9b4a45bbae5102bac14 and 
03c6160e47bc2ab613a2734388aa22f83323e6a5

> Possibly extend the cost model of the optimizer
> -----------------------------------------------
>
>                 Key: FLINK-795
>                 URL: https://issues.apache.org/jira/browse/FLINK-795
>             Project: Flink
>          Issue Type: Bug
>            Reporter: GitHub Import
>            Assignee: Markus Holzemer
>              Labels: github-import
>             Fix For: 0.6-incubating
>
>
> I have started the task to integrate the AbstractCachedBuildSideMatchDriver 
> into the optimizer. The driver caches one side of the join and thereby can 
> accellerate iterations if there are joins with static (non-changing) datasets 
> inside the iteration.
> The current way of calculating the cost of operators inside of iterations is 
> basically to multiply them by the number of iterations. I would like to 
> propose to extend this to have one static part of costs, that is counted only 
> once for all iterations, and one dynamic part that is multiplied by the 
> number of iterations.
> In my opinion that would be the cleanest way to intergrate the cached match, 
> by assigning it a higher starting cost then the regular match and a cheaper 
> dynamic part.
> One other approach would be to always use the cached match inside of 
> iterations. For that I would probably have to add a new 
> RequestedLocalProperty that tells the optimizer if the operator is used 
> inside of a iteration.
> A simple hacked solution could also be to simply exchange all suitable 
> regular matches inside of an iteration by the cached alternative.
> What do you think is the best approach?
> ---------------- Imported from GitHub ----------------
> Url: https://github.com/stratosphere/stratosphere/issues/795
> Created by: [markus-h|https://github.com/markus-h]
> Labels: 
> Created at: Mon May 12 18:51:51 CEST 2014
> State: open



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to