[ 
https://issues.apache.org/jira/browse/FLINK-23959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dong Lin updated FLINK-23959:
-----------------------------
    Description: 
The FLIP design doc can be found at 
https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=181311363.

The existing Flink ML library allows users to compose an Estimator/Transformer 
from a pipeline (i.e. linear sequence) of Estimator/Transformer. Users only 
need to construct this Pipeline once and generate the corresponding 
PipelineModel, without having to explicitly construct the fitted PipelineModel 
as a linear sequence of stages. However, in order to train a DAG of 
Estimator/Transformer and uses the trained model for inference, users currently 
need to construct the DAG twice, once for the training logic and once for the 
inference logic. This experience is inferior to the experience of training and 
using a chain of Estimator/Transformer. In addition to requiring more work from 
users, this approach is more error prone because the DAG for the training logic 
may be inconsistent from the DAG for the inference logic.

In order to address the issues described above, we propose to add several 
helper classes that allow users to compose Estimator/Transformer/AlgoOperator 
from a DAG of Estimator/Transformer/AlgoOperator.

  was:
The existing Flink ML library allows users to compose an Estimator/Transformer 
from a pipeline (i.e. linear sequence) of Estimator/Transformer. Users only 
need to construct this Pipeline once and generate the corresponding 
PipelineModel, without having to explicitly construct the fitted PipelineModel 
as a linear sequence of stages.

However, in the use-case that needs a DAG of Estimator/Transformer, users 
currently needs to separately build the DAG separately, once for the training 
logic and once for the inference logic. This experience is inferior to the 
cases supported by the Pipeline.

To improve the user experience, we propose to add several helper classes that 
allow users to compose Estimator/Transformer/AlgoOperator from a DAG of 
Estimator/Transformer/AlgoOperator.


> FLIP-175: Add GraphBuilder to compose Estimator/Transformer/AlgoOperator from 
> a DAG of stages
> ---------------------------------------------------------------------------------------------
>
>                 Key: FLINK-23959
>                 URL: https://issues.apache.org/jira/browse/FLINK-23959
>             Project: Flink
>          Issue Type: Improvement
>            Reporter: Dong Lin
>            Priority: Major
>
> The FLIP design doc can be found at 
> https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=181311363.
> The existing Flink ML library allows users to compose an 
> Estimator/Transformer from a pipeline (i.e. linear sequence) of 
> Estimator/Transformer. Users only need to construct this Pipeline once and 
> generate the corresponding PipelineModel, without having to explicitly 
> construct the fitted PipelineModel as a linear sequence of stages. However, 
> in order to train a DAG of Estimator/Transformer and uses the trained model 
> for inference, users currently need to construct the DAG twice, once for the 
> training logic and once for the inference logic. This experience is inferior 
> to the experience of training and using a chain of Estimator/Transformer. In 
> addition to requiring more work from users, this approach is more error prone 
> because the DAG for the training logic may be inconsistent from the DAG for 
> the inference logic.
> In order to address the issues described above, we propose to add several 
> helper classes that allow users to compose Estimator/Transformer/AlgoOperator 
> from a DAG of Estimator/Transformer/AlgoOperator.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to