Re: [DISCUSS] Embracing Table API in Flink ML

2018-12-04 Thread Weihua Jiang
It has pasted a while and I think we can move forward to JIRA discussion. I will try to split the design into smaller pieces to make it more understandable. Actually, I have already implemented an initial version and ported some flink.ml algorithms using this new API. Thus, we can have a better b

Re: [DISCUSS] Embracing Table API in Flink ML

2018-11-20 Thread Chen Qin
Hi Yun, Very excited to see Flink ML forward! There are many touch points your document touched. I couldn't agree more the value of having a (unified) table API could bring to Flink ecosystem towards running ML workload. Most ML pipelines we observed starts from single box python scripts or adhoc

Re: [DISCUSS] Embracing Table API in Flink ML

2018-11-20 Thread Weihua Jiang
Hi Yun, Can't wait to see your design. Thanks Weihua Yun Gao 于2018年11月21日周三 上午12:43写道: > Hi Weihua, > > Thanks for the exciting proposal! > > I have quickly read through it, and I really appropriate the idea of > providing the ML Pipeline API similar to the commonly used library > sci

Re: [DISCUSS] Embracing Table API in Flink ML

2018-11-20 Thread Weihua Jiang
Hi Shaoxuan, You are perfectly right. What I want to achieve is a combination of all your 3 points. Let me rephrase here: 1. Define a Table based ML Pipeline interface to have the same functionality as current DataSet based implementations. 2. Support new features like online learning, streaming i

Re: [DISCUSS] Embracing Table API in Flink ML

2018-11-20 Thread Weihua Jiang
HI Becket, Thanks a lot for the Table API enhancement design doc. I am working on some simple ML algorithm using this new ML pipeline. Will feedback you if there is any Table enhancement needed. Thanks Weihua Becket Qin 于2018年11月20日周二 下午10:43写道: > Hi Weihua, > > Thanks for the well written

Re: [DISCUSS] Embracing Table API in Flink ML

2018-11-20 Thread Weihua Jiang
Hi Jincheng, Thanks a lot for the warm feedback. I've already read your Table API enhancement google doc. Those enhancements are essential to implement any ML/DL algorithm on Table API. Our two designs are perfectly complementary to each other. :) Will add a section in my google doc for the impl

Re: [DISCUSS] Embracing Table API in Flink ML

2018-11-20 Thread Shaoxuan Wang
Hi Weihua, Thanks for the proposal. I have quickly read through it. It looks great. A quick question. Do you consider changing the ML Lib (implementation of Estimator/Predictor/Transformer) also on top of the tableAPI? I will be very happy if this is also included in the scope. It is not easy and

Re: [DISCUSS] Embracing Table API in Flink ML

2018-11-20 Thread Becket Qin
Hi Weihua, Thanks for the well written design doc! The abstraction of ML pipeline is pretty handy to the AI engineers. As Jincheng mentioned, there is an undergoing effort to enhance the Table API for ML. But it would still be helpful to understand what is missing in Table API to fully support th

Re: [DISCUSS] Embracing Table API in Flink ML

2018-11-20 Thread jincheng sun
Hi Weihua, Thanks for bring up this discuss! I quickly read the google doc,and I fully agree that ML can be well supported on TableAPI (at some stage in the future). In fact, Xiaowei and I have already brought up a discussion on enhancing the Table API. In the first phase, we will add support for