Re: [DISCUSS] Unified Core API for Streaming and Batch

2018-12-07 Thread Haibo Sun
Hi All, Thanks, Aljoscha, for further splitting up the topics. I will start separate threads on each topic that you proposed. Best, Haibo Aljoscha Krettek-2 wrote > Hi All, > > this is a great discussion! (I have some thoughts on most of the topics > but I'll wait for the separate discussion threads

Re: [DISCUSS] Unified Core API for Streaming and Batch

2018-12-07 Thread Aljoscha Krettek
Hi All, this is a great discussion! (I have some thoughts on most of the topics but I'll wait for the separate discussion threads) @Haibo Will you start separate threads? I think the separate discussion topics would be (based on Stephan's mail but further split up): 1. What should the API sta

Re: [DISCUSS] Unified Core API for Streaming and Batch

2018-12-07 Thread Shuai Xu
Hi all, Glad to see the discussion. We are now designing enhancements to the scheduling of batch jobs, and a unified API will help a lot. On Wed, Dec 5, 2018 at 4:45 PM, Haibo Sun wrote: > Hi all, > > Thanks, Kurt; you see more benefits of the unification than I do. > > I quite agree with Kurt's views. DataStream, DataSet and T

Re: [DISCUSS] Unified Core API for Streaming and Batch

2018-12-05 Thread Haibo Sun
Hi all, Thanks, Kurt; you see more benefits of the unification than I do. I quite agree with Kurt's views. DataStream, DataSet and Table remain independent for now, and DataSet is subsumed into DataStream in the future. The collection execution mode is replaced by the mini cluster. The high-level semantic

Re: [DISCUSS] Unified Core API for Streaming and Batch

2018-12-04 Thread Guowei Ma
> they are unified into one stack, the batch jobs can also get benefits > from > > all the improvements. I think it is a very big work but worth doing, left > > some concerns: > > > > 1. The current job graph generation for batch covers complicated > > optimization such as cost-based estimate, plan etc. Would this part also

Re: [DISCUSS] Unified Core API for Streaming and Batch

2018-12-04 Thread Kurt Young
s input selection while reading. I acknowledge these roles for > special batch scenarios, but they do not seem to be blockers for the unification > motivation, because current batch jobs can also work without these > improvements. So the further improvements can be separated into individual > topics after we

Re: [DISCUSS] Unified Core API for Streaming and Batch

2018-12-03 Thread Wang Feng
s after we first reach the unification of stream and batch. Best, Zhijiang -- From: 孙海波 <sunhaib...@163.com> Sent: Monday, December 3, 2018, 10:52 To: dev <dev@flink.apache.org> Subject: [DISCUSS] Unified Core API for Strea

Re: [DISCUSS] Unified Core API for Streaming and Batch

2018-12-03 Thread Stephan Ewen
because current batch jobs can also work without these > improvements. So the further improvements can be separated into individual > topics after we first reach the unification of stream and batch. > > > >Best, > >Zhijiang > > > > > >-

Re: [DISCUSS] Unified Core API for Streaming and Batch

2018-12-03 Thread Haibo Sun
e reaching the >unification of stream and batch firstly. > >Best, >Zhijiang > > >---------------------- >From: 孙海波 >Sent: Monday, December 3, 2018, 10:52 >To: dev >Subject: [DISCUSS] Unified Core API for Streaming and Batch > >Hi all, >This post proposes a unified core API for Streaming and Bat

Re: [DISCUSS] Unified Core API for Streaming and Batch

2018-12-03 Thread jincheng sun
Hi Haibo, Thank you for this great proposal! Flink is a unified computing engine. It has been unified at the Table API and SQL API levels (not yet complete). It would be great if we could unify the DataSet API and DataStream API. I also want to convert to StreamTransformation in the SQL and Table API, bec

Re: [DISCUSS] Unified Core API for Streaming and Batch

2018-12-03 Thread zhijiang
unification of stream and batch firstly. Best, Zhijiang -- From: 孙海波 Sent: Monday, December 3, 2018, 10:52 To: dev Subject: [DISCUSS] Unified Core API for Streaming and Batch Hi all, This post proposes a unified core API for Streaming and Batch. Currently

[DISCUSS] Unified Core API for Streaming and Batch

2018-12-02 Thread 孙海波
Hi all, This post proposes a unified core API for Streaming and Batch. Currently, DataStream and DataSet use separate compilation processes, execution tasks and basic programming models in the runtime layer, which complicates the system implementation. We think that batch jobs can be processed