Re: Python SDK worker / portable Flink runner performance improvements

2018-10-21 Thread Thomas Weise
Regarding the functionality: https://s.apache.org/apache-beam-portability-support-table While we still have a good chunk of work to do, the MVP feature set is in place and allows us to run pipelines. Before we check P2 (feature complete), I would like to see (in addition to what Max mentioned): *

Re: Python SDK worker / portable Flink runner performance improvements

2018-10-19 Thread Kenneth Knowles
This is really cool news. Pretty awesome to move from the "get it to run" phase to the "get it to run faster" phase of this project. Streaming testing: In Java there's a synthetic source (GenerateSequence / CountingSource) for testing. Maybe in this case I'd say porting to py is worth it? Kenn

Re: Python SDK worker / portable Flink runner performance improvements

2018-10-19 Thread Maximilian Michels
Thanks Thomas, I think it is important to start looking at performance and improved test coverage. While we have the basic functionality, there are still state and timers to be implemented for the Portable FlinkRunner. These two will allow full testing/optimization: State:

Re: Python SDK worker / portable Flink runner performance improvements

2018-10-17 Thread Lukasz Cwik
Thanks, this was useful for me since I have been away these past couple of weeks.

On Wed, Oct 17, 2018 at 8:45 AM Thomas Weise wrote:
> Hi,
>
> As you may have noticed, some of the contributors are working on enabling
> the Python support on Flink. The upcoming 2.8 release is going to include
>

Python SDK worker / portable Flink runner performance improvements

2018-10-17 Thread Thomas Weise
Hi, As you may have noticed, some of the contributors are working on enabling the Python support on Flink. The upcoming 2.8 release is going to include much of the functionality and we are now shifting gears to stability and performance. There have been some basic fixes already (logging, memory