I have new findings & subsequently relative improvements.Am testing as we speak. 4 Beam server nodes , Azure A11 & 2 Kafka nodes same config.I had keep state somewhere. I went with Redis. I found it to be a major bottle neck as Beam nodes constantly are going across NW to update its repository.So I replaced Redis with Java Concurrenthashmaps. Must faster. Then Kafka went out of disk space and the replication manager complained. So I clustered the two Kafka nodes hoping for sharing space. As of this second I am typing this email, its sustaining but only 1/2 of the 201401969 tuples have been processed after 3.5 hours.According to the Linear Road benchmarking expectations, if your system is working well, this whole 201401969 tuples must be done in 3.5 hrs max.So this means there is still room for tuning Flink nodes. I have already shared with you all more details about my config.It run perfect yesterday with almost 1/10th of this load. Perfect real-time send/processed streaming behavior.If thats the case & I cannot get better performance with FlinkRunner, my nest stop is SparkRunner and repeat of the whole thing for final benchmarking of the two under Beam APIs.Which was the initial intent anyways.If you have suggestions to make improvements in the above case, I am all ears & greatly appreciate it.Cheers,Amir-
From: "Chawla,Sumit" <sumitkcha...@gmail.com> To: dev@flink.apache.org; amir bahmanyari <amirto...@yahoo.com> Sent: Sunday, September 18, 2016 2:07 PM Subject: Re: Performance and Latency Chart for Flink Has anyone else run these kind of benchmarks? Would love to hear more people'e experience and details about those benchmarks. Regards Sumit Chawla On Sun, Sep 18, 2016 at 2:01 PM, Chawla,Sumit <sumitkcha...@gmail.com> wrote: > Hi Amir > > Would it be possible for you to share the numbers? Also share if possible > your configuration details. > > Regards > Sumit Chawla > > > On Fri, Sep 16, 2016 at 12:18 PM, amir bahmanyari < > amirto...@yahoo.com.invalid> wrote: > >> Hi Fabian,FYI. This is report on other engines we did the same type of >> bench-marking.Also explains what Linear Road bench-marking is.Thanks for >> your help. >> http://www.slideshare.net/RedisLabs/walmart-ibm-revisit-the- >> linear-road-benchmark >> https://github.com/IBMStreams/benchmarks >> https://www.datatorrent.com/blog/blog-implementing-linear-ro >> ad-benchmark-in-apex/ >> >> >> From: Fabian Hueske <fhue...@gmail.com> >> To: "dev@flink.apache.org" <dev@flink.apache.org> >> Sent: Friday, September 16, 2016 12:31 AM >> Subject: Re: Performance and Latency Chart for Flink >> >> Hi, >> >> I am not aware of periodic performance runs for the Flink releases. >> I know a few benchmarks which have been published at different points in >> time like [1], [2], and [3] (you'll probably find more). >> >> In general, fair benchmarks that compare different systems (if there is >> such thing) are very difficult and the results often depend on the use >> case. >> IMO the best option is to run your own benchmarks, if you have a concrete >> use case. >> >> Best, Fabian >> >> [1] 08/2015: >> http://data-artisans.com/high-throughput-low-latency-and-exa >> ctly-once-stream-processing-with-apache-flink/ >> [2] 12/2015: >> https://yahooeng.tumblr.com/post/135321837876/benchmarking- >> streaming-computation-engines-at >> [3] 02/2016: >> http://data-artisans.com/extending-the-yahoo-streaming-benchmark/ >> >> >> 2016-09-16 5:54 GMT+02:00 Chawla,Sumit <sumitkcha...@gmail.com>: >> >> > Hi >> > >> > Is there any performance run that is done for each Flink release? Or you >> > are aware of any third party evaluation of performance metrics for >> Flink? >> > I am interested in seeing how performance has improved over release to >> > release, and performance vs other competitors. >> > >> > Regards >> > Sumit Chawla >> > >> >> >> >> > >