Re: Benchmarking streaming frameworks

Michael Noll Thu, 23 Mar 2017 06:49:35 -0700

A recent one is "Analytics on Fast Data: Main-Memory Database Systems
versus Modern Streaming Systems" (
http://db.in.tum.de/~kipf/papers/fastdata.pdf)


For the record, the paper above doesn't yet cover/realize that, nowadays,
the Kafka project includes native stream processing capabilities aka the
Kafka Streams API.

-Michael


On Thu, Mar 23, 2017 at 2:00 PM, Felix Neutatz <neut...@googlemail.com>
wrote:

> Hi,
>
> our team already created a benchmark framework for batch processing
> (including MR,Yarn, Spark, Flink), maybe you like to extend it for
> streaming: https://github.com/peelframework/peel
>
> Best regards,
> Felix
>
>
> On Mar 23, 2017 11:51, "Christophe Salperwyck" <
> christophe.salperw...@gmail.com> wrote:
>
> Good idea! You could test Akka streams too.
>
> Lots of documents exist:
> https://yahooeng.tumblr.com/post/135321837876/benchmarking-s
> treaming-computation-engines-at
> https://github.com/yahoo/streaming-benchmarks
>
> Cheers,
> Christophe
>
> 2017-03-23 11:09 GMT+01:00 Giselle van Dongen <giselle.vandon...@ugent.be>
> :
>
>> Dear users of Streaming Technologies,
>>
>> As a PhD student in big data analytics, I am currently in the process of
>> compiling a list of benchmarks (to test multiple streaming frameworks) in
>> order to create an expanded benchmarking suite. The benchmark suite is
>> being
>> developed as a part of my current work at Ghent University.
>>
>> The included frameworks at this time are, in no particular order, Spark,
>> Flink, Kafka (Streams), Storm (Trident) and Drizzle. Any pointers to
>> previous work or relevant benchmarks would be appreciated.
>>
>> Best regards,
>> Giselle van Dongen
>>
>
>
>

Re: Benchmarking streaming frameworks

Reply via email to