Hello, everyone.

I'm a PhD student from the Institute of Computing Technology, Chinese
Academy of Sciences. Our team has released a benchmark for big data systems
called BigDataBench, which has become an industry-standard big data
benchmark in China. You can find our work on this website:
http://prof.ict.ac.cn/BigDataBench/

We are now planning to support Flink in our benchmark, which could provide a
set of workloads on different domains and an objective comparison with
systems such as Spark and Hadoop. But we are new to this system, so we are
asking for your advice about benchmark design. The first thing is to decide
what workloads should be added to our benchmark and which domain we should
pay more attention. 

The attachment is a preliminary plan, which lists some workloads that have
already been implemented in the Spark version. We plan to first implement
these workloads on Flink, and evalute these two systems. Does anyone have
some adivce for this list? We will be very grateful for any idea.
BigDataBench_for_Flink.docx
<http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/file/n7079/BigDataBench_for_Flink.docx>
  

Thanks ;)



--
View this message in context: 
http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/Benchmarks-of-Flink-supporting-Flink-in-BigDataBench-tp7079.html
Sent from the Apache Flink Mailing List archive. mailing list archive at 
Nabble.com.

Reply via email to