Hello, everyone. I'm a PhD student from the Institute of Computing Technology, Chinese Academy of Sciences. Our team has released a benchmark for big data systems called BigDataBench, which has become an industry-standard big data benchmark in China. You can find our work on this website: http://prof.ict.ac.cn/BigDataBench/
We are now planning to support Flink in our benchmark, which would provide a set of workloads across different domains and allow an objective comparison with systems such as Spark and Hadoop. Since we are new to Flink, we would like to ask for your advice on the benchmark design.

The first step is to decide which workloads should be added to our benchmark and which domains we should pay more attention to. The attachment is a preliminary plan that lists some workloads already implemented in the Spark version. We plan to implement these workloads on Flink first, and then evaluate the two systems. Does anyone have advice on this list? We would be very grateful for any ideas.

BigDataBench_for_Flink.docx <http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/file/n7079/BigDataBench_for_Flink.docx>

Thanks ;)