Hi guys, we just finished testing Kudu, mostly comparing Kudu to Impala on HDFS/parquet. I wanted to share my blog post and results. We used typical (and real) healthcare data for the test, not a synthetic data which I think makes it is a bit more interesting.
I welcome any feedback! http://boristyukin.com/benchmarking-apache-kudu-vs-apache-impala/ We are really impressed with Kudu and I wanted to take an opportunity to thank Kudu developers for such an amazing and much-needed product. Boris
