Did you guys already look at Dr Elephant? https://engineering.linkedin.com/blog/2016/04/dr-elephant-open-source-self-serve-performance-tuning-hadoop-spark
Not sure if there is anything you might find useful, but I would be interested in hearing about good and bad about Dr Elephant w/ Hive. Sent from my iPhone > On Jul 25, 2018, at 12:13 PM, Zheng Shao <zsh...@gmail.com> wrote: > > Hi, > > I am interested in working on a project that takes a large number of Hive > queries (as well as their meta data like amount of resources used etc) and > find out common sub queries and expensive query groups etc. > > Are there any existing work in this domain? Happy to collaborate as well if > there are shared I interests. > > Zheng >