Query Optimization in Hive

Anja Gruenheid Tue, 01 Feb 2011 08:20:06 -0800

Hi!

I'm a grad student at Georgia Tech and I'm currently working with Hivefor a university project. The project is on query optimizationtechniques and possibilities in Hive. I know that there have been a lotof additions to the ql and metastore components since the latest releaseand I was hoping to help advancing those components even further. Mymain interests in the course of my research is the storage and use ofmetadata to run a cost-based optimizer. This involves basicoptimizations using for example the table size for cost estimations, butalso more advanced approaches using histograms. I know that table andpartition information is already collected in Hive, but from what Icould gather, column metadata and histograms are still open. Would it bepossible for me to contribute to the project in that area?


Anja

Query Optimization in Hive

Reply via email to