Re: Hive performance issue with _ character in query

2018-01-18 Thread Gopal Vijayaraghavan
Hi,

> I wanted to understand why hive has a performance issue with using _
> character in queries.

This is somewhat of a missed optimization issue - the "%" impl uses a fast Boyer-Moore algorithm and avoids converting from utf-8 bytes -> String.
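To make the difference concrete, here is a rough Python sketch of the two code paths. It is not Hive's actual code: the function names are made up for illustration, the bytes `in` operator merely stands in for the Boyer-Moore search, LIKE escape characters are ignored, and it assumes that any pattern containing "_" falls back to a general regex match on a decoded String.

```python
import re


def like_to_regex(pattern: str) -> re.Pattern:
    """Translate a SQL LIKE pattern into a regex (% -> .*, _ -> .)."""
    parts = []
    for ch in pattern:
        if ch == "%":
            parts.append(".*")
        elif ch == "_":
            parts.append(".")
        else:
            parts.append(re.escape(ch))
    return re.compile("".join(parts), re.DOTALL)


def like_match(value: bytes, pattern: str) -> bool:
    """Hypothetical LIKE evaluator with a fast path for '%literal%'."""
    inner = pattern[1:-1]
    if (len(pattern) >= 2 and pattern[0] == "%" and pattern[-1] == "%"
            and "%" not in inner and "_" not in inner):
        # Fast path: plain substring search on the raw utf-8 bytes,
        # with no decoding of the column value.
        return inner.encode("utf-8") in value
    # Slow path (assumed): decode every value, then run a full regex match.
    return like_to_regex(pattern).fullmatch(value.decode("utf-8")) is not None
```

With this sketch, '%ell%' is answered by the byte search alone, while '%SR_311%' pays for a decode plus a regex match on every row.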

Hive performance issue with _ character in query

2018-01-17 Thread Arya Ketan
Hi,

I wanted to understand why Hive has a performance issue with using the _ character in queries.

For example,

select object from table where object like '%SR_311%' and date='2017-11-23'

takes hours. The same query (modified) completes in < 5 mins:

select object from table where object like