Hi,

> I wanted to understand why hive has a performance issue with using _ 
> character in queries.

This is somewhat of a missed optimization issue - the "%" impl uses a fast 
BoyerMoore algorithm and avoids converting from utf-8 bytes -> String.

http://mail-archives.apache.org/mod_mbox/hive-user/201608.mbox/%3cd3c8de1e.4bcab%25go...@hortonworks.com%3E

That's the old thread for the same issue.

Filed a bug, since it came up again - 
https://issues.apache.org/jira/browse/HIVE-18487

If you want to pick this up and put up a patch, I can review it.

Cheers,
Gopal


Reply via email to