Hi,

> I wanted to understand why hive has a performance issue with using _
> character in queries.

This is somewhat of a missed optimization - the "%" implementation uses a
fast Boyer-Moore search and avoids converting from UTF-8 bytes -> String,
while patterns containing "_" do not get that fast path.

http://mail-archives.apache.org/mod_mbox/hive-user/201608.mbox/%3cd3c8de1e.4bcab%25go...@hortonworks.com%3E

That's the old thread for the same issue.

Filed a bug, since it came up again - https://issues.apache.org/jira/browse/HIVE-18487

If you want to pick this up and put up a patch, I can review it.

Cheers,
Gopal
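
P.S. To illustrate the difference described above, here is a minimal Java sketch. It is not Hive's actual code: the class name LikeSketch and its methods are hypothetical, the byte search is a Boyer-Moore-Horspool variant chosen for brevity, and LIKE escape characters are ignored. It only shows why a '%needle%' pattern can be matched directly on UTF-8 bytes, while a pattern with "_" (in this sketch) goes through a regex and needs a String first.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.regex.Pattern;

public class LikeSketch {

    // Fast path for LIKE '%needle%': Boyer-Moore-Horspool search on raw
    // UTF-8 bytes. No String is allocated per row.
    static boolean containsBytes(byte[] haystack, byte[] needle) {
        int n = haystack.length, m = needle.length;
        if (m == 0) return true;
        if (m > n) return false;
        // Bad-character table: how far the window may slide for each byte.
        int[] shift = new int[256];
        Arrays.fill(shift, m);
        for (int i = 0; i < m - 1; i++) {
            shift[needle[i] & 0xFF] = m - 1 - i;
        }
        int pos = 0;
        while (pos <= n - m) {
            int j = m - 1;
            while (j >= 0 && haystack[pos + j] == needle[j]) j--;
            if (j < 0) return true;
            pos += shift[haystack[pos + m - 1] & 0xFF];
        }
        return false;
    }

    // Slow path (assumed for this sketch): translate the LIKE pattern to a
    // regex, with '%' -> ".*" and '_' -> "."; literal runs are quoted.
    static Pattern likeToRegex(String like) {
        StringBuilder regex = new StringBuilder();
        StringBuilder literal = new StringBuilder();
        for (char c : like.toCharArray()) {
            if (c == '%' || c == '_') {
                if (literal.length() > 0) {
                    regex.append(Pattern.quote(literal.toString()));
                    literal.setLength(0);
                }
                regex.append(c == '%' ? ".*" : ".");
            } else {
                literal.append(c);
            }
        }
        if (literal.length() > 0) {
            regex.append(Pattern.quote(literal.toString()));
        }
        return Pattern.compile(regex.toString(), Pattern.DOTALL);
    }

    // The regex engine works on chars, so every value is decoded from
    // UTF-8 bytes to a String before matching.
    static boolean matchesViaRegex(byte[] value, Pattern p) {
        String s = new String(value, StandardCharsets.UTF_8);
        return p.matcher(s).matches();
    }
}
```

The byte-level search is safe on UTF-8 because a valid UTF-8 needle cannot match starting in the middle of another character. A "_" wildcard has to skip one character rather than one byte, which is what makes a bytes-only version of that path more work.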