[ 
https://issues.apache.org/jira/browse/HIVE-542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12719728#action_12719728
 ] 

Zheng Shao commented on HIVE-542:
---------------------------------

Since there are so many advanced string matching algorithms (e.g. KMP and its 
variants), we should limit the scope here and maybe only optimize for the 
simplest case: "LIKE" that can be replaced by something like String.find().

Basically, if the pattern is in the form of "%xxx%" where xxx does not contain 
any wildcards, we can do "find".
If the pattern is in the form of "%xxx" or "xxx%", we can also do very 
efficient computation.


> Faster String Like
> ------------------
>
>                 Key: HIVE-542
>                 URL: https://issues.apache.org/jira/browse/HIVE-542
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.4.0
>            Reporter: Zheng Shao
>
> Currently Hive uses regular expression to do string like which is really 
> really slow. We should implement "LIKE" natively using Text.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to