[ https://issues.apache.org/jira/browse/HIVE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Teddy Choi updated HIVE-4642:
-----------------------------
    Attachment: Hive-Vectorized-Query-Execution-Design-rev10.docx

I wrote the "LIKE and REGEXP expressions:" section under "Filter operator". The added text follows.

{quote}
Filter condition expressions

LIKE and REGEXP expressions:
LIKE and REGEXP expressions find strings that match a pattern. They compile the pattern on creation and match strings against it on evaluation. Both kinds of expression use the Java regular expression package. REGEXP expressions pass the pattern to the package as is. LIKE expressions have a different grammar, so the pattern must be converted first: "%" is converted to ".*" and "_" is converted to ".".

The AbstractFilterStringColLikeStringScalar class defines the common behavior. The FilterStringColLikeStringScalar and FilterStringColRegExpStringScalar classes implement the differences.

Some simple patterns are used frequently, such as prefix match, suffix match, middle match, exact match, and phone numbers. There are optimized implementations for these patterns. They evaluate on the byte arrays directly to avoid the cost of UTF-8 decoding.
{quote}

This file was edited in Word for Mac 2011, so it may have compatibility issues.

> Implement vectorized RLIKE and REGEXP filter expressions
> --------------------------------------------------------
>
>                 Key: HIVE-4642
>                 URL: https://issues.apache.org/jira/browse/HIVE-4642
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Eric Hanson
>            Assignee: Teddy Choi
>         Attachments: HIVE-4642-1.patch, HIVE-4642.2.patch, HIVE-4642.3.patch.txt, Hive-Vectorized-Query-Execution-Design-rev10.docx
>
>
> See title. I will add more details next week. The goal is (a) make this work correctly and (b) optimize it as well as possible, at least for the common cases.
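
To illustrate the two ideas in the quoted design text (converting a LIKE pattern to a Java regular expression, and checking a simple prefix pattern directly on bytes), here is a minimal sketch. It is not taken from the attached patches: the class name LikeSketch and its methods are hypothetical, escape-character handling in LIKE patterns is left out, and quoting the literal parts with Pattern.quote is an assumption of this sketch rather than something the design text specifies.

```java
import java.nio.charset.StandardCharsets;
import java.util.regex.Pattern;

// Hypothetical illustration only; not the code from the HIVE-4642 patches.
final class LikeSketch {

    // Convert a LIKE pattern to a Java regex: "%" becomes ".*", "_" becomes ".".
    // Assumption: every other character is a literal, so runs of literals are
    // quoted to keep regex metacharacters such as "." from acting as wildcards.
    static String likeToRegex(String like) {
        StringBuilder regex = new StringBuilder();
        StringBuilder literal = new StringBuilder();
        for (int i = 0; i < like.length(); i++) {
            char c = like.charAt(i);
            if (c == '%' || c == '_') {
                if (literal.length() > 0) {
                    regex.append(Pattern.quote(literal.toString()));
                    literal.setLength(0);
                }
                regex.append(c == '%' ? ".*" : ".");
            } else {
                literal.append(c);
            }
        }
        if (literal.length() > 0) {
            regex.append(Pattern.quote(literal.toString()));
        }
        return regex.toString();
    }

    // Compile once, then match whole strings. DOTALL lets "%" and "_" span newlines.
    static Pattern compileLike(String like) {
        return Pattern.compile(likeToRegex(like), Pattern.DOTALL);
    }

    // Prefix-match fast path: compare UTF-8 bytes directly, without decoding
    // to a String. The value occupies bytes[start .. start + length).
    static boolean startsWith(byte[] bytes, int start, int length, byte[] prefix) {
        if (length < prefix.length) {
            return false;
        }
        for (int i = 0; i < prefix.length; i++) {
            if (bytes[start + i] != prefix[i]) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        Pattern p = compileLike("hello%");
        System.out.println(p.matcher("hello world").matches());

        byte[] row = "hello world".getBytes(StandardCharsets.UTF_8);
        byte[] prefix = "hello".getBytes(StandardCharsets.UTF_8);
        System.out.println(startsWith(row, 0, row.length, prefix));
    }
}
```

In this sketch, a pattern such as "abc%" could skip the regex path entirely and call startsWith on the raw bytes; comparing bytes is safe for a prefix test because a valid UTF-8 string starts with another valid UTF-8 string exactly when its bytes start with that string's bytes.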