[ 
https://issues.apache.org/jira/browse/PHOENIX-1439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14208611#comment-14208611
 ] 

Mujtaba Chohan commented on PHOENIX-1439:
-----------------------------------------

Tried this with Phoenix 4.2 branch. Generated data using bin/performance.py 
script and ran the following query:
{code}
select count(*) from performance_5000000 where host like 'E%' and domain like 
'S%';
{code}

Explain plan shows skip scan is used:
{code}
CLIENT 2-CHUNK PARALLEL 1-WAY SKIP SCAN ON 1 RANGE OVER PERFORMANCE_5000000 ['E 
','S'] - ['F ','T'] 
     SERVER FILTER BY FIRST KEY ONLY      
     SERVER AGGREGATE INTO SINGLE ROW  
{code}

Also another query that runs part of perf test

{code}
select count(*) from IN_LIKE_TABLE where (K1 in ('A','E','F','G','H','I','Z') 
or K1 like 'E%') and (K2 like '%xx1%' or K2 like '%xxxxxx2%')
{code}

Explain plan:
{code}
CLIENT 6-CHUNK PARALLEL 1-WAY SKIP SCAN ON 6 RANGES OVER IN_LIKE_TABLE [A] - [Z]
    SERVER FILTER BY FIRST KEY ONLY AND (K2 LIKE %xx1% OR K2 LIKE %xxxxxx2%)
    SERVER AGGREGATE INTO SINGLE ROW
{code}

[~jamestaylor]

> LIKE doesn't appear to use skip scanning
> ----------------------------------------
>
>                 Key: PHOENIX-1439
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-1439
>             Project: Phoenix
>          Issue Type: Improvement
>            Reporter: Brian Johnson
>
> We had a query using several like statements of the form LIKE 'prefix|%' and 
> LIKE 'prefix2|%' ... and the performance was on par with a full table scan. 
> When we replaced it with BETWEEN 'prefix|' and 'prefix|FFFFFFF...' the query 
> time was substantially faster. It seems that the LIKE query should work with 
> skip scan, but I don't think it's being used.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to