Mujtaba Chohan created PHOENIX-3023:
---------------------------------------

             Summary: Slow performance when limit queries are executed in 
parallel by default
                 Key: PHOENIX-3023
                 URL: https://issues.apache.org/jira/browse/PHOENIX-3023
             Project: Phoenix
          Issue Type: Bug
            Reporter: Mujtaba Chohan


After 
[this|https://git-wip-us.apache.org/repos/asf?p=phoenix.git;a=commit;h=54362430d71be788d515944573572624628a09b6]
 commit, limit queries are executed in parallel which causes performance to be 
~5-10x slower. Providing a serial hint fixes it though.

After commit:
{code}
select * from WIDE_PK order by mypk DESC limit 1; // this takes ~400ms

CLIENT 1280-CHUNK 1996304 ROWS 6380181208 BYTES PARALLEL 4-WAY REVERSE FULL 
SCAN OVER WIDE_PK SERVER 1 ROW LIMIT CLIENT MERGE SORT CLIENT 1 ROW LIMIT
{code}

Before commit:
{code}
select * from WIDE_PK order by mypk DESC limit 1; // this takes ~40ms

CLIENT 1280-CHUNK 1996304 ROWS 6380181208 BYTES SERIAL 4-WAY REVERSE FULL SCAN 
OVER WIDE_PK SERVER 1 ROW LIMIT CLIENT MERGE SORT CLIENT 1 ROW LIMIT
{code}

Test was done on a single node machine running HBase 0.98.17.  DDL used was 
{code}CREATE TABLE WIDE_PK (MYPK CHAR(500) NOT NULL PRIMARY KEY,CF.column1 
INTEGER,CF.column2 INTEGER,CF.column3 INTEGER,CF.column4 INTEGER,CF.column5 
INTEGER) SALT_BUCKETS=4 with phoenix.stats.guidepost.width of 5000000
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to