Github user ravipesala commented on the issue:

    https://github.com/apache/incubator-carbondata/pull/644
  
    Test results witj 100 million data 
    **DDL**
    CREATE TABLE perftesta (c1 string,c2 string,c3 string,c4 string,c5 
string,c6 bigint,c7 decimal(7,2),c8 int,c9 decimal(7,2),c10 decimal(15,2)) 
STORED BY 'carbondata'
    
    **Queries**
    Q1 -> SELECT 
count(c1),count(c2),count(c3),count(c4),count(c5),count(c6),count(c7),count(c8),count(c9),count(c10)
 FROM perftesta99;
    Q2 -> SELECT sum(c7), sum(c8), sum(9), sum(c10) FROM perftesta99 WHERE 
c2="P2_75" and c7<5;
    Q3 -> SELECT c2, c5, count(distinct c1), sum(c7) FROM perftesta99 WHERE 
c4="P4_4" and c5="P5_7" GROUP BY c2, c5;
    
    **Master Code**
    Load time  -> 576 seconds
    Data size after load  -> 1800MB
    Query(first_reading, second_reading)
    Q1(25.27, 21.794)
    Q2(27.296, 28.21)
    Q3(7.383, 5.103)
    
    **This PR Code**
    Load time -> 431 seconds
    Data size after load  -> 1720MB
    Query(first_reading, second_reading)
    Q1(18.507,14.427)
    Q2(24.102, 23.322)
    Q3(6.87,5.079)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

Reply via email to