Geetika Gupta created CARBONDATA-1956:
-----------------------------------------

             Summary: Select query with sum, count and avg throws exception for 
pre aggregate table
                 Key: CARBONDATA-1956
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-1956
             Project: CarbonData
          Issue Type: Bug
          Components: data-query
    Affects Versions: 1.3.0
         Environment: spark2.1
            Reporter: Geetika Gupta
             Fix For: 1.3.0
         Attachments: 2000_UniqData.csv

I create a datamap using the following command:

create datamap uniqdata_agg_d on table uniqdata_29 using 'preaggregate' as 
select sum(decimal_column1), count(cust_id), avg(bigint_column1) from 
uniqdata_29 group by cust_id;

The datamap creation was successfull, but when I tried the following query:
select sum(decimal_column1), count(cust_id), avg(bigint_column1) from 
uniqdata_29 group by cust_id;

It throws the following exception:
Error: org.apache.spark.sql.AnalysisException: cannot resolve 
'(sum(uniqdata_29_uniqdata_agg_d.`uniqdata_29_bigint_column1_sum`) / 
sum(uniqdata_29_uniqdata_agg_d.`uniqdata_29_bigint_column1_count`))' due to 
data type mismatch: 
'(sum(uniqdata_29_uniqdata_agg_d.`uniqdata_29_bigint_column1_sum`) / 
sum(uniqdata_29_uniqdata_agg_d.`uniqdata_29_bigint_column1_count`))' requires 
(double or decimal) type, not bigint;;
'Aggregate [uniqdata_29_cust_id_count#244], 
[sum(uniqdata_29_decimal_column1_sum#243) AS sum(decimal_column1)#274, 
sum(cast(uniqdata_29_cust_id_count#244 as bigint)) AS count(cust_id)#276L, 
(sum(uniqdata_29_bigint_column1_sum#245L) / 
sum(uniqdata_29_bigint_column1_count#246L)) AS avg(bigint_column1)#279]
+- 
Relation[uniqdata_29_decimal_column1_sum#243,uniqdata_29_cust_id_count#244,uniqdata_29_bigint_column1_sum#245L,uniqdata_29_bigint_column1_count#246L]
 CarbonDatasourceHadoopRelation [ Database name :28dec, Table name 
:uniqdata_29_uniqdata_agg_d, Schema 
:Some(StructType(StructField(uniqdata_29_decimal_column1_sum,DecimalType(30,10),true),
 StructField(uniqdata_29_cust_id_count,IntegerType,true), 
StructField(uniqdata_29_bigint_column1_sum,LongType,true), 
StructField(uniqdata_29_bigint_column1_count,LongType,true))) ] (state=,code=0)

Steps for creation of maintable:
CREATE TABLE uniqdata_29(CUST_ID int,CUST_NAME String,ACTIVE_EMUI_VERSION 
string, DOB timestamp, DOJ timestamp, BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 
bigint,DECIMAL_COLUMN1 decimal(30,10), DECIMAL_COLUMN2 
decimal(36,10),Double_COLUMN1 double, Double_COLUMN2 double,INTEGER_COLUMN1 
int) STORED BY 'org.apache.carbondata.format';

Load command:
LOAD DATA INPATH 'hdfs://localhost:54311/Files/2000_UniqData.csv' into table 
uniqdata_29 OPTIONS('DELIMITER'=',', 
'QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1');

Datamap creation command:
create datamap uniqdata_agg_d on table uniqdata_29 using 'preaggregate' as 
select sum(decimal_column1), count(cust_id), avg(bigint_column1) from 
uniqdata_29 group by cust_id;

Note: sum(decimal_column1), count(cust_id), avg(bigint_column1) from 
uniqdata_29 group by cust_id; executed successfully on maintable





--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to