[jira] [Created] (CARBONDATA-4321) Major Compaction of a table with multiple big data loads each having different sort scopes fails

2022-01-06 Thread Chetan Bhat (Jira)
Chetan Bhat created CARBONDATA-4321:
---

 Summary: Major Compaction of a table with multiple big data loads 
each having different sort scopes fails
 Key: CARBONDATA-4321
 URL: https://issues.apache.org/jira/browse/CARBONDATA-4321
 Project: CarbonData
  Issue Type: Bug
  Components: data-load
Affects Versions: 2.3.0
 Environment: SUSE/Cent OS, Spark 3.1.1
Reporter: Chetan Bhat
 Attachments: Failure_Logs.txt

Test Steps :

>From Spark beeline table is created with compression format gzip, table having 
>more than 100 columns.

3 big data loads each with different sort scopes are loaded in the table.

Major compaction is executed on the table.

create table JL_r3
(
p_cap_time String,
city String,
product_code String,
user_base_station String,
user_belong_area_code String,
user_num String,
user_imsi String,
user_id String,
user_msisdn String,
dim1 String,
dim2 String,
dim3 String,
dim4 String,
dim5 String,
dim6 String,
dim7 String,
dim8 String,
dim9 String,
dim10 String,
dim11 String,
dim12 String,
dim13 String,
dim14 String,
dim15 String,
dim16 String,
dim17 String,
dim18 String,
dim19 String,
dim20 String,
dim21 String,
dim22 String,
dim23 String,
dim24 String,
dim25 String,
dim26 String,
dim27 String,
dim28 String,
dim29 String,
dim30 String,
dim31 String,
dim32 String,
dim33 String,
dim34 String,
dim35 String,
dim36 String,
dim37 String,
dim38 String,
dim39 String,
dim40 String,
dim41 String,
dim42 String,
dim43 String,
dim44 String,
dim45 String,
dim46 String,
dim47 String,
dim48 String,
dim49 String,
dim50 String,
dim51 String,
dim52 String,
dim53 String,
dim54 String,
dim55 String,
dim56 String,
dim57 String,
dim58 String,
dim59 String,
dim60 String,
dim61 String,
dim62 String,
dim63 String,
dim64 String,
dim65 String,
dim66 String,
dim67 String,
dim68 String,
dim69 String,
dim70 String,
dim71 String,
dim72 String,
dim73 String,
dim74 String,
dim75 String,
dim76 String,
dim77 String,
dim78 String,
dim79 String,
dim80 String,
dim81 String,
M1 double,
M2 double,
M3 double,
M4 double,
M5 double,
M6 double,
M7 double,
M8 double,
M9 double,
M10 double )
stored as carbondata
TBLPROPERTIES('table_blocksize'='256','sort_columns'='dim81','carbon.column.compressor'='gzip');

0: jdbc:hive2://10.21.19.14:23040/default> LOAD DATA inpath 
'hdfs://hacluster/chetan/Bigdata_bulk.csv' into table JL_r3 
options('sort_scope'='global_sort','DELIMITER'=',', 
'QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','BAD_RECORDS_LOGGER_ENABLE'='TRUE','IS_EMPTY_DATA_BAD_RECORD'='TRUE','FILEHEADER'='p_cap_time,city,product_code,user_base_station,user_belong_area_code,user_num,user_imsi,user_id,user_msisdn,dim1,dim2,dim3,dim4,dim5,dim6,dim7,dim8,dim9,dim10,dim11,dim12,dim13,dim14,dim15,dim16,dim17,dim18,dim19,dim20,dim21,dim22,dim23,dim24,dim25,dim26,dim27,dim28,dim29,dim30,dim31,dim32,dim33,dim34,dim35,dim36,dim37,dim38,dim39,dim40,dim41,dim42,dim43,dim44,dim45,dim46,dim47,dim48,dim49,dim50,dim51,dim52,dim53,dim54,dim55,dim56,dim57,dim58,dim59,dim60,dim61,dim62,dim63,dim64,dim65,dim66,dim67,dim68,dim69,dim70,dim71,dim72,dim73,dim74,dim75,dim76,dim77,dim78,dim79,dim80,dim81,M1,M2,M3,M4,M5,M6,M7,M8,M9,M10');
+-+
| Segment ID  |
+-+
| 0           |
+-+
1 row selected (41.011 seconds)
0: jdbc:hive2://10.21.19.14:23040/default> LOAD DATA inpath 
'hdfs://hacluster/chetan/Bigdata_bulk.csv' into table JL_r3 
options('sort_scope'='local_sort','DELIMITER'=',', 
'QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','BAD_RECORDS_LOGGER_ENABLE'='TRUE','IS_EMPTY_DATA_BAD_RECORD'='TRUE','FILEHEADER'='p_cap_time,city,product_code,user_base_station,user_belong_area_code,user_num,user_imsi,user_id,user_msisdn,dim1,dim2,dim3,dim4,dim5,dim6,dim7,dim8,dim9,dim10,dim11,dim12,dim13,dim14,dim15,dim16,dim17,dim18,dim19,dim20,dim21,dim22,dim23,dim24,dim25,dim26,dim27,dim28,dim29,dim30,dim31,dim32,dim33,dim34,dim35,dim36,dim37,dim38,dim39,dim40,dim41,dim42,dim43,dim44,dim45,dim46,dim47,dim48,dim49,dim50,dim51,dim52,dim53,dim54,dim55,dim56,dim57,dim58,dim59,dim60,dim61,dim62,dim63,dim64,dim65,dim66,dim67,dim68,dim69,dim70,dim71,dim72,dim73,dim74,dim75,dim76,dim77,dim78,dim79,dim80,dim81,M1,M2,M3,M4,M5,M6,M7,M8,M9,M10');
+-+
| Segment ID  |
+-+
| 1           |
+-+
1 row selected (17.094 seconds)
0: jdbc:hive2://10.21.19.14:23040/default> LOAD DATA inpath 
'hdfs://hacluster/chetan/Bigdata_bulk.csv' into table JL_r3 
options('sort_scope'='no_sort','DELIMITER'=',', 

[jira] [Created] (CARBONDATA-4320) Fix clean files removing wrong delta files

2022-01-06 Thread Vikram Ahuja (Jira)
Vikram Ahuja created CARBONDATA-4320:


 Summary: Fix clean files removing wrong delta files
 Key: CARBONDATA-4320
 URL: https://issues.apache.org/jira/browse/CARBONDATA-4320
 Project: CarbonData
  Issue Type: Bug
Reporter: Vikram Ahuja


h1. Fix clean files removing wrong delta files



--
This message was sent by Atlassian Jira
(v8.20.1#820001)