Vandana Yadav created CARBONDATA-2136:
-----------------------------------------
Summary: Exception displays while loading data with
BAD_RECORDS_ACTION = REDIRECT
Key: CARBONDATA-2136
URL: https://issues.apache.org/jira/browse/CARBONDATA-2136
Project: CarbonData
Issue Type: Bug
Components: data-load
Affects Versions: 1.3.0
Environment: spark 2.1
Reporter: Vandana Yadav
Attachments: 2000_UniqData.csv
Exception displays while loading data with BAD_RECORDS_ACTION = REDIRECT
Steps to reproduce:
1) create the table:
CREATE TABLE uniqdata(CUST_ID int,CUST_NAME String,ACTIVE_EMUI_VERSION string,
DOB timestamp, DOJ timestamp, BIGINT_COLUMN1 bigint,BIGINT_COLUMN2
bigint,DECIMAL_COLUMN1 decimal(30,10), DECIMAL_COLUMN2
decimal(36,10),Double_COLUMN1 double, Double_COLUMN2 double,INTEGER_COLUMN1
int) STORED BY 'org.apache.carbondata.format'
TBLPROPERTIES('DICTIONARY_INCLUDE'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1',"TABLE_BLOCKSIZE"=
"256
MB",'SORT_SCOPE'='NO_SORT','NO_INVERTED_INDEX'='CUST_ID,CUST_NAME,Double_COLUMN1,DECIMAL_COLUMN2');
2) Load Data:
LOAD DATA INPATH 'hdfs://localhost:54310/Data/uniqdata/2000_UniqData.csv' into
table uniqdata OPTIONS('DELIMITER'=',',
'QUOTECHAR'='"','BAD_RECORDS_ACTION'='REDIRECT','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1');
Expected Result: data should be loaded successfully.
Actual Result:
Error: java.lang.Exception: DataLoad failure: There is an unexpected error:
unable to generate the mdkey (state=,code=0)
3) ThriftServer logs:
18/02/06 16:38:11 INFO SparkExecuteStatementOperation: Running query 'LOAD DATA
INPATH 'hdfs://localhost:54310/Data/uniqdata/2000_UniqData.csv' into table
uniqdata OPTIONS('DELIMITER'=',',
'QUOTECHAR'='"','BAD_RECORDS_ACTION'='REDIRECT','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')'
with 87eb4af5-e485-4a0b-bcae-6589f1252291
18/02/06 16:38:11 INFO CarbonSparkSqlParser: Parsing command: LOAD DATA INPATH
'hdfs://localhost:54310/Data/uniqdata/2000_UniqData.csv' into table uniqdata
OPTIONS('DELIMITER'=',',
'QUOTECHAR'='"','BAD_RECORDS_ACTION'='REDIRECT','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')
18/02/06 16:38:11 INFO CarbonLateDecodeRule: pool-23-thread-41 skip
CarbonOptimizer
18/02/06 16:38:11 INFO CarbonLateDecodeRule: pool-23-thread-41 Skip
CarbonOptimizer
18/02/06 16:38:11 INFO HiveMetaStore: 42: get_table : db=bug tbl=uniqdata
18/02/06 16:38:11 INFO audit: ugi=hduser ip=unknown-ip-addr cmd=get_table :
db=bug tbl=uniqdata
18/02/06 16:38:11 INFO HiveMetaStore: 42: Opening raw store with implemenation
class:org.apache.hadoop.hive.metastore.ObjectStore
18/02/06 16:38:11 INFO ObjectStore: ObjectStore, initialize called
18/02/06 16:38:11 INFO Query: Reading in results for query
"org.datanucleus.store.rdbms.query.SQLQuery@0" since the connection used is
closing
18/02/06 16:38:11 INFO MetaStoreDirectSql: Using direct SQL, underlying DB is
DERBY
18/02/06 16:38:11 INFO ObjectStore: Initialized ObjectStore
18/02/06 16:38:11 INFO CatalystSqlParser: Parsing command: array<string>
18/02/06 16:38:11 INFO CarbonLoadDataCommand: pool-23-thread-41 Deleting stale
folders if present for table bug.uniqdata
18/02/06 16:38:11 INFO CarbonLoadDataCommand: pool-23-thread-41 Initiating
Direct Load for the Table : (bug.uniqdata)
18/02/06 16:38:12 INFO HdfsFileLock: pool-23-thread-41 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/tablestatus.lock
18/02/06 16:38:12 INFO HdfsFileLock: pool-23-thread-41 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/Segment_1.lock
18/02/06 16:38:12 INFO DeleteLoadFolders: pool-23-thread-41 Info: Deleted the
load 1
18/02/06 16:38:12 INFO DeleteLoadFolders: pool-23-thread-41 Info: Segment lock
on segment:1 is released
18/02/06 16:38:12 INFO DataLoadingUtil$: pool-23-thread-41 Table status lock
has been successfully acquired.
18/02/06 16:38:12 INFO HdfsFileLock: pool-23-thread-41 Deleted the lock file
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/tablestatus.lock
18/02/06 16:38:12 INFO CarbonLockUtil: pool-23-thread-41 Table status lock has
been successfully released
18/02/06 16:38:12 WARN DeleteLoadFolders: pool-23-thread-41 Files are not found
in segment
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/Fact/Part0/Segment_0
it seems, files are already being deleted
18/02/06 16:38:12 WARN DeleteLoadFolders: pool-23-thread-41 Files are not found
in segment
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/Fact/Part0/Segment_1
it seems, files are already being deleted
18/02/06 16:38:12 INFO HdfsFileLock: pool-23-thread-41 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/tablestatus.lock
18/02/06 16:38:12 INFO CarbonLoaderUtil: pool-23-thread-41 Acquired lock for
tablebug.uniqdata for table status updation
18/02/06 16:38:12 INFO HdfsFileLock: pool-23-thread-41 Deleted the lock file
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/tablestatus.lock
18/02/06 16:38:12 INFO CarbonLoaderUtil: pool-23-thread-41 Table unlocked
successfully after table status updationbug.uniqdata
18/02/06 16:38:12 INFO GlobalDictionaryUtil$: pool-23-thread-41 Generate global
dictionary from source data files!
18/02/06 16:38:12 INFO MemoryStore: Block broadcast_36 stored as values in
memory (estimated size 293.6 KB, free 2.5 GB)
18/02/06 16:38:12 INFO MemoryStore: Block broadcast_36_piece0 stored as bytes
in memory (estimated size 24.7 KB, free 2.5 GB)
18/02/06 16:38:12 INFO BlockManagerInfo: Added broadcast_36_piece0 in memory on
192.168.2.160:44339 (size: 24.7 KB, free: 2.5 GB)
18/02/06 16:38:12 INFO SparkContext: Created broadcast 36 from NewHadoopRDD at
GlobalDictionaryUtil.scala:377
18/02/06 16:38:12 INFO SparkContext: Starting job: collect at
GlobalDictionaryUtil.scala:755
18/02/06 16:38:12 INFO FileInputFormat: Total input paths to process : 1
18/02/06 16:38:12 INFO DAGScheduler: Registering RDD 98 (RDD at
CarbonRDD.scala:33)
18/02/06 16:38:12 INFO DAGScheduler: Got job 22 (collect at
GlobalDictionaryUtil.scala:755) with 10 output partitions
18/02/06 16:38:12 INFO DAGScheduler: Final stage: ResultStage 30 (collect at
GlobalDictionaryUtil.scala:755)
18/02/06 16:38:12 INFO DAGScheduler: Parents of final stage:
List(ShuffleMapStage 29)
18/02/06 16:38:12 INFO DAGScheduler: Missing parents: List(ShuffleMapStage 29)
18/02/06 16:38:12 INFO DAGScheduler: Submitting ShuffleMapStage 29
(CarbonBlockDistinctValuesCombineRDD[98] at RDD at CarbonRDD.scala:33), which
has no missing parents
18/02/06 16:38:12 INFO MemoryStore: Block broadcast_37 stored as values in
memory (estimated size 11.3 KB, free 2.5 GB)
18/02/06 16:38:12 INFO MemoryStore: Block broadcast_37_piece0 stored as bytes
in memory (estimated size 5.8 KB, free 2.5 GB)
18/02/06 16:38:12 INFO BlockManagerInfo: Added broadcast_37_piece0 in memory on
192.168.2.160:44339 (size: 5.8 KB, free: 2.5 GB)
18/02/06 16:38:12 INFO SparkContext: Created broadcast 37 from broadcast at
DAGScheduler.scala:996
18/02/06 16:38:12 INFO DAGScheduler: Submitting 1 missing tasks from
ShuffleMapStage 29 (CarbonBlockDistinctValuesCombineRDD[98] at RDD at
CarbonRDD.scala:33)
18/02/06 16:38:12 INFO TaskSchedulerImpl: Adding task set 29.0 with 1 tasks
18/02/06 16:38:12 INFO TaskSetManager: Starting task 0.0 in stage 29.0 (TID 92,
localhost, executor driver, partition 0, ANY, 6597 bytes)
18/02/06 16:38:12 INFO Executor: Running task 0.0 in stage 29.0 (TID 92)
18/02/06 16:38:12 INFO NewHadoopRDD: Input split:
hdfs://localhost:54310/Data/uniqdata/2000_UniqData.csv:0+376223
18/02/06 16:38:12 INFO Executor: Finished task 0.0 in stage 29.0 (TID 92). 1343
bytes result sent to driver
18/02/06 16:38:12 INFO TaskSetManager: Finished task 0.0 in stage 29.0 (TID 92)
in 108 ms on localhost (executor driver) (1/1)
18/02/06 16:38:12 INFO TaskSchedulerImpl: Removed TaskSet 29.0, whose tasks
have all completed, from pool
18/02/06 16:38:12 INFO DAGScheduler: ShuffleMapStage 29 (RDD at
CarbonRDD.scala:33) finished in 0.107 s
18/02/06 16:38:12 INFO DAGScheduler: looking for newly runnable stages
18/02/06 16:38:12 INFO DAGScheduler: running: Set()
18/02/06 16:38:12 INFO DAGScheduler: waiting: Set(ResultStage 30)
18/02/06 16:38:12 INFO DAGScheduler: failed: Set()
18/02/06 16:38:12 INFO DAGScheduler: Submitting ResultStage 30
(CarbonGlobalDictionaryGenerateRDD[100] at RDD at CarbonRDD.scala:33), which
has no missing parents
18/02/06 16:38:12 INFO MemoryStore: Block broadcast_38 stored as values in
memory (estimated size 10.7 KB, free 2.5 GB)
18/02/06 16:38:12 INFO MemoryStore: Block broadcast_38_piece0 stored as bytes
in memory (estimated size 5.4 KB, free 2.5 GB)
18/02/06 16:38:12 INFO BlockManagerInfo: Added broadcast_38_piece0 in memory on
192.168.2.160:44339 (size: 5.4 KB, free: 2.5 GB)
18/02/06 16:38:12 INFO SparkContext: Created broadcast 38 from broadcast at
DAGScheduler.scala:996
18/02/06 16:38:12 INFO DAGScheduler: Submitting 10 missing tasks from
ResultStage 30 (CarbonGlobalDictionaryGenerateRDD[100] at RDD at
CarbonRDD.scala:33)
18/02/06 16:38:12 INFO TaskSchedulerImpl: Adding task set 30.0 with 10 tasks
18/02/06 16:38:12 INFO TaskSetManager: Starting task 0.0 in stage 30.0 (TID 93,
localhost, executor driver, partition 0, ANY, 6314 bytes)
18/02/06 16:38:12 INFO TaskSetManager: Starting task 1.0 in stage 30.0 (TID 94,
localhost, executor driver, partition 1, ANY, 6314 bytes)
18/02/06 16:38:12 INFO TaskSetManager: Starting task 2.0 in stage 30.0 (TID 95,
localhost, executor driver, partition 2, ANY, 6314 bytes)
18/02/06 16:38:12 INFO TaskSetManager: Starting task 3.0 in stage 30.0 (TID 96,
localhost, executor driver, partition 3, ANY, 6314 bytes)
18/02/06 16:38:12 INFO Executor: Running task 0.0 in stage 30.0 (TID 93)
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-26 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/8c24301c-143c-47f9-8bc7-4bf122787b70.lock
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks
out of 1 blocks
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in
0 ms
18/02/06 16:38:12 INFO Executor: Running task 1.0 in stage 30.0 (TID 94)
18/02/06 16:38:12 INFO Executor: Running task 2.0 in stage 30.0 (TID 95)
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-28 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/fa04704a-f022-4d7e-b8e9-078357d50e84.lock
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks
out of 1 blocks
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in
0 ms
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-27 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/c65a6129-0e3d-4686-8176-b82c200e9e81.lock
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks
out of 1 blocks
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in
0 ms
18/02/06 16:38:12 INFO Executor: Running task 3.0 in stage 30.0 (TID 96)
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-29 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/e1872e3d-3c67-4cc5-9356-2e6c614725db.lock
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks
out of 1 blocks
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in
0 ms
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able to
get the dictionary lock for cust_id
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task launch
worker-26
columnName: cust_id
columnId: 8c24301c-143c-47f9-8bc7-4bf122787b70
new distinct values count: 0
combine lists: 2
create dictionary cache: 1
sort list, distinct and write: 1
write sort info: 0
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able to
get the dictionary lock for cust_name
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able to
get the dictionary lock for active_emui_version
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able to
get the dictionary lock for bigint_column1
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task launch
worker-29
columnName: bigint_column1
columnId: e1872e3d-3c67-4cc5-9356-2e6c614725db
new distinct values count: 0
combine lists: 4
create dictionary cache: 2
sort list, distinct and write: 1
write sort info: 0
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task launch
worker-28
columnName: active_emui_version
columnId: fa04704a-f022-4d7e-b8e9-078357d50e84
new distinct values count: 0
combine lists: 3
create dictionary cache: 4
sort list, distinct and write: 1
write sort info: 0
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task launch
worker-27
columnName: cust_name
columnId: c65a6129-0e3d-4686-8176-b82c200e9e81
new distinct values count: 0
combine lists: 2
create dictionary cache: 6
sort list, distinct and write: 1
write sort info: 0
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-26 Deleted the
lock file
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/8c24301c-143c-47f9-8bc7-4bf122787b70.lock
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary cust_id
Unlocked Successfully.
18/02/06 16:38:12 INFO Executor: Finished task 0.0 in stage 30.0 (TID 93). 1728
bytes result sent to driver
18/02/06 16:38:12 INFO TaskSetManager: Starting task 4.0 in stage 30.0 (TID 97,
localhost, executor driver, partition 4, ANY, 6314 bytes)
18/02/06 16:38:12 INFO TaskSetManager: Finished task 0.0 in stage 30.0 (TID 93)
in 123 ms on localhost (executor driver) (1/10)
18/02/06 16:38:12 INFO Executor: Running task 4.0 in stage 30.0 (TID 97)
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-26 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/4108c641-a578-4415-b3ed-6ccb1a587d9b.lock
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks
out of 1 blocks
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in
0 ms
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-27 Deleted the
lock file
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/c65a6129-0e3d-4686-8176-b82c200e9e81.lock
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-28 Deleted the
lock file
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/fa04704a-f022-4d7e-b8e9-078357d50e84.lock
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary cust_name
Unlocked Successfully.
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary
active_emui_version Unlocked Successfully.
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-29 Deleted the
lock file
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/e1872e3d-3c67-4cc5-9356-2e6c614725db.lock
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary
bigint_column1 Unlocked Successfully.
18/02/06 16:38:12 INFO Executor: Finished task 2.0 in stage 30.0 (TID 95). 1728
bytes result sent to driver
18/02/06 16:38:12 INFO Executor: Finished task 1.0 in stage 30.0 (TID 94). 1728
bytes result sent to driver
18/02/06 16:38:12 INFO Executor: Finished task 3.0 in stage 30.0 (TID 96). 1728
bytes result sent to driver
18/02/06 16:38:12 INFO TaskSetManager: Starting task 5.0 in stage 30.0 (TID 98,
localhost, executor driver, partition 5, ANY, 6314 bytes)
18/02/06 16:38:12 INFO Executor: Running task 5.0 in stage 30.0 (TID 98)
18/02/06 16:38:12 INFO TaskSetManager: Starting task 6.0 in stage 30.0 (TID 99,
localhost, executor driver, partition 6, ANY, 6314 bytes)
18/02/06 16:38:12 INFO Executor: Running task 6.0 in stage 30.0 (TID 99)
18/02/06 16:38:12 INFO TaskSetManager: Starting task 7.0 in stage 30.0 (TID
100, localhost, executor driver, partition 7, ANY, 6314 bytes)
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-29 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/34778415-43d0-4d19-917b-52d079c9284f.lock
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-27 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/08a717d5-4a69-4fde-a14f-5f3bd762c149.lock
18/02/06 16:38:12 INFO TaskSetManager: Finished task 2.0 in stage 30.0 (TID 95)
in 135 ms on localhost (executor driver) (2/10)
18/02/06 16:38:12 INFO TaskSetManager: Finished task 1.0 in stage 30.0 (TID 94)
in 136 ms on localhost (executor driver) (3/10)
18/02/06 16:38:12 INFO TaskSetManager: Finished task 3.0 in stage 30.0 (TID 96)
in 136 ms on localhost (executor driver) (4/10)
18/02/06 16:38:12 INFO Executor: Running task 7.0 in stage 30.0 (TID 100)
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks
out of 1 blocks
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in
0 ms
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks
out of 1 blocks
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in
0 ms
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-28 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/67332d44-9548-473d-b171-ea410122f773.lock
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks
out of 1 blocks
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in
0 ms
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able to
get the dictionary lock for bigint_column2
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task launch
worker-26
columnName: bigint_column2
columnId: 4108c641-a578-4415-b3ed-6ccb1a587d9b
new distinct values count: 0
combine lists: 1
create dictionary cache: 1
sort list, distinct and write: 2
write sort info: 0
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able to
get the dictionary lock for decimal_column1
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able to
get the dictionary lock for double_column1
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able to
get the dictionary lock for decimal_column2
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task launch
worker-27
columnName: decimal_column2
columnId: 08a717d5-4a69-4fde-a14f-5f3bd762c149
new distinct values count: 0
combine lists: 3
create dictionary cache: 2
sort list, distinct and write: 6
write sort info: 0
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task launch
worker-29
columnName: decimal_column1
columnId: 34778415-43d0-4d19-917b-52d079c9284f
new distinct values count: 0
combine lists: 3
create dictionary cache: 2
sort list, distinct and write: 6
write sort info: 0
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task launch
worker-28
columnName: double_column1
columnId: 67332d44-9548-473d-b171-ea410122f773
new distinct values count: 0
combine lists: 1
create dictionary cache: 8
sort list, distinct and write: 0
write sort info: 0
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-26 Deleted the
lock file
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/4108c641-a578-4415-b3ed-6ccb1a587d9b.lock
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary
bigint_column2 Unlocked Successfully.
18/02/06 16:38:12 INFO Executor: Finished task 4.0 in stage 30.0 (TID 97). 1801
bytes result sent to driver
18/02/06 16:38:12 INFO TaskSetManager: Starting task 8.0 in stage 30.0 (TID
101, localhost, executor driver, partition 8, ANY, 6314 bytes)
18/02/06 16:38:12 INFO TaskSetManager: Finished task 4.0 in stage 30.0 (TID 97)
in 118 ms on localhost (executor driver) (5/10)
18/02/06 16:38:12 INFO Executor: Running task 8.0 in stage 30.0 (TID 101)
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-26 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/8b0db910-ca20-4a95-9f9c-fe94fa460b27.lock
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks
out of 1 blocks
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in
0 ms
18/02/06 16:38:12 INFO BlockManagerInfo: Removed broadcast_33_piece0 on
192.168.2.160:44339 in memory (size: 5.8 KB, free: 2.5 GB)
18/02/06 16:38:12 INFO BlockManagerInfo: Removed broadcast_34_piece0 on
192.168.2.160:44339 in memory (size: 5.4 KB, free: 2.5 GB)
18/02/06 16:38:12 INFO BlockManagerInfo: Removed broadcast_35_piece0 on
192.168.2.160:44339 in memory (size: 30.7 KB, free: 2.5 GB)
18/02/06 16:38:12 INFO BlockManagerInfo: Removed broadcast_37_piece0 on
192.168.2.160:44339 in memory (size: 5.8 KB, free: 2.5 GB)
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-28 Deleted the
lock file
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/67332d44-9548-473d-b171-ea410122f773.lock
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary
double_column1 Unlocked Successfully.
18/02/06 16:38:12 INFO Executor: Finished task 7.0 in stage 30.0 (TID 100).
1801 bytes result sent to driver
18/02/06 16:38:12 INFO BlockManagerInfo: Removed broadcast_31_piece0 on
192.168.2.160:44339 in memory (size: 30.6 KB, free: 2.5 GB)
18/02/06 16:38:12 INFO TaskSetManager: Starting task 9.0 in stage 30.0 (TID
102, localhost, executor driver, partition 9, ANY, 6314 bytes)
18/02/06 16:38:12 INFO Executor: Running task 9.0 in stage 30.0 (TID 102)
18/02/06 16:38:12 INFO TaskSetManager: Finished task 7.0 in stage 30.0 (TID
100) in 123 ms on localhost (executor driver) (6/10)
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-28 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/a5d84076-9ba2-4c63-88cc-6881452eccfb.lock
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks
out of 1 blocks
18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in
0 ms
18/02/06 16:38:12 INFO BlockManagerInfo: Removed broadcast_32_piece0 on
192.168.2.160:44339 in memory (size: 24.7 KB, free: 2.5 GB)
18/02/06 16:38:12 INFO ContextCleaner: Cleaned shuffle 6
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-27 Deleted the
lock file
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/08a717d5-4a69-4fde-a14f-5f3bd762c149.lock
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary
decimal_column2 Unlocked Successfully.
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-29 Deleted the
lock file
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/34778415-43d0-4d19-917b-52d079c9284f.lock
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary
decimal_column1 Unlocked Successfully.
18/02/06 16:38:12 INFO Executor: Finished task 5.0 in stage 30.0 (TID 98). 1801
bytes result sent to driver
18/02/06 16:38:12 INFO Executor: Finished task 6.0 in stage 30.0 (TID 99). 1801
bytes result sent to driver
18/02/06 16:38:12 INFO TaskSetManager: Finished task 5.0 in stage 30.0 (TID 98)
in 133 ms on localhost (executor driver) (7/10)
18/02/06 16:38:12 INFO TaskSetManager: Finished task 6.0 in stage 30.0 (TID 99)
in 133 ms on localhost (executor driver) (8/10)
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able to
get the dictionary lock for double_column2
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task launch
worker-26
columnName: double_column2
columnId: 8b0db910-ca20-4a95-9f9c-fe94fa460b27
new distinct values count: 0
combine lists: 0
create dictionary cache: 2
sort list, distinct and write: 0
write sort info: 0
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able to
get the dictionary lock for integer_column1
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task launch
worker-28
columnName: integer_column1
columnId: a5d84076-9ba2-4c63-88cc-6881452eccfb
new distinct values count: 0
combine lists: 2
create dictionary cache: 2
sort list, distinct and write: 2
write sort info: 0
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-26 Deleted the
lock file
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/8b0db910-ca20-4a95-9f9c-fe94fa460b27.lock
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary
double_column2 Unlocked Successfully.
18/02/06 16:38:12 INFO Executor: Finished task 8.0 in stage 30.0 (TID 101).
1728 bytes result sent to driver
18/02/06 16:38:12 INFO TaskSetManager: Finished task 8.0 in stage 30.0 (TID
101) in 204 ms on localhost (executor driver) (9/10)
18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-28 Deleted the
lock file
hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/a5d84076-9ba2-4c63-88cc-6881452eccfb.lock
18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary
integer_column1 Unlocked Successfully.
18/02/06 16:38:12 INFO Executor: Finished task 9.0 in stage 30.0 (TID 102).
1728 bytes result sent to driver
18/02/06 16:38:12 INFO TaskSetManager: Finished task 9.0 in stage 30.0 (TID
102) in 200 ms on localhost (executor driver) (10/10)
18/02/06 16:38:12 INFO TaskSchedulerImpl: Removed TaskSet 30.0, whose tasks
have all completed, from pool
18/02/06 16:38:12 INFO DAGScheduler: ResultStage 30 (collect at
GlobalDictionaryUtil.scala:755) finished in 0.457 s
18/02/06 16:38:12 INFO DAGScheduler: Job 22 finished: collect at
GlobalDictionaryUtil.scala:755, took 0.601328 s
18/02/06 16:38:12 INFO GlobalDictionaryUtil$: pool-23-thread-41 generate global
dictionary successfully
18/02/06 16:38:12 AUDIT CarbonDataRDDFactory$:
[knoldus][hduser][Thread-1336]Data load request has been received for table
bug.uniqdata
18/02/06 16:38:13 WARN CarbonDataProcessorUtil: pool-23-thread-41 sort scope is
set to NO_SORT
18/02/06 16:38:13 INFO HdfsFileLock: pool-23-thread-41 HDFS lock
path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/Segment_2.lock
18/02/06 16:38:13 INFO CommonUtil$: pool-23-thread-41 [Block Distribution]
18/02/06 16:38:13 INFO CommonUtil$: pool-23-thread-41 totalInputSpaceConsumed:
376223 , defaultParallelism: 4
18/02/06 16:38:13 INFO CommonUtil$: pool-23-thread-41
mapreduce.input.fileinputformat.split.maxsize: 16777216
18/02/06 16:38:13 INFO FileInputFormat: Total input paths to process : 1
18/02/06 16:38:13 INFO DistributionUtil$: pool-23-thread-41 Executors
configured : 1
18/02/06 16:38:13 INFO DistributionUtil$: pool-23-thread-41 Total Time taken to
ensure the required executors : 1
18/02/06 16:38:13 INFO DistributionUtil$: pool-23-thread-41 Time elapsed to
allocate the required executors: 0
18/02/06 16:38:13 INFO CarbonDataRDDFactory$: pool-23-thread-41 Total Time
taken in block allocation: 1
18/02/06 16:38:13 INFO CarbonDataRDDFactory$: pool-23-thread-41 Total no of
blocks: 1, No.of Nodes: 1
18/02/06 16:38:13 INFO CarbonDataRDDFactory$: pool-23-thread-41 #Node: knoldus
no.of.blocks: 1
18/02/06 16:38:13 INFO SparkContext: Starting job: collect at
CarbonDataRDDFactory.scala:1092
18/02/06 16:38:13 INFO DAGScheduler: Got job 23 (collect at
CarbonDataRDDFactory.scala:1092) with 1 output partitions
18/02/06 16:38:13 INFO DAGScheduler: Final stage: ResultStage 31 (collect at
CarbonDataRDDFactory.scala:1092)
18/02/06 16:38:13 INFO DAGScheduler: Parents of final stage: List()
18/02/06 16:38:13 INFO DAGScheduler: Missing parents: List()
18/02/06 16:38:13 INFO DAGScheduler: Submitting ResultStage 31
(NewCarbonDataLoadRDD[101] at RDD at CarbonRDD.scala:33), which has no missing
parents
18/02/06 16:38:13 INFO NewCarbonDataLoadRDD: Preferred Location for split :
knoldus
18/02/06 16:38:13 INFO MemoryStore: Block broadcast_39 stored as values in
memory (estimated size 38.3 KB, free 2.5 GB)
18/02/06 16:38:13 INFO MemoryStore: Block broadcast_39_piece0 stored as bytes
in memory (estimated size 30.7 KB, free 2.5 GB)
18/02/06 16:38:13 INFO BlockManagerInfo: Added broadcast_39_piece0 in memory on
192.168.2.160:44339 (size: 30.7 KB, free: 2.5 GB)
18/02/06 16:38:13 INFO SparkContext: Created broadcast 39 from broadcast at
DAGScheduler.scala:996
18/02/06 16:38:13 INFO DAGScheduler: Submitting 1 missing tasks from
ResultStage 31 (NewCarbonDataLoadRDD[101] at RDD at CarbonRDD.scala:33)
18/02/06 16:38:13 INFO TaskSchedulerImpl: Adding task set 31.0 with 1 tasks
18/02/06 16:38:13 INFO TaskSetManager: Starting task 0.0 in stage 31.0 (TID
103, localhost, executor driver, partition 0, ANY, 6880 bytes)
18/02/06 16:38:13 INFO Executor: Running task 0.0 in stage 31.0 (TID 103)
18/02/06 16:38:13 INFO NewCarbonDataLoadRDD: Input split: knoldus
18/02/06 16:38:13 INFO NewCarbonDataLoadRDD: The Block Count in this node :1
18/02/06 16:38:13 INFO SparkPartitionLoader: [Executor task launch
worker-28][partitionID:uniqdata;queryID:82072464882259] Temp location for
loading data: /tmp/carbon82072469095359_0
18/02/06 16:38:13 WARN CarbonDataProcessorUtil: [Executor task launch
worker-28][partitionID:uniqdata;queryID:82072464882259] sort scope is set to
NO_SORT
18/02/06 16:38:13 INFO AbstractDataLoadProcessorStep: Thread-1137 Rows
processed in step Data Writer : 0
18/02/06 16:38:13 INFO AbstractDataLoadProcessorStep: Thread-1138 Rows
processed in step Data Converter : 0
18/02/06 16:38:13 INFO AbstractDataLoadProcessorStep: Thread-1139 Rows
processed in step Input Processor : 0
18/02/06 16:38:13 INFO DataLoadExecutor: [Executor task launch
worker-28][partitionID:uniqdata;queryID:82072464882259] Data Loading is started
for table uniqdata
18/02/06 16:38:13 WARN CarbonDataProcessorUtil: [Executor task launch
worker-28][partitionID:uniqdata;queryID:82072464882259] sort scope is set to
NO_SORT
18/02/06 16:38:13 INFO CarbonFactDataHandlerColumnar: [Executor task launch
worker-28][partitionID:uniqdata;queryID:82072464882259] Initializing writer
executors
18/02/06 16:38:13 INFO CarbonFactDataHandlerColumnar: [Executor task launch
worker-28][partitionID:uniqdata;queryID:82072464882259] Columns considered as
NoInverted Index are cust_id,cust_name,decimal_column2,double_column1,
18/02/06 16:38:13 INFO CarbonFactDataHandlerColumnar: [Executor task launch
worker-28][partitionID:uniqdata;queryID:82072464882259] Number of rows per
column blocklet 32000
18/02/06 16:38:13 INFO AbstractFactDataWriter: [Executor task launch
worker-28][partitionID:uniqdata;queryID:82072464882259] Total file size:
268435456 and dataBlock Size: 241591911
18/02/06 16:38:13 INFO AbstractFactDataWriter: [Executor task launch
worker-28][partitionID:uniqdata;queryID:82072464882259] Randomly choose
factdata temp location: /tmp/carbon82072469095359_0/Fact/Part0/Segment_2/0
18/02/06 16:38:13 ERROR CarbonRowDataWriterProcessorStepImpl: [Executor task
launch worker-28][partitionID:uniqdata;queryID:82072464882259] Failed for
table: uniqdata in DataWriterProcessorStepImpl
org.apache.carbondata.processing.loading.exception.CarbonDataLoadingException:
unable to generate the mdkey
at
org.apache.carbondata.processing.loading.steps.CarbonRowDataWriterProcessorStepImpl.processBatch(CarbonRowDataWriterProcessorStepImpl.java:281)
at
org.apache.carbondata.processing.loading.steps.CarbonRowDataWriterProcessorStepImpl.doExecute(CarbonRowDataWriterProcessorStepImpl.java:167)
at
org.apache.carbondata.processing.loading.steps.CarbonRowDataWriterProcessorStepImpl.execute(CarbonRowDataWriterProcessorStepImpl.java:122)
at
org.apache.carbondata.processing.loading.DataLoadExecutor.execute(DataLoadExecutor.java:51)
at
org.apache.carbondata.spark.rdd.NewCarbonDataLoadRDD$$anon$1.<init>(NewCarbonDataLoadRDD.scala:246)
at
org.apache.carbondata.spark.rdd.NewCarbonDataLoadRDD.internalCompute(NewCarbonDataLoadRDD.scala:221)
at org.apache.carbondata.spark.rdd.CarbonRDD.compute(CarbonRDD.scala:60)
at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:323)
at org.apache.spark.rdd.RDD.iterator(RDD.scala:287)
at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:87)
at org.apache.spark.scheduler.Task.run(Task.scala:99)
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:282)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)