[ 
https://issues.apache.org/jira/browse/CARBONDATA-2136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jatin reassigned CARBONDATA-2136:
---------------------------------

    Assignee: Jatin

> Exception displays while loading data with BAD_RECORDS_ACTION = REDIRECT
> ------------------------------------------------------------------------
>
>                 Key: CARBONDATA-2136
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2136
>             Project: CarbonData
>          Issue Type: Bug
>          Components: data-load
>    Affects Versions: 1.3.0
>         Environment: spark 2.1
>            Reporter: Vandana Yadav
>            Assignee: Jatin
>            Priority: Major
>         Attachments: 2000_UniqData.csv
>
>          Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> An exception is thrown while loading data with BAD_RECORDS_ACTION = REDIRECT.
> Steps to reproduce:
> 1) Create the table:
> CREATE TABLE uniqdata(CUST_ID int, CUST_NAME String, ACTIVE_EMUI_VERSION string, DOB timestamp, DOJ timestamp, BIGINT_COLUMN1 bigint, BIGINT_COLUMN2 bigint, DECIMAL_COLUMN1 decimal(30,10), DECIMAL_COLUMN2 decimal(36,10), Double_COLUMN1 double, Double_COLUMN2 double, INTEGER_COLUMN1 int)
> STORED BY 'org.apache.carbondata.format'
> TBLPROPERTIES('DICTIONARY_INCLUDE'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1', "TABLE_BLOCKSIZE"="256 MB", 'SORT_SCOPE'='NO_SORT', 'NO_INVERTED_INDEX'='CUST_ID,CUST_NAME,Double_COLUMN1,DECIMAL_COLUMN2');
> 2) Load the data:
> LOAD DATA INPATH 'hdfs://localhost:54310/Data/uniqdata/2000_UniqData.csv' INTO TABLE uniqdata
> OPTIONS('DELIMITER'=',', 'QUOTECHAR'='"', 'BAD_RECORDS_ACTION'='REDIRECT', 'FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1');
> Expected Result: The data should be loaded successfully.
> Actual Result:
> Error: java.lang.Exception: DataLoad failure: There is an unexpected error: unable to generate the mdkey (state=,code=0)
>  
> 3) ThriftServer logs: 
> 18/02/06 16:38:11 INFO SparkExecuteStatementOperation: Running query 'LOAD 
> DATA INPATH 'hdfs://localhost:54310/Data/uniqdata/2000_UniqData.csv' into 
> table uniqdata OPTIONS('DELIMITER'=',', 
> 'QUOTECHAR'='"','BAD_RECORDS_ACTION'='REDIRECT','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')'
>  with 87eb4af5-e485-4a0b-bcae-6589f1252291
> 18/02/06 16:38:11 INFO CarbonSparkSqlParser: Parsing command: LOAD DATA 
> INPATH 'hdfs://localhost:54310/Data/uniqdata/2000_UniqData.csv' into table 
> uniqdata OPTIONS('DELIMITER'=',', 
> 'QUOTECHAR'='"','BAD_RECORDS_ACTION'='REDIRECT','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')
> 18/02/06 16:38:11 INFO CarbonLateDecodeRule: pool-23-thread-41 skip 
> CarbonOptimizer
> 18/02/06 16:38:11 INFO CarbonLateDecodeRule: pool-23-thread-41 Skip 
> CarbonOptimizer
> 18/02/06 16:38:11 INFO HiveMetaStore: 42: get_table : db=bug tbl=uniqdata
> 18/02/06 16:38:11 INFO audit: ugi=hduser ip=unknown-ip-addr cmd=get_table : 
> db=bug tbl=uniqdata 
> 18/02/06 16:38:11 INFO HiveMetaStore: 42: Opening raw store with 
> implemenation class:org.apache.hadoop.hive.metastore.ObjectStore
> 18/02/06 16:38:11 INFO ObjectStore: ObjectStore, initialize called
> 18/02/06 16:38:11 INFO Query: Reading in results for query 
> "org.datanucleus.store.rdbms.query.SQLQuery@0" since the connection used is 
> closing
> 18/02/06 16:38:11 INFO MetaStoreDirectSql: Using direct SQL, underlying DB is 
> DERBY
> 18/02/06 16:38:11 INFO ObjectStore: Initialized ObjectStore
> 18/02/06 16:38:11 INFO CatalystSqlParser: Parsing command: array<string>
> 18/02/06 16:38:11 INFO CarbonLoadDataCommand: pool-23-thread-41 Deleting 
> stale folders if present for table bug.uniqdata
> 18/02/06 16:38:11 INFO CarbonLoadDataCommand: pool-23-thread-41 Initiating 
> Direct Load for the Table : (bug.uniqdata)
> 18/02/06 16:38:12 INFO HdfsFileLock: pool-23-thread-41 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/tablestatus.lock
> 18/02/06 16:38:12 INFO HdfsFileLock: pool-23-thread-41 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/Segment_1.lock
> 18/02/06 16:38:12 INFO DeleteLoadFolders: pool-23-thread-41 Info: Deleted the 
> load 1
> 18/02/06 16:38:12 INFO DeleteLoadFolders: pool-23-thread-41 Info: Segment 
> lock on segment:1 is released
> 18/02/06 16:38:12 INFO DataLoadingUtil$: pool-23-thread-41 Table status lock 
> has been successfully acquired.
> 18/02/06 16:38:12 INFO HdfsFileLock: pool-23-thread-41 Deleted the lock file 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/tablestatus.lock
> 18/02/06 16:38:12 INFO CarbonLockUtil: pool-23-thread-41 Table status lock 
> has been successfully released
> 18/02/06 16:38:12 WARN DeleteLoadFolders: pool-23-thread-41 Files are not 
> found in segment 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/Fact/Part0/Segment_0
>  it seems, files are already being deleted
> 18/02/06 16:38:12 WARN DeleteLoadFolders: pool-23-thread-41 Files are not 
> found in segment 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/Fact/Part0/Segment_1
>  it seems, files are already being deleted
> 18/02/06 16:38:12 INFO HdfsFileLock: pool-23-thread-41 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/tablestatus.lock
> 18/02/06 16:38:12 INFO CarbonLoaderUtil: pool-23-thread-41 Acquired lock for 
> tablebug.uniqdata for table status updation
> 18/02/06 16:38:12 INFO HdfsFileLock: pool-23-thread-41 Deleted the lock file 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/tablestatus.lock
> 18/02/06 16:38:12 INFO CarbonLoaderUtil: pool-23-thread-41 Table unlocked 
> successfully after table status updationbug.uniqdata
> 18/02/06 16:38:12 INFO GlobalDictionaryUtil$: pool-23-thread-41 Generate 
> global dictionary from source data files!
> 18/02/06 16:38:12 INFO MemoryStore: Block broadcast_36 stored as values in 
> memory (estimated size 293.6 KB, free 2.5 GB)
> 18/02/06 16:38:12 INFO MemoryStore: Block broadcast_36_piece0 stored as bytes 
> in memory (estimated size 24.7 KB, free 2.5 GB)
> 18/02/06 16:38:12 INFO BlockManagerInfo: Added broadcast_36_piece0 in memory 
> on 192.168.2.160:44339 (size: 24.7 KB, free: 2.5 GB)
> 18/02/06 16:38:12 INFO SparkContext: Created broadcast 36 from NewHadoopRDD 
> at GlobalDictionaryUtil.scala:377
> 18/02/06 16:38:12 INFO SparkContext: Starting job: collect at 
> GlobalDictionaryUtil.scala:755
> 18/02/06 16:38:12 INFO FileInputFormat: Total input paths to process : 1
> 18/02/06 16:38:12 INFO DAGScheduler: Registering RDD 98 (RDD at 
> CarbonRDD.scala:33)
> 18/02/06 16:38:12 INFO DAGScheduler: Got job 22 (collect at 
> GlobalDictionaryUtil.scala:755) with 10 output partitions
> 18/02/06 16:38:12 INFO DAGScheduler: Final stage: ResultStage 30 (collect at 
> GlobalDictionaryUtil.scala:755)
> 18/02/06 16:38:12 INFO DAGScheduler: Parents of final stage: 
> List(ShuffleMapStage 29)
> 18/02/06 16:38:12 INFO DAGScheduler: Missing parents: List(ShuffleMapStage 29)
> 18/02/06 16:38:12 INFO DAGScheduler: Submitting ShuffleMapStage 29 
> (CarbonBlockDistinctValuesCombineRDD[98] at RDD at CarbonRDD.scala:33), which 
> has no missing parents
> 18/02/06 16:38:12 INFO MemoryStore: Block broadcast_37 stored as values in 
> memory (estimated size 11.3 KB, free 2.5 GB)
> 18/02/06 16:38:12 INFO MemoryStore: Block broadcast_37_piece0 stored as bytes 
> in memory (estimated size 5.8 KB, free 2.5 GB)
> 18/02/06 16:38:12 INFO BlockManagerInfo: Added broadcast_37_piece0 in memory 
> on 192.168.2.160:44339 (size: 5.8 KB, free: 2.5 GB)
> 18/02/06 16:38:12 INFO SparkContext: Created broadcast 37 from broadcast at 
> DAGScheduler.scala:996
> 18/02/06 16:38:12 INFO DAGScheduler: Submitting 1 missing tasks from 
> ShuffleMapStage 29 (CarbonBlockDistinctValuesCombineRDD[98] at RDD at 
> CarbonRDD.scala:33)
> 18/02/06 16:38:12 INFO TaskSchedulerImpl: Adding task set 29.0 with 1 tasks
> 18/02/06 16:38:12 INFO TaskSetManager: Starting task 0.0 in stage 29.0 (TID 
> 92, localhost, executor driver, partition 0, ANY, 6597 bytes)
> 18/02/06 16:38:12 INFO Executor: Running task 0.0 in stage 29.0 (TID 92)
> 18/02/06 16:38:12 INFO NewHadoopRDD: Input split: 
> hdfs://localhost:54310/Data/uniqdata/2000_UniqData.csv:0+376223
> 18/02/06 16:38:12 INFO Executor: Finished task 0.0 in stage 29.0 (TID 92). 
> 1343 bytes result sent to driver
> 18/02/06 16:38:12 INFO TaskSetManager: Finished task 0.0 in stage 29.0 (TID 
> 92) in 108 ms on localhost (executor driver) (1/1)
> 18/02/06 16:38:12 INFO TaskSchedulerImpl: Removed TaskSet 29.0, whose tasks 
> have all completed, from pool 
> 18/02/06 16:38:12 INFO DAGScheduler: ShuffleMapStage 29 (RDD at 
> CarbonRDD.scala:33) finished in 0.107 s
> 18/02/06 16:38:12 INFO DAGScheduler: looking for newly runnable stages
> 18/02/06 16:38:12 INFO DAGScheduler: running: Set()
> 18/02/06 16:38:12 INFO DAGScheduler: waiting: Set(ResultStage 30)
> 18/02/06 16:38:12 INFO DAGScheduler: failed: Set()
> 18/02/06 16:38:12 INFO DAGScheduler: Submitting ResultStage 30 
> (CarbonGlobalDictionaryGenerateRDD[100] at RDD at CarbonRDD.scala:33), which 
> has no missing parents
> 18/02/06 16:38:12 INFO MemoryStore: Block broadcast_38 stored as values in 
> memory (estimated size 10.7 KB, free 2.5 GB)
> 18/02/06 16:38:12 INFO MemoryStore: Block broadcast_38_piece0 stored as bytes 
> in memory (estimated size 5.4 KB, free 2.5 GB)
> 18/02/06 16:38:12 INFO BlockManagerInfo: Added broadcast_38_piece0 in memory 
> on 192.168.2.160:44339 (size: 5.4 KB, free: 2.5 GB)
> 18/02/06 16:38:12 INFO SparkContext: Created broadcast 38 from broadcast at 
> DAGScheduler.scala:996
> 18/02/06 16:38:12 INFO DAGScheduler: Submitting 10 missing tasks from 
> ResultStage 30 (CarbonGlobalDictionaryGenerateRDD[100] at RDD at 
> CarbonRDD.scala:33)
> 18/02/06 16:38:12 INFO TaskSchedulerImpl: Adding task set 30.0 with 10 tasks
> 18/02/06 16:38:12 INFO TaskSetManager: Starting task 0.0 in stage 30.0 (TID 
> 93, localhost, executor driver, partition 0, ANY, 6314 bytes)
> 18/02/06 16:38:12 INFO TaskSetManager: Starting task 1.0 in stage 30.0 (TID 
> 94, localhost, executor driver, partition 1, ANY, 6314 bytes)
> 18/02/06 16:38:12 INFO TaskSetManager: Starting task 2.0 in stage 30.0 (TID 
> 95, localhost, executor driver, partition 2, ANY, 6314 bytes)
> 18/02/06 16:38:12 INFO TaskSetManager: Starting task 3.0 in stage 30.0 (TID 
> 96, localhost, executor driver, partition 3, ANY, 6314 bytes)
> 18/02/06 16:38:12 INFO Executor: Running task 0.0 in stage 30.0 (TID 93)
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-26 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/8c24301c-143c-47f9-8bc7-4bf122787b70.lock
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty 
> blocks out of 1 blocks
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches 
> in 0 ms
> 18/02/06 16:38:12 INFO Executor: Running task 1.0 in stage 30.0 (TID 94)
> 18/02/06 16:38:12 INFO Executor: Running task 2.0 in stage 30.0 (TID 95)
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-28 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/fa04704a-f022-4d7e-b8e9-078357d50e84.lock
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty 
> blocks out of 1 blocks
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches 
> in 0 ms
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-27 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/c65a6129-0e3d-4686-8176-b82c200e9e81.lock
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty 
> blocks out of 1 blocks
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches 
> in 0 ms
> 18/02/06 16:38:12 INFO Executor: Running task 3.0 in stage 30.0 (TID 96)
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-29 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/e1872e3d-3c67-4cc5-9356-2e6c614725db.lock
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty 
> blocks out of 1 blocks
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches 
> in 0 ms
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able 
> to get the dictionary lock for cust_id
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task 
> launch worker-26 
>  columnName: cust_id
>  columnId: 8c24301c-143c-47f9-8bc7-4bf122787b70
>  new distinct values count: 0
>  combine lists: 2
>  create dictionary cache: 1
>  sort list, distinct and write: 1
>  write sort info: 0
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able 
> to get the dictionary lock for cust_name
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able 
> to get the dictionary lock for active_emui_version
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able 
> to get the dictionary lock for bigint_column1
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task 
> launch worker-29 
>  columnName: bigint_column1
>  columnId: e1872e3d-3c67-4cc5-9356-2e6c614725db
>  new distinct values count: 0
>  combine lists: 4
>  create dictionary cache: 2
>  sort list, distinct and write: 1
>  write sort info: 0
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task 
> launch worker-28 
>  columnName: active_emui_version
>  columnId: fa04704a-f022-4d7e-b8e9-078357d50e84
>  new distinct values count: 0
>  combine lists: 3
>  create dictionary cache: 4
>  sort list, distinct and write: 1
>  write sort info: 0
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task 
> launch worker-27 
>  columnName: cust_name
>  columnId: c65a6129-0e3d-4686-8176-b82c200e9e81
>  new distinct values count: 0
>  combine lists: 2
>  create dictionary cache: 6
>  sort list, distinct and write: 1
>  write sort info: 0
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-26 Deleted 
> the lock file 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/8c24301c-143c-47f9-8bc7-4bf122787b70.lock
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary cust_id 
> Unlocked Successfully.
> 18/02/06 16:38:12 INFO Executor: Finished task 0.0 in stage 30.0 (TID 93). 
> 1728 bytes result sent to driver
> 18/02/06 16:38:12 INFO TaskSetManager: Starting task 4.0 in stage 30.0 (TID 
> 97, localhost, executor driver, partition 4, ANY, 6314 bytes)
> 18/02/06 16:38:12 INFO TaskSetManager: Finished task 0.0 in stage 30.0 (TID 
> 93) in 123 ms on localhost (executor driver) (1/10)
> 18/02/06 16:38:12 INFO Executor: Running task 4.0 in stage 30.0 (TID 97)
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-26 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/4108c641-a578-4415-b3ed-6ccb1a587d9b.lock
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty 
> blocks out of 1 blocks
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches 
> in 0 ms
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-27 Deleted 
> the lock file 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/c65a6129-0e3d-4686-8176-b82c200e9e81.lock
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-28 Deleted 
> the lock file 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/fa04704a-f022-4d7e-b8e9-078357d50e84.lock
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary 
> cust_name Unlocked Successfully.
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary 
> active_emui_version Unlocked Successfully.
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-29 Deleted 
> the lock file 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/e1872e3d-3c67-4cc5-9356-2e6c614725db.lock
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary 
> bigint_column1 Unlocked Successfully.
> 18/02/06 16:38:12 INFO Executor: Finished task 2.0 in stage 30.0 (TID 95). 
> 1728 bytes result sent to driver
> 18/02/06 16:38:12 INFO Executor: Finished task 1.0 in stage 30.0 (TID 94). 
> 1728 bytes result sent to driver
> 18/02/06 16:38:12 INFO Executor: Finished task 3.0 in stage 30.0 (TID 96). 
> 1728 bytes result sent to driver
> 18/02/06 16:38:12 INFO TaskSetManager: Starting task 5.0 in stage 30.0 (TID 
> 98, localhost, executor driver, partition 5, ANY, 6314 bytes)
> 18/02/06 16:38:12 INFO Executor: Running task 5.0 in stage 30.0 (TID 98)
> 18/02/06 16:38:12 INFO TaskSetManager: Starting task 6.0 in stage 30.0 (TID 
> 99, localhost, executor driver, partition 6, ANY, 6314 bytes)
> 18/02/06 16:38:12 INFO Executor: Running task 6.0 in stage 30.0 (TID 99)
> 18/02/06 16:38:12 INFO TaskSetManager: Starting task 7.0 in stage 30.0 (TID 
> 100, localhost, executor driver, partition 7, ANY, 6314 bytes)
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-29 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/34778415-43d0-4d19-917b-52d079c9284f.lock
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-27 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/08a717d5-4a69-4fde-a14f-5f3bd762c149.lock
> 18/02/06 16:38:12 INFO TaskSetManager: Finished task 2.0 in stage 30.0 (TID 
> 95) in 135 ms on localhost (executor driver) (2/10)
> 18/02/06 16:38:12 INFO TaskSetManager: Finished task 1.0 in stage 30.0 (TID 
> 94) in 136 ms on localhost (executor driver) (3/10)
> 18/02/06 16:38:12 INFO TaskSetManager: Finished task 3.0 in stage 30.0 (TID 
> 96) in 136 ms on localhost (executor driver) (4/10)
> 18/02/06 16:38:12 INFO Executor: Running task 7.0 in stage 30.0 (TID 100)
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty 
> blocks out of 1 blocks
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches 
> in 0 ms
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty 
> blocks out of 1 blocks
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches 
> in 0 ms
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-28 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/67332d44-9548-473d-b171-ea410122f773.lock
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty 
> blocks out of 1 blocks
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches 
> in 0 ms
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able 
> to get the dictionary lock for bigint_column2
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task 
> launch worker-26 
>  columnName: bigint_column2
>  columnId: 4108c641-a578-4415-b3ed-6ccb1a587d9b
>  new distinct values count: 0
>  combine lists: 1
>  create dictionary cache: 1
>  sort list, distinct and write: 2
>  write sort info: 0
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able 
> to get the dictionary lock for decimal_column1
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able 
> to get the dictionary lock for double_column1
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able 
> to get the dictionary lock for decimal_column2
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task 
> launch worker-27 
>  columnName: decimal_column2
>  columnId: 08a717d5-4a69-4fde-a14f-5f3bd762c149
>  new distinct values count: 0
>  combine lists: 3
>  create dictionary cache: 2
>  sort list, distinct and write: 6
>  write sort info: 0
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task 
> launch worker-29 
>  columnName: decimal_column1
>  columnId: 34778415-43d0-4d19-917b-52d079c9284f
>  new distinct values count: 0
>  combine lists: 3
>  create dictionary cache: 2
>  sort list, distinct and write: 6
>  write sort info: 0
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task 
> launch worker-28 
>  columnName: double_column1
>  columnId: 67332d44-9548-473d-b171-ea410122f773
>  new distinct values count: 0
>  combine lists: 1
>  create dictionary cache: 8
>  sort list, distinct and write: 0
>  write sort info: 0
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-26 Deleted 
> the lock file 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/4108c641-a578-4415-b3ed-6ccb1a587d9b.lock
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary 
> bigint_column2 Unlocked Successfully.
> 18/02/06 16:38:12 INFO Executor: Finished task 4.0 in stage 30.0 (TID 97). 
> 1801 bytes result sent to driver
> 18/02/06 16:38:12 INFO TaskSetManager: Starting task 8.0 in stage 30.0 (TID 
> 101, localhost, executor driver, partition 8, ANY, 6314 bytes)
> 18/02/06 16:38:12 INFO TaskSetManager: Finished task 4.0 in stage 30.0 (TID 
> 97) in 118 ms on localhost (executor driver) (5/10)
> 18/02/06 16:38:12 INFO Executor: Running task 8.0 in stage 30.0 (TID 101)
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-26 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/8b0db910-ca20-4a95-9f9c-fe94fa460b27.lock
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty 
> blocks out of 1 blocks
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches 
> in 0 ms
> 18/02/06 16:38:12 INFO BlockManagerInfo: Removed broadcast_33_piece0 on 
> 192.168.2.160:44339 in memory (size: 5.8 KB, free: 2.5 GB)
> 18/02/06 16:38:12 INFO BlockManagerInfo: Removed broadcast_34_piece0 on 
> 192.168.2.160:44339 in memory (size: 5.4 KB, free: 2.5 GB)
> 18/02/06 16:38:12 INFO BlockManagerInfo: Removed broadcast_35_piece0 on 
> 192.168.2.160:44339 in memory (size: 30.7 KB, free: 2.5 GB)
> 18/02/06 16:38:12 INFO BlockManagerInfo: Removed broadcast_37_piece0 on 
> 192.168.2.160:44339 in memory (size: 5.8 KB, free: 2.5 GB)
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-28 Deleted 
> the lock file 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/67332d44-9548-473d-b171-ea410122f773.lock
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary 
> double_column1 Unlocked Successfully.
> 18/02/06 16:38:12 INFO Executor: Finished task 7.0 in stage 30.0 (TID 100). 
> 1801 bytes result sent to driver
> 18/02/06 16:38:12 INFO BlockManagerInfo: Removed broadcast_31_piece0 on 
> 192.168.2.160:44339 in memory (size: 30.6 KB, free: 2.5 GB)
> 18/02/06 16:38:12 INFO TaskSetManager: Starting task 9.0 in stage 30.0 (TID 
> 102, localhost, executor driver, partition 9, ANY, 6314 bytes)
> 18/02/06 16:38:12 INFO Executor: Running task 9.0 in stage 30.0 (TID 102)
> 18/02/06 16:38:12 INFO TaskSetManager: Finished task 7.0 in stage 30.0 (TID 
> 100) in 123 ms on localhost (executor driver) (6/10)
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-28 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/a5d84076-9ba2-4c63-88cc-6881452eccfb.lock
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty 
> blocks out of 1 blocks
> 18/02/06 16:38:12 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches 
> in 0 ms
> 18/02/06 16:38:12 INFO BlockManagerInfo: Removed broadcast_32_piece0 on 
> 192.168.2.160:44339 in memory (size: 24.7 KB, free: 2.5 GB)
> 18/02/06 16:38:12 INFO ContextCleaner: Cleaned shuffle 6
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-27 Deleted 
> the lock file 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/08a717d5-4a69-4fde-a14f-5f3bd762c149.lock
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary 
> decimal_column2 Unlocked Successfully.
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-29 Deleted 
> the lock file 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/34778415-43d0-4d19-917b-52d079c9284f.lock
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary 
> decimal_column1 Unlocked Successfully.
> 18/02/06 16:38:12 INFO Executor: Finished task 5.0 in stage 30.0 (TID 98). 
> 1801 bytes result sent to driver
> 18/02/06 16:38:12 INFO Executor: Finished task 6.0 in stage 30.0 (TID 99). 
> 1801 bytes result sent to driver
> 18/02/06 16:38:12 INFO TaskSetManager: Finished task 5.0 in stage 30.0 (TID 
> 98) in 133 ms on localhost (executor driver) (7/10)
> 18/02/06 16:38:12 INFO TaskSetManager: Finished task 6.0 in stage 30.0 (TID 
> 99) in 133 ms on localhost (executor driver) (8/10)
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able 
> to get the dictionary lock for double_column2
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task 
> launch worker-26 
>  columnName: double_column2
>  columnId: 8b0db910-ca20-4a95-9f9c-fe94fa460b27
>  new distinct values count: 0
>  combine lists: 0
>  create dictionary cache: 2
>  sort list, distinct and write: 0
>  write sort info: 0
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Successfully able 
> to get the dictionary lock for integer_column1
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Executor task 
> launch worker-28 
>  columnName: integer_column1
>  columnId: a5d84076-9ba2-4c63-88cc-6881452eccfb
>  new distinct values count: 0
>  combine lists: 2
>  create dictionary cache: 2
>  sort list, distinct and write: 2
>  write sort info: 0
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-26 Deleted 
> the lock file 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/8b0db910-ca20-4a95-9f9c-fe94fa460b27.lock
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary 
> double_column2 Unlocked Successfully.
> 18/02/06 16:38:12 INFO Executor: Finished task 8.0 in stage 30.0 (TID 101). 
> 1728 bytes result sent to driver
> 18/02/06 16:38:12 INFO TaskSetManager: Finished task 8.0 in stage 30.0 (TID 
> 101) in 204 ms on localhost (executor driver) (9/10)
> 18/02/06 16:38:12 INFO HdfsFileLock: Executor task launch worker-28 Deleted 
> the lock file 
> hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/a5d84076-9ba2-4c63-88cc-6881452eccfb.lock
> 18/02/06 16:38:12 INFO CarbonGlobalDictionaryGenerateRDD: Dictionary 
> integer_column1 Unlocked Successfully.
> 18/02/06 16:38:12 INFO Executor: Finished task 9.0 in stage 30.0 (TID 102). 
> 1728 bytes result sent to driver
> 18/02/06 16:38:12 INFO TaskSetManager: Finished task 9.0 in stage 30.0 (TID 
> 102) in 200 ms on localhost (executor driver) (10/10)
> 18/02/06 16:38:12 INFO TaskSchedulerImpl: Removed TaskSet 30.0, whose tasks 
> have all completed, from pool 
> 18/02/06 16:38:12 INFO DAGScheduler: ResultStage 30 (collect at 
> GlobalDictionaryUtil.scala:755) finished in 0.457 s
> 18/02/06 16:38:12 INFO DAGScheduler: Job 22 finished: collect at 
> GlobalDictionaryUtil.scala:755, took 0.601328 s
> 18/02/06 16:38:12 INFO GlobalDictionaryUtil$: pool-23-thread-41 generate 
> global dictionary successfully
> 18/02/06 16:38:12 AUDIT CarbonDataRDDFactory$: 
> [knoldus][hduser][Thread-1336]Data load request has been received for table 
> bug.uniqdata
> 18/02/06 16:38:13 WARN CarbonDataProcessorUtil: pool-23-thread-41 sort scope 
> is set to NO_SORT
> 18/02/06 16:38:13 INFO HdfsFileLock: pool-23-thread-41 HDFS lock 
> path:hdfs://localhost:54310/opt/prestocarbonStore/bug/uniqdata/Segment_2.lock
> 18/02/06 16:38:13 INFO CommonUtil$: pool-23-thread-41 [Block Distribution]
> 18/02/06 16:38:13 INFO CommonUtil$: pool-23-thread-41 
> totalInputSpaceConsumed: 376223 , defaultParallelism: 4
> 18/02/06 16:38:13 INFO CommonUtil$: pool-23-thread-41 
> mapreduce.input.fileinputformat.split.maxsize: 16777216
> 18/02/06 16:38:13 INFO FileInputFormat: Total input paths to process : 1
> 18/02/06 16:38:13 INFO DistributionUtil$: pool-23-thread-41 Executors 
> configured : 1
> 18/02/06 16:38:13 INFO DistributionUtil$: pool-23-thread-41 Total Time taken 
> to ensure the required executors : 1
> 18/02/06 16:38:13 INFO DistributionUtil$: pool-23-thread-41 Time elapsed to 
> allocate the required executors: 0
> 18/02/06 16:38:13 INFO CarbonDataRDDFactory$: pool-23-thread-41 Total Time 
> taken in block allocation: 1
> 18/02/06 16:38:13 INFO CarbonDataRDDFactory$: pool-23-thread-41 Total no of 
> blocks: 1, No.of Nodes: 1
> 18/02/06 16:38:13 INFO CarbonDataRDDFactory$: pool-23-thread-41 #Node: 
> knoldus no.of.blocks: 1
> 18/02/06 16:38:13 INFO SparkContext: Starting job: collect at 
> CarbonDataRDDFactory.scala:1092
> 18/02/06 16:38:13 INFO DAGScheduler: Got job 23 (collect at 
> CarbonDataRDDFactory.scala:1092) with 1 output partitions
> 18/02/06 16:38:13 INFO DAGScheduler: Final stage: ResultStage 31 (collect at 
> CarbonDataRDDFactory.scala:1092)
> 18/02/06 16:38:13 INFO DAGScheduler: Parents of final stage: List()
> 18/02/06 16:38:13 INFO DAGScheduler: Missing parents: List()
> 18/02/06 16:38:13 INFO DAGScheduler: Submitting ResultStage 31 
> (NewCarbonDataLoadRDD[101] at RDD at CarbonRDD.scala:33), which has no 
> missing parents
> 18/02/06 16:38:13 INFO NewCarbonDataLoadRDD: Preferred Location for split : 
> knoldus
> 18/02/06 16:38:13 INFO MemoryStore: Block broadcast_39 stored as values in 
> memory (estimated size 38.3 KB, free 2.5 GB)
> 18/02/06 16:38:13 INFO MemoryStore: Block broadcast_39_piece0 stored as bytes 
> in memory (estimated size 30.7 KB, free 2.5 GB)
> 18/02/06 16:38:13 INFO BlockManagerInfo: Added broadcast_39_piece0 in memory 
> on 192.168.2.160:44339 (size: 30.7 KB, free: 2.5 GB)
> 18/02/06 16:38:13 INFO SparkContext: Created broadcast 39 from broadcast at 
> DAGScheduler.scala:996
> 18/02/06 16:38:13 INFO DAGScheduler: Submitting 1 missing tasks from 
> ResultStage 31 (NewCarbonDataLoadRDD[101] at RDD at CarbonRDD.scala:33)
> 18/02/06 16:38:13 INFO TaskSchedulerImpl: Adding task set 31.0 with 1 tasks
> 18/02/06 16:38:13 INFO TaskSetManager: Starting task 0.0 in stage 31.0 (TID 
> 103, localhost, executor driver, partition 0, ANY, 6880 bytes)
> 18/02/06 16:38:13 INFO Executor: Running task 0.0 in stage 31.0 (TID 103)
> 18/02/06 16:38:13 INFO NewCarbonDataLoadRDD: Input split: knoldus
> 18/02/06 16:38:13 INFO NewCarbonDataLoadRDD: The Block Count in this node :1
> 18/02/06 16:38:13 INFO SparkPartitionLoader: [Executor task launch 
> worker-28][partitionID:uniqdata;queryID:82072464882259] Temp location for 
> loading data: /tmp/carbon82072469095359_0
> 18/02/06 16:38:13 WARN CarbonDataProcessorUtil: [Executor task launch 
> worker-28][partitionID:uniqdata;queryID:82072464882259] sort scope is set to 
> NO_SORT
> 18/02/06 16:38:13 INFO AbstractDataLoadProcessorStep: Thread-1137 Rows 
> processed in step Data Writer : 0
> 18/02/06 16:38:13 INFO AbstractDataLoadProcessorStep: Thread-1138 Rows 
> processed in step Data Converter : 0
> 18/02/06 16:38:13 INFO AbstractDataLoadProcessorStep: Thread-1139 Rows 
> processed in step Input Processor : 0
> 18/02/06 16:38:13 INFO DataLoadExecutor: [Executor task launch 
> worker-28][partitionID:uniqdata;queryID:82072464882259] Data Loading is 
> started for table uniqdata
> 18/02/06 16:38:13 WARN CarbonDataProcessorUtil: [Executor task launch 
> worker-28][partitionID:uniqdata;queryID:82072464882259] sort scope is set to 
> NO_SORT
> 18/02/06 16:38:13 INFO CarbonFactDataHandlerColumnar: [Executor task launch 
> worker-28][partitionID:uniqdata;queryID:82072464882259] Initializing writer 
> executors
> 18/02/06 16:38:13 INFO CarbonFactDataHandlerColumnar: [Executor task launch 
> worker-28][partitionID:uniqdata;queryID:82072464882259] Columns considered as 
> NoInverted Index are cust_id,cust_name,decimal_column2,double_column1,
> 18/02/06 16:38:13 INFO CarbonFactDataHandlerColumnar: [Executor task launch 
> worker-28][partitionID:uniqdata;queryID:82072464882259] Number of rows per 
> column blocklet 32000
> 18/02/06 16:38:13 INFO AbstractFactDataWriter: [Executor task launch 
> worker-28][partitionID:uniqdata;queryID:82072464882259] Total file size: 
> 268435456 and dataBlock Size: 241591911
> 18/02/06 16:38:13 INFO AbstractFactDataWriter: [Executor task launch 
> worker-28][partitionID:uniqdata;queryID:82072464882259] Randomly choose 
> factdata temp location: /tmp/carbon82072469095359_0/Fact/Part0/Segment_2/0
> 18/02/06 16:38:13 ERROR CarbonRowDataWriterProcessorStepImpl: [Executor task launch worker-28][partitionID:uniqdata;queryID:82072464882259] Failed for table: uniqdata in DataWriterProcessorStepImpl
> org.apache.carbondata.processing.loading.exception.CarbonDataLoadingException: unable to generate the mdkey
>  at org.apache.carbondata.processing.loading.steps.CarbonRowDataWriterProcessorStepImpl.processBatch(CarbonRowDataWriterProcessorStepImpl.java:281)
>  at org.apache.carbondata.processing.loading.steps.CarbonRowDataWriterProcessorStepImpl.doExecute(CarbonRowDataWriterProcessorStepImpl.java:167)
>  at org.apache.carbondata.processing.loading.steps.CarbonRowDataWriterProcessorStepImpl.execute(CarbonRowDataWriterProcessorStepImpl.java:122)
>  at org.apache.carbondata.processing.loading.DataLoadExecutor.execute(DataLoadExecutor.java:51)
>  at org.apache.carbondata.spark.rdd.NewCarbonDataLoadRDD$$anon$1.<init>(NewCarbonDataLoadRDD.scala:246)
>  at org.apache.carbondata.spark.rdd.NewCarbonDataLoadRDD.internalCompute(NewCarbonDataLoadRDD.scala:221)
>  at org.apache.carbondata.spark.rdd.CarbonRDD.compute(CarbonRDD.scala:60)
>  at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:323)
>  at org.apache.spark.rdd.RDD.iterator(RDD.scala:287)
>  at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:87)
>  at org.apache.spark.scheduler.Task.run(Task.scala:99)
>  at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:282)
>  at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>  at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>  at java.lang.Thread.run(Thread.java:745)
>  


