Priyal Sachdeva created CARBONDATA-660:
------------------------------------------
Summary: Bad Records Logs and Raw CSVs should get display under
segment id instead of Tasks id
Key: CARBONDATA-660
URL: https://issues.apache.org/jira/browse/CARBONDATA-660
Project: CarbonData
Issue Type: Improvement
Components: data-load
Reporter: Priyal Sachdeva
Priority: Minor
create table if not exists Badrecords_test (imei string,AMSize int) STORED BY
'org.apache.carbondata.format';
LOAD DATA INPATH 'hdfs://hacluster/CSVs/bad_records.csv' into table
Badrecords_test OPTIONS('DELIMITER'=',' ,
'QUOTECHAR'='"','BAD_RECORDS_LOGGER_ENABLE'='TRUE',
'BAD_RECORDS_ACTION'='REDIRECT','FILEHEADER'='imei,AMSize');
Bad Records Logs and raw csvs are getting display under Task ID
linux-61:/srv/OSCON/BigData/HACluster/install/hadoop/datanode #
bin/hadoop fs -ls /tmp/carbon/default/badrecords_test
drwxr-xr-x - root users 0 2017-01-18 21:08
/tmp/carbon/default/badrecords_test/0--------------------------->Task ID
0: jdbc:hive2://172.168.100.205:23040> show segments for table Badrecords_test;
+--------------------+------------------+--------------------------+--------------------------+--+
| SegmentSequenceId | Status | Load Start Time | Load
End Time |
+--------------------+------------------+--------------------------+--------------------------+--+
| 8 | Partial Success | 2017-01-18 21:12:58.018 | 2017-01-18
21:12:59.652 |
| 7 | Partial Success | 2017-01-18 21:08:07.426 | 2017-01-18
21:08:11.791 |
| 6 | Partial Success | 2017-01-18 21:07:07.645 | 2017-01-18
21:07:08.747 |
| 5 | Partial Success | 2017-01-18 19:34:16.163 | 2017-01-18
19:34:18.163 |
| 4 | Partial Success | 2017-01-18 19:34:13.669 | 2017-01-18
19:34:15.811 |
| 3 | Partial Success | 2017-01-18 19:30:18.753 | 2017-01-18
19:30:19.644 |
| 2 | Partial Success | 2017-01-18 19:30:13.508 | 2017-01-18
19:30:15.578 |
| 1 | Partial Success | 2017-01-18 19:18:54.787 | 2017-01-18
19:18:54.94 |
| 0 | Partial Success | 2017-01-18 19:18:53.741 | 2017-01-18
19:18:54.614 |
+--------------------+------------------+--------------------------+--------------------------+--+
Bad Records Logs and raw csvs are getting display under Task ID. It would be
good to have the information of bad records as per the load i.e under segment
id..
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)