Vandana Yadav created CARBONDATA-1013:
-----------------------------------------

             Summary: Unexpected characters displays in results while using 
join query.
                 Key: CARBONDATA-1013
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-1013
             Project: CarbonData
          Issue Type: Bug
          Components: data-query
    Affects Versions: 1.1.0
         Environment: spark 2.1
            Reporter: Vandana Yadav
            Priority: Minor
         Attachments: customer_C1.csv, payment_C1.csv, unwanted characters.png

Unexpected characters displays in result-set while using join query.

Steps to reproduce:

1) Create tables:
a) In carbondata:
table 1 : create table Comp_TABLE_ONE_JOIN (customer_uid String,customer_id 
int, gender String, first_name String, middle_name String, last_name 
String,customer_address String, country String) STORED BY 
'org.apache.carbondata.format' 
TBLPROPERTIES('DICTIONARY_EXCLUDE'='customer_uid')

table 2: create table Comp_TABLE_TWO_JOIN (customer_payment_id 
String,customer_id int,payment_amount Decimal(15,5), payment_mode 
String,payment_details String) STORED BY 'org.apache.carbondata.format' 
TBLPROPERTIES('DICTIONARY_EXCLUDE'='customer_payment_id')

b) In hive:
table 1: create table Comp_TABLE_ONE_JOIN_h (customer_uid String,customer_id 
int, gender String, first_name String, middle_name String, last_name 
String,customer_address String, country String)ROW FORMAT DELIMITED FIELDS 
TERMINATED BY ",";

table 2:create table Comp_TABLE_TWO_JOIN_h (customer_payment_id 
String,customer_id int,payment_amount Decimal(15,5), payment_mode 
String,payment_details String) ROW FORMAT DELIMITED FIELDS TERMINATED BY ",";

2) Load Data :
a) In Carbondata:
table 1 : LOAD DATA INPATH 'HDFS_URL/BabuStore/Data/customer_C1.csv' INTO table 
Comp_TABLE_ONE_JOIN options 
('DELIMITER'=',','QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='customer_uid,customer_id,gender,first_name,middle_name,last_name,customer_address,country')

table 2:LOAD DATA INPATH 'HDFS_URL/BabuStore/Data/payment_C1.csv' INTO table 
Comp_TABLE_TWO_JOIN options 
('DELIMITER'=',','QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='customer_payment_id
 ,customer_id,payment_amount ,payment_mode, payment_details')


b) In hive:
table 1: LOAD DATA LOCAL INPATH 
'/home/knoldus/Desktop/csv/TestData2/Data/customer_C1.csv' INTO table 
Comp_TABLE_ONE_JOIN_h;

table 2: LOAD DATA LOCAL INPATH 
'/home/knoldus/Desktop/csv/TestData2/Data/payment_C1.csv' INTO table 
Comp_TABLE_TWO_JOIN_h;

3)Execute query:
select * from Comp_TABLE_ONE_JOIN join Comp_TABLE_TWO_JOIN on 
Comp_TABLE_ONE_JOIN.customer_id=Comp_TABLE_TWO_JOIN.customer_id limit  2;

Actual Result:
a) In Carbondata:
+---------------+--------------+---------+-------------+--------------+------------+-------------------+------------+-----------------------+--------------+-----------------+---------------+-----------------------------+--+
| customer_uid  | customer_id  | gender  | first_name  | middle_name  | 
last_name  | customer_address  |  country   |  customer_payment_id  | 
customer_id  | payment_amount  | payment_mode  |       payment_details       |
+---------------+--------------+---------+-------------+--------------+------------+-------------------+------------+-----------------------+--------------+-----------------+---------------+-----------------------------+--+
| UID31a31      | 31           | female  | fname31     | mname31      | lname31 
   | address 31        | country31  | Cust_payment_ID31a31  | 31           | 
193288.72000    |           |p  |
| UID31a31      | 31           | female  | fname31     | mname31      | lname31 
   | address 31        | country31  | Cust_payment_ID31a31  | 31           | 
193288.72000    |           |p  |
+---------------+--------------+---------+-------------+--------------+------------+-------------------+------------+-----------------------+--------------+-----------------+---------------+-----------------------------+--+
2 rows selected (0.499 seconds)

b) In Hive

---------------+--------------+---------+-------------+--------------+------------+-------------------+-----------+----------------------+--------------+-----------------+---------------+---------------------------------+--+
| customer_uid  | customer_id  | gender  | first_name  | middle_name  | 
last_name  | customer_address  |  country  | customer_payment_id  | customer_id 
 | payment_amount  | payment_mode  |         payment_details         |
+---------------+--------------+---------+-------------+--------------+------------+-------------------+-----------+----------------------+--------------+-----------------+---------------+---------------------------------+--+
| UID1a1        | 1            | female  | fname1      | mname1       | lname1  
   | address 1         | country1  | Cust_payment_ID1a1   | 1            | 
6235.12000      | debit         | details of the payment for : 1  |
| UID1a1        | 1            | female  | fname1      | mname1       | lname1  
   | address 1         | country1  | Cust_payment_ID1a1   | 1            | 
6235.12000      | debit         | details of the payment for : 1  |
+---------------+--------------+---------+-------------+--------------+------------+-------------------+-----------+----------------------+--------------+-----------------+---------------+---------------------------------+--+
2 rows selected (0.224 seconds)


Expected result: unwanted characters should not be displayed.




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to