[ 
https://issues.apache.org/jira/browse/HIVE-6013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16480603#comment-16480603
 ] 

Shubham edited comment on HIVE-6013 at 5/18/18 12:49 PM:
---------------------------------------------------------

I am using Hive 0.13.1 and have a Parquet database in which the column names 
contain Unicode characters. Can someone please reply with how to address this 
issue, including the exact process? Any help would be appreciated. Below is 
the query I am using:

CREATE EXTERNAL TABLE teda_in (
  machine_id string, url_idc string, person_id string, ss2k bigint,
  time_id int, domain_name string, url_host string, url_dir string,
  url_page string, url_refer_domain string, url_refer_host string,
  url_refer_dir string, url_refer_page string, mimetype string,
  http_rc int, keywords string, html_title string, pattern_id int)
PARTITIONED BY (dt string)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' LINES TERMINATED BY '\n';
ALTER TABLE teda_in ADD PARTITION (dt='20171002') LOCATION 's3:/data/dt=2017-10-02/';
ALTER TABLE teda_in ADD PARTITION (dt='20171003') LOCATION 's3:/data/dt=2017-10-03/';



was (Author: gurav.shubham):
I am using Hive 0.13.1 and have a Parquet database in which the column names 
contain Unicode characters. Can someone please reply with how to address this 
issue, including the exact process? Any help would be appreciated. Below is 
the query I am using:

CREATE EXTERNAL TABLE teda_in (
  machine_id string, url_idc string, person_id string, ss2k bigint,
  time_id int, domain_name string, url_host string, url_dir string,
  url_page string, url_refer_domain string, url_refer_host string,
  url_refer_dir string, url_refer_page string, mimetype string,
  http_rc int, keywords string, html_title string, pattern_id int)
PARTITIONED BY (dt string)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' LINES TERMINATED BY '\n';
ALTER TABLE teda_in ADD PARTITION (dt='20171002') LOCATION 's3:/data/dt=2017-10-02/';
ALTER TABLE teda_in ADD PARTITION (dt='20171003') LOCATION 's3:/dt=2017-10-03/';


> Supporting Quoted Identifiers in Column Names
> ---------------------------------------------
>
>                 Key: HIVE-6013
>                 URL: https://issues.apache.org/jira/browse/HIVE-6013
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Processor
>            Reporter: Harish Butani
>            Assignee: Harish Butani
>            Priority: Major
>             Fix For: 0.13.0
>
>         Attachments: HIVE-6013.1.patch, HIVE-6013.2.patch, HIVE-6013.3.patch, 
> HIVE-6013.4.patch, HIVE-6013.5.patch, HIVE-6013.6.patch, HIVE-6013.7.patch, 
> QuotedIdentifier.html
>
>
> Hive's current behavior on quoted identifiers differs from the normal 
> interpretation. A quoted identifier (using backticks) has a special 
> interpretation in Select expressions (as a regular expression). I have 
> documented the current behavior and proposed a solution in the attached doc.
> Summary of the solution:
> - Introduce 'standard' quoted identifiers for columns only.
> - At the language level this is turned on by a flag.
> - At the metadata level we relax the constraint on column names.



