[ 
https://issues.apache.org/jira/browse/HIVE-820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12753654#action_12753654
 ] 

Edward Capriolo commented on HIVE-820:
--------------------------------------

Matt,

It is a tough call. 

I believe you can issue
{noformat}
 FIELDS TERMINATED BY ',' " +
{noformat}

As well as 
{noformat}
 FIELDS TERMINATED BY '\054' " +
{noformat}

in a create table statement. It is stored as its ascii/unicode value. Hive will 
restrict delimiters over ascii 128 I believe. 

If someone is issuing 'show tables' showing them '054' as opposed to a ',' 
might be a pain, as they will need an ascii table to figure out what the 
delimiter is'.  However I think showing them the octal/hex/decimal is the best 
way as the ouput is consistent. 

We also can do <LF> but we should do replacements for all non-printable 
characters. 


I think 054 is fine, but maybe someone wants to chime in and speak about what 
the delimiters could be down the road. Guys?

> Describe Extended Line Breaks When Delimiter is \n
> --------------------------------------------------
>
>                 Key: HIVE-820
>                 URL: https://issues.apache.org/jira/browse/HIVE-820
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.2.0, 0.3.0, 0.3.1, 0.3.2, 0.4.0, 0.5.0
>            Reporter: Matt Pestritto
>            Assignee: Matt Pestritto
>            Priority: Minor
>             Fix For: 0.5.0
>
>         Attachments: hive_820.patch
>
>
> Tables defined delimited with \t and breaks using \n has output of describe 
> extended that is not contiguous.
> Line.delim outputs an actual \n which breaks the display output so using the 
> hiveservice you have to do another FetchOne to get the rest of the line.
> For example.
> Original Output:
> Detailed Table Information    Table(tableName:cobra_merchandise, 
> dbName:default, owner:hive, createTime:1248726291, lastAccessTime:0, 
> retention:0, sd:StorageDescriptor(cols:[FieldSchema(name:merchandise_tid, 
> type:string, comment:null), FieldSchema(name:client_merch_type_tid, 
> type:string, comment:null), FieldSchema(name:description, type:string, 
> comment:null), FieldSchema(name:client_description, type:string, 
> comment:null), FieldSchema(name:price, type:string, comment:null), 
> FieldSchema(name:cost, type:string, comment:null), 
> FieldSchema(name:start_date, type:string, comment:null), 
> FieldSchema(name:end_date, type:string, comment:null)], 
> location:hdfs://mustique:9000/user/hive/warehouse/m, 
> inputFormat:org.apache.hadoop.mapred.TextInputFormat, 
> outputFormat:org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat, 
> compressed:false, numBuckets:-1, serdeInfo:SerDeInfo(name:null, 
> serializationLib:org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, 
> parameters:{serialization.format=9,line.delim=
> ,field.delim=    }), bucketCols:[], sortCols:[], parameters:{}), 
> partitionKeys:[FieldSchema(name:client_tid, type:int, comment:null)], 
> parameters:{})   
> Proposed Output:
> Detailed Table Information    Table(tableName:cobra_merchandise, 
> dbName:default, owner:hive, createTime:1248726291, lastAccessTime:0, 
> retention:0, sd:StorageDescriptor(cols:[FieldSchema(name:merchandise_tid, 
> type:string, comment:null), FieldSchema(name:client_merch_type_tid, 
> type:string, comment:null), FieldSchema(name:description, type:string, 
> comment:null), FieldSchema(name:client_description, type:string, 
> comment:null), FieldSchema(name:price, type:string, comment:null), 
> FieldSchema(name:cost, type:string, comment:null), 
> FieldSchema(name:start_date, type:string, comment:null), 
> FieldSchema(name:end_date, type:string, comment:null)], 
> location:hdfs://mustique:9000/user/hive/warehouse/m, 
> inputFormat:org.apache.hadoop.mapred.TextInputFormat, 
> outputFormat:org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat, 
> compressed:false, numBuckets:-1, serdeInfo:SerDeInfo(name:null, 
> serializationLib:org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, 
> parameters:{serialization.format=9,line.delim=<LF>,field.delim=<TAB>}), 
> bucketCols:[], sortCols:[], parameters:{}), 
> partitionKeys:[FieldSchema(name:client_tid, type:int, comment:null)], 
> parameters:{})   

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to