[jira] Commented: (HIVE-634) ctrl-A is the only output delimiter used, regardless of the Hive table structure

Neal Richter (JIRA) Wed, 22 Jul 2009 08:09:38 -0700

    [ 
https://issues.apache.org/jira/browse/HIVE-634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12734147#action_12734147
 ]


Neal Richter commented on HIVE-634:
-----------------------------------

Note that both methods seem to use '\N' in the output files instead of 'NULL'.  

It would be better for the export to use 'NULL' for full compatibility with 
MySQL and PostgreSQL.  I tested '\N' and it was not recognized as NULL when 
loading the file with MySQL's LOAD DATA INFILE.  This is easy enough to fix 
with sed, at the cost of another pass through the data.
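
For illustration, here is a minimal sketch of that extra pass in Python rather 
than sed.  It assumes the export is ctrl-A ('\001') delimited text with NULLs 
written as '\N', as described in this issue.  The names convert_line and 
convert_lines are hypothetical helpers, not part of Hive, and the tab output 
delimiter is just an example choice.

```python
CTRL_A = "\x01"    # field delimiter Hive writes to the output directory
HIVE_NULL = "\\N"  # the two characters backslash + N that Hive writes for NULL


def convert_line(line, out_delim="\t", null_token="NULL"):
    """Convert one ctrl-A delimited line to out_delim, mapping \\N to null_token."""
    fields = line.rstrip("\n").split(CTRL_A)
    # Only a field that is exactly \N is treated as NULL; a field that
    # merely contains those characters is left untouched.
    fields = [null_token if f == HIVE_NULL else f for f in fields]
    return out_delim.join(fields)


def convert_lines(lines, out_delim="\t", null_token="NULL"):
    """Lazily convert an iterable of lines (e.g. an open file or sys.stdin)."""
    for line in lines:
        yield convert_line(line, out_delim, null_token)
```

To use it as a filter, iterate over convert_lines(sys.stdin) and write each 
result followed by a newline to sys.stdout.  Matching whole fields avoids 
rewriting string values that happen to contain '\N', which a naive global 
search-and-replace would also change.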

Can one also accomplish this goal with a custom Transform? 
http://wiki.apache.org/hadoop/Hive/LanguageManual/Transform

> ctrl-A is the only output delimiter used, regardless of the Hive table 
> structure
> --------------------------------------------------------------------------------
>
>                 Key: HIVE-634
>                 URL: https://issues.apache.org/jira/browse/HIVE-634
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Serializers/Deserializers
>    Affects Versions: 0.3.0
>            Reporter: Peter Skomoroch
>            Priority: Minor
>
> No matter what the table format, INSERT OVERWRITE LOCAL DIRECTORY will always 
> use ctrl-A delimiters ('\001' ).
> INSERT OVERWRITE LOCAL DIRECTORY '/mnt/daily_timelines' SELECT * FROM 
> daily_timelines;
> where daily_timelines is defined as tab delimited
> CREATE TABLE daily_timelines (
>     page_id BIGINT, 
>     dates STRING, 
>     pageviews STRING, 
>     total_pageviews BIGINT) 
>   ROW FORMAT DELIMITED 
>     FIELDS TERMINATED BY '\t' 
>   STORED AS TEXTFILE;
> This page also indicates Hive uses a fixed delimiter, and should be updated: 
> http://wiki.apache.org/hadoop/Hive/LanguageManual/DML

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
