[
https://issues.apache.org/jira/browse/HIVE-634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12734147#action_12734147
]
Neal Richter commented on HIVE-634:
-----------------------------------
Note that both methods seem to use '\N' in the output files instead of 'NULL'.
It would be better for the export to use 'NULL' for full compatibility with
MySQL and postgres. I tested '\N' and it's not recognized as 'NULL' with using
MySQL's LOAD DATA INFILE. Easy enough for sed to fix with another pass through
the data... at the cost of another pass through the data.
Can one also accomplish this goal with a custom Transform?
http://wiki.apache.org/hadoop/Hive/LanguageManual/Transform
> ctrl-A is the only output delimiter used, regardless of the Hive table
> structure
> --------------------------------------------------------------------------------
>
> Key: HIVE-634
> URL: https://issues.apache.org/jira/browse/HIVE-634
> Project: Hadoop Hive
> Issue Type: Bug
> Components: Serializers/Deserializers
> Affects Versions: 0.3.0
> Reporter: Peter Skomoroch
> Priority: Minor
>
> No matter what the table format, INSERT OVERWRITE LOCAL DIRECTORY will always
> use ctrl-A delimiters ('\001' ).
> INSERT OVERWRITE LOCAL DIRECTORY '/mnt/daily_timelines' SELECT * FROM
> daily_timelines;
> where daily_timelines is defined as tab delimited
> CREATE TABLE daily_timelines (
> page_id BIGINT,
> dates STRING,
> pageviews STRING,
> total_pageviews BIGINT)
> ROW FORMAT DELIMITED
> FIELDS TERMINATED BY '\t'
> STORED AS TEXTFILE;
> This page also indicates Hive uses a fixed delimiter, and should be updated:
> http://wiki.apache.org/hadoop/Hive/LanguageManual/DML
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.