'compressed' keyword in DDL syntax misleading and does not compress
-------------------------------------------------------------------

                 Key: HADOOP-4169
                 URL: https://issues.apache.org/jira/browse/HADOOP-4169
             Project: Hadoop Core
          Issue Type: Bug
          Components: contrib/hive
            Reporter: Joydeep Sen Sarma


Hive two types of data files - flat files and sequencefiles. Syntax should 
reflect this. Currently the 'compressed' keyword is used to choose sequencefile 
format - but does not actually compress the files. this is misleading. In 
addition - flat files can also be compressed.

Proposal is to replace 'compressed' with 'sequencefile'. And compression 
options should be applied from standard hadoop way of specifying whether output 
should be compressed (''mapred.output.compress') - ie. session options. 
(session options will also define codec etc.). default file format and 
compression options can be specified in conf file.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to