[
https://issues.apache.org/jira/browse/HIVE-11912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Szehon Ho updated HIVE-11912:
-----------------------------
Attachment: HIVE-11912.patch
Attaching a patch. Unfortunately, there's no Serde extension today to specify
default table properties, like StorageHandler.
The most logical place seemed to be StorageFormat abstraction, which is the the
rough equivalent of StorageHandler. By putting it there instead of in
AbstractSerde, we don't have to waste time initializing the Serde.
Also, there's some issues on MacOS. To workaround the Mac / Snappy issue for
Snappy version < 1.0.5 as is the case for Hadoop 2.6, HADOOP_OPTS should be set
like
{noformat}
export HADOOP_OPTS="-Dorg.xerial.snappy.tempdir=/tmp
-Dorg.xerial.snappy.lib.name=libsnappyjava.jnilib $HADOOP_OPTS"
{noformat}
> Make snappy compression default for parquet tables
> --------------------------------------------------
>
> Key: HIVE-11912
> URL: https://issues.apache.org/jira/browse/HIVE-11912
> Project: Hive
> Issue Type: Improvement
> Components: File Formats
> Reporter: Szehon Ho
> Attachments: HIVE-11912.patch
>
>
> Snappy is a popular compression codec for Parquet, and is the default in many
> Parquet applications, increasing the performance.
> This change would make it the default for new Hive Parquet tables.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)