[
https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15169160#comment-15169160
]
Hyukjin Kwon commented on SPARK-11691:
--------------------------------------
This issue deals with somewhat more general compression options compared to
the issue I gave here. However, general Hadoop compression configurations
can only be applied to the JSON, CSV and TEXT datasources, for which I already
submitted some PRs that have been merged. So, although the issues themselves are
slightly different, I think the PRs I submitted cover this.
Also, I think we can't just assume that all {{HadoopFsRelation}}s support
compression. ORC and Parquet might have to be dealt with differently
due to different configuration keys and supported codecs.
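To illustrate why the formats may need separate handling, here is a rough sketch that maps a format and a short codec name to the configuration entries one would set. The object {{CompressionSketch}} and the method {{compressionConf}} are hypothetical names made up for this example, not Spark's implementation. The configuration keys and codec class names are the standard Hadoop, Parquet and ORC ones, and the accepted values for Parquet and ORC are not validated here.

```scala
// Hypothetical helper for illustration only; this is not Spark's actual code.
object CompressionSketch {
  // Short names mapped to Hadoop compression codec class names.
  val hadoopCodecs: Map[String, String] = Map(
    "bzip2"  -> "org.apache.hadoop.io.compress.BZip2Codec",
    "gzip"   -> "org.apache.hadoop.io.compress.GzipCodec",
    "lz4"    -> "org.apache.hadoop.io.compress.Lz4Codec",
    "snappy" -> "org.apache.hadoop.io.compress.SnappyCodec"
  )

  // Returns the configuration entries to set for the given format and codec.
  def compressionConf(format: String, codec: String): Map[String, String] = {
    val name = codec.toLowerCase
    format.toLowerCase match {
      // Text-based sources go through the general Hadoop output settings.
      case "json" | "csv" | "text" =>
        val codecClass = hadoopCodecs.getOrElse(
          name, throw new IllegalArgumentException(s"Unknown codec: $codec"))
        Map(
          "mapreduce.output.fileoutputformat.compress" -> "true",
          "mapreduce.output.fileoutputformat.compress.codec" -> codecClass
        )
      // Parquet and ORC use their own keys and their own sets of codec names.
      // The value is passed through unchecked (assumption: caller knows the
      // names each format accepts).
      case "parquet" => Map("parquet.compression" -> name.toUpperCase)
      case "orc"     => Map("orc.compress" -> name.toUpperCase)
      case other =>
        throw new IllegalArgumentException(s"Unsupported format: $other")
    }
  }
}
```

For example, {{CompressionSketch.compressionConf("json", "gzip")}} yields the two Hadoop output keys, whereas the same call with {{"parquet"}} yields a single {{parquet.compression}} entry, which is why a single code path for all {{HadoopFsRelation}}s would not be enough.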
> Allow to specify compression codec in HadoopFsRelation when saving
> -------------------------------------------------------------------
>
> Key: SPARK-11691
> URL: https://issues.apache.org/jira/browse/SPARK-11691
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Reporter: Jeff Zhang
>
> Currently, there's no way to specify a compression codec when saving a data frame
> to HDFS. It would be nice to allow specifying a compression codec in DataFrameWriter,
> just as we did in the RDD API:
> {code}
> def saveAsTextFile(path: String, codec: Class[_ <: CompressionCodec]): Unit =
> withScope {
> {code}