[jira] [Commented] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

Hyukjin Kwon (JIRA) Fri, 26 Feb 2016 07:08:32 -0800

    [ 
https://issues.apache.org/jira/browse/SPARK-11691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15169160#comment-15169160
 ]


Hyukjin Kwon commented on SPARK-11691:
--------------------------------------

This issue deals with a bit more generalized compression options compareing to 
the issue I gave here. However, for general Hadoop compression configurations, 
they can only be applied to JSON, CSV and TEXT datasources which I already 
submitted some PRs and they were merged. So, although the issues themselves are 
slightly different, I think the PRs I submitted cover this. 

Also, I think we can't just assume all the {{HadoopFsRelation}}s support 
compression. For ORC and Parquet, they might have to be dealt with differently 
due to dofferent configuration keys and supporting codecs.

> Allow to specify compression codec in HadoopFsRelation when saving 
> -------------------------------------------------------------------
>
>                 Key: SPARK-11691
>                 URL: https://issues.apache.org/jira/browse/SPARK-11691
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>            Reporter: Jeff Zhang
>
> Currently, there's no way to specify compression codec when saving data frame 
> to hdfs. It would nice to allow specify compression codec in DataFrameWriter 
> just as we did in RDD api
> {code}
> def saveAsTextFile(path: String, codec: Class[_ <: CompressionCodec]): Unit = 
> withScope {
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (SPARK-11691) Allow to specify compression codec in HadoopFsRelation when saving

Reply via email to