[
https://issues.apache.org/jira/browse/SPARK-5685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15657502#comment-15657502
]
Aditya commented on SPARK-5685:
-------------------------------
If this issue is still open, I can work on it
> Show warning when users open text files compressed with non-splittable
> algorithms like gzip
> -------------------------------------------------------------------------------------------
>
> Key: SPARK-5685
> URL: https://issues.apache.org/jira/browse/SPARK-5685
> Project: Spark
> Issue Type: Improvement
> Components: Spark Core
> Reporter: Nicholas Chammas
> Priority: Minor
>
> This is a usability or user-friendliness issue.
> It's extremely common for people to load a text file compressed with gzip,
> process it, and then wonder why only 1 core in their cluster is doing any
> work.
> Some examples:
> * http://stackoverflow.com/q/28127119/877069
> * http://stackoverflow.com/q/27531816/877069
> I'm not sure how this problem can be generalized, but at the very least it
> would be helpful if Spark displayed some kind of warning in the common case
> when someone opens a gzipped file with {{sc.textFile}}.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]