[
https://issues.apache.org/jira/browse/SPARK-14688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15245278#comment-15245278
]
seth commented on SPARK-14688:
------------------------------
Furthermore, It would appear that pyspark {{textFileStream}} doesn't support
regex like the the {{sc.textFile}} does.
i.e:
{{RDD = sc.textFile('somelocation/*/another/')}} is supported, while
{{RDD = scc.textFileStream('somelocation/*/another/')}} is not
> pyspark textFileStream gzipped
> ------------------------------
>
> Key: SPARK-14688
> URL: https://issues.apache.org/jira/browse/SPARK-14688
> Project: Spark
> Issue Type: Improvement
> Components: PySpark, Streaming
> Affects Versions: 1.6.1
> Reporter: seth
> Labels: pyspark, streaming
>
> pyspark streamingObject does not support reading gzip files.
> 2 notes:
> 1.regular sparkContext does support gzip files
> 2. Java/Scala method support streaming gzip files
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]