[
https://issues.apache.org/jira/browse/SPARK-3928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532680#comment-14532680
]
Yana Kadiyska commented on SPARK-3928:
--------------------------------------
Marius, are you saying that wildcards are not supported then? in my case, I
would really like to do "/r/warehouse/hive/pkey=????-2015-04/*" (which works w/
textFile method btw) -- i.e. pass a single path for all April 2015 partitions.
Enumerating all paths underneath is pretty crazy, that's a huge list.
Are you saying that is the only way? I thought the whole point of this bug is
that we _don't_ have to enumerate the paths explicitly. Also in my case hc is a
HiveContext instance, not a dataframe.
> Support wildcard matches on Parquet files
> -----------------------------------------
>
> Key: SPARK-3928
> URL: https://issues.apache.org/jira/browse/SPARK-3928
> Project: Spark
> Issue Type: Improvement
> Components: Spark Core, SQL
> Reporter: Nicholas Chammas
> Priority: Minor
> Fix For: 1.3.0
>
>
> {{SparkContext.textFile()}} supports patterns like {{part-*}} and
> {{2014-\?\?-\?\?}}.
> It would be nice if {{SparkContext.parquetFile()}} did the same.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]