I’ve created a jira issue for this 
https://issues.apache.org/jira/browse/SPARK-4967

Originally we want to support multiple parquet file paths scanning as I guess, 
and those file paths are in a single string separated by comma internally, 
however I didn’t find any public example says we support multiple parquet files 
for API sqlContext.parquetFile, we need to think how to support multiple paths 
in some other way.

Cheng Hao


From: Michael Armbrust [mailto:mich...@databricks.com]
Sent: Thursday, December 25, 2014 1:01 PM
To: Daniel Siegmann
Cc: user@spark.apache.org
Subject: Re: Escape commas in file names

No, there is not.  Can you open a JIRA?

On Tue, Dec 23, 2014 at 6:33 PM, Daniel Siegmann 
<daniel.siegm...@velos.io<mailto:daniel.siegm...@velos.io>> wrote:
I am trying to load a Parquet file which has a comma in its name. Yes, this is 
a valid file name in HDFS. However, sqlContext.parquetFile interprets this as a 
comma-separated list of parquet files.

Is there any way to escape the comma so it is treated as part of a single file 
name?

--
Daniel Siegmann, Software Developer
Velos
Accelerating Machine Learning

54 W 40th St, New York, NY 10018
E: daniel.siegm...@velos.io<mailto:daniel.siegm...@velos.io> W: 
www.velos.io<http://www.velos.io>

Reply via email to