[
https://issues.apache.org/jira/browse/SPARK-9347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael Armbrust resolved SPARK-9347.
-------------------------------------
Resolution: Duplicate
Okay, thanks for clarifying. I'm still going to close this since there is a
duplicate ticket in progress.
I'm curious, have you tested the performance of the new implementation and
found that its not sufficient?
> spark load of existing parquet files extremely slow if large number of files
> ----------------------------------------------------------------------------
>
> Key: SPARK-9347
> URL: https://issues.apache.org/jira/browse/SPARK-9347
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 1.3.1
> Reporter: Samphel Norden
>
> When spark sql shell is launched and we point it to a folder containing a
> large number of parquet files, the sqlContext.parquetFile() command takes a
> very long time to load the tables.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]