[jira] [Commented] (SPARK-11787) Speed up parquet reader for flat schemas

Apache Spark (JIRA) Thu, 19 Nov 2015 13:39:49 -0800

    [ 
https://issues.apache.org/jira/browse/SPARK-11787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15014436#comment-15014436
 ]


Apache Spark commented on SPARK-11787:
--------------------------------------

User 'nongli' has created a pull request for this issue:
https://github.com/apache/spark/pull/9845

> Speed up parquet reader for flat schemas
> ----------------------------------------
>
>                 Key: SPARK-11787
>                 URL: https://issues.apache.org/jira/browse/SPARK-11787
>             Project: Spark
>          Issue Type: Task
>          Components: SQL
>            Reporter: Nong Li
>            Assignee: Nong Li
>             Fix For: 1.6.0
>
>
> Measuring the performance of running some of the TPCDS and anecdotally, 
> parquet scan and record reconstruction performance is a bottleneck.
> For simple schemas, we can do better using the lower level parquet-mr APIs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (SPARK-11787) Speed up parquet reader for flat schemas

Reply via email to