[GitHub] spark pull request #21893: [SPARK-24965][SQL] Support selecting from partiti...

gatorsmile Mon, 30 Jul 2018 09:18:29 -0700

Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21893#discussion_r206234291
  
    --- Diff: 
sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala 
---
    @@ -437,6 +437,8 @@ private[hive] class HiveClientImpl(
                 throw new AnalysisException("Hive index table is not 
supported.")
             },
             schema = schema,
    +        hasMultiFormatPartitions =
    +          shim.getAllPartitions(client, 
h).map(_.getInputFormatClass).distinct.size > 1,
    --- End diff --
    
    Fetching the metadata of all the partitions will slow down all the queries 
when the number of partitions is huge. We will face a huge perf regression.



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request #21893: [SPARK-24965][SQL] Support selecting from partiti...

Reply via email to