[GitHub] spark pull request: [SPARK-2706][SQL] Enable Spark to support Hive...

yhuai Fri, 19 Sep 2014 13:06:01 -0700

Github user yhuai commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2241#discussion_r17806210
  
    --- Diff: 
sql/hive/src/main/scala/org/apache/spark/sql/hive/execution/HiveTableScan.scala 
---
    @@ -71,14 +72,14 @@ case class HiveTableScan(
         Cast(Literal(value), dataType).eval(null)
       }
     
    +
       private def addColumnMetadataToConf(hiveConf: HiveConf) {
         // Specifies needed column IDs for those non-partitioning columns.
         val neededColumnIDs =
           attributes.map(a =>
             relation.attributes.indexWhere(_.name == a.name): 
Integer).filter(index => index >= 0)
     
    -    ColumnProjectionUtils.appendReadColumnIDs(hiveConf, neededColumnIDs)
    -    ColumnProjectionUtils.appendReadColumnNames(hiveConf, 
attributes.map(_.name))
    +    HiveShim.appendReadColumns(hiveConf, neededColumnIDs, 
attributes.map(_.name))
    --- End diff --
    
    Yeah, because we were always using append, these lists will be messed up in 
the case of join or after running the first query. I opened 
https://issues.apache.org/jira/browse/SPARK-3559. It will be great if it can 
also fix it.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request: [SPARK-2706][SQL] Enable Spark to support Hive...

Reply via email to