[02/13] spark git commit: [SPARK-1442] [SQL] Window Function Support for Spark SQL

2015-05-06 Thread marmbrus
http://git-wip-us.apache.org/repos/asf/spark/blob/b521a3b0/sql/hive/src/test/resources/golden/windowing_windowspec.q (deterministic)-5-ee44c5cdc80e1c832b702f9fb76d8145 -- diff --git

[10/13] spark git commit: [SPARK-1442] [SQL] Window Function Support for Spark SQL

2015-05-06 Thread marmbrus
http://git-wip-us.apache.org/repos/asf/spark/blob/b521a3b0/sql/hive/src/test/resources/golden/windowing_navfn.q (deterministic)-1-9ee79e711248dd6e0a6ce27e439e55f4 -- diff --git

[05/13] spark git commit: [SPARK-1442] [SQL] Window Function Support for Spark SQL

2015-05-06 Thread marmbrus
http://git-wip-us.apache.org/repos/asf/spark/blob/b521a3b0/sql/hive/src/test/resources/golden/windowing_udaf.q (deterministic)-4-f2e4d659b65a833e9281b6786d3d55c1 -- diff --git a/sql/hive/src/test/resources/golden/windowing_udaf.q

[12/13] spark git commit: [SPARK-1442] [SQL] Window Function Support for Spark SQL

2015-05-06 Thread marmbrus
http://git-wip-us.apache.org/repos/asf/spark/blob/b521a3b0/sql/hive/src/main/scala/org/apache/spark/sql/hive/hiveUdfs.scala -- diff --git a/sql/hive/src/main/scala/org/apache/spark/sql/hive/hiveUdfs.scala

[01/13] spark git commit: [SPARK-1442] [SQL] Window Function Support for Spark SQL

2015-05-06 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 7212897dc - b521a3b03 http://git-wip-us.apache.org/repos/asf/spark/blob/b521a3b0/sql/hive/src/test/resources/golden/windowing_windowspec.q (deterministic)-8-45a1d7c2aba45d761e19ff4dfdf5463e

[07/13] spark git commit: [SPARK-1442] [SQL] Window Function Support for Spark SQL

2015-05-06 Thread marmbrus
http://git-wip-us.apache.org/repos/asf/spark/blob/b521a3b0/sql/hive/src/test/resources/golden/windowing_rank.q (deterministic) 2-0-81bb7f49a55385878637c8aac4d08e5 -- diff --git

[03/13] spark git commit: [SPARK-1442] [SQL] Window Function Support for Spark SQL

2015-05-06 Thread marmbrus
http://git-wip-us.apache.org/repos/asf/spark/blob/b521a3b0/sql/hive/src/test/resources/golden/windowing_windowspec.q (deterministic)-3-6f104992e0050576085064815de43194 -- diff --git

[01/13] spark git commit: [SPARK-1442] [SQL] Window Function Support for Spark SQL

2015-05-06 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master c3eb441f5 - f2c47082c http://git-wip-us.apache.org/repos/asf/spark/blob/f2c47082/sql/hive/src/test/resources/golden/windowing_windowspec.q (deterministic)-8-45a1d7c2aba45d761e19ff4dfdf5463e

[03/13] spark git commit: [SPARK-1442] [SQL] Window Function Support for Spark SQL

2015-05-06 Thread marmbrus
http://git-wip-us.apache.org/repos/asf/spark/blob/f2c47082/sql/hive/src/test/resources/golden/windowing_windowspec.q (deterministic)-3-6f104992e0050576085064815de43194 -- diff --git

[13/13] spark git commit: [SPARK-1442] [SQL] Window Function Support for Spark SQL

2015-05-06 Thread marmbrus
[SPARK-1442] [SQL] Window Function Support for Spark SQL Adding more information about the implementation... This PR is adding the support of window functions to Spark SQL (specifically OVER and WINDOW clause). For every expression having a OVER clause, we use a WindowExpression as the

[09/13] spark git commit: [SPARK-1442] [SQL] Window Function Support for Spark SQL

2015-05-06 Thread marmbrus
http://git-wip-us.apache.org/repos/asf/spark/blob/f2c47082/sql/hive/src/test/resources/golden/windowing_ntile.q (deterministic)-1-a3d352560ac835993001665db6954965 -- diff --git

spark git commit: [SPARK-6231][SQL/DF] Automatically resolve join condition ambiguity for self-joins.

2015-05-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 762ff2e11 - e61083cca [SPARK-6231][SQL/DF] Automatically resolve join condition ambiguity for self-joins. See the comment in join function for more information. Author: Reynold Xin r...@databricks.com Closes #5919 from

spark git commit: [SPARK-6231][SQL/DF] Automatically resolve join condition ambiguity for self-joins.

2015-05-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 0092abb47 - 1fd31ba08 [SPARK-6231][SQL/DF] Automatically resolve join condition ambiguity for self-joins. See the comment in join function for more information. Author: Reynold Xin r...@databricks.com Closes #5919 from

spark git commit: [SPARK-6907] [SQL] Isolated client for HiveMetastore

2015-05-03 Thread marmbrus
: Michael Armbrust mich...@databricks.com Closes #5851 from marmbrus/isolatedClient and squashes the following commits: c72f6ac [Michael Armbrust] rxins comments 1e271fa [Michael Armbrust] [SPARK-6907][SQL] Isolated client for HiveMetastore Project: http://git-wip-us.apache.org/repos/asf/spark

spark git commit: [SPARK-5213] [SQL] Pluggable SQL Parser Support

2015-05-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 82c8c37c0 - 5d6b90d93 [SPARK-5213] [SQL] Pluggable SQL Parser Support based on #4015, we should not delete `sqlParser` from sqlcontext, that leads to mima failed. Users implement dialect to give a fallback for `sqlParser` and we should

spark git commit: [SPARK-6913][SQL] Fixed java.sql.SQLException: No suitable driver found

2015-04-30 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master a0d8a61ab - e991255e7 [SPARK-6913][SQL] Fixed java.sql.SQLException: No suitable driver found Fixed `java.sql.SQLException: No suitable driver found` when loading DataFrame into Spark SQL if the driver is supplied with `--jars` argument.

spark git commit: [SPARK-7093] [SQL] Using newPredicate in NestedLoopJoin to enable code generation

2015-04-30 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master ee0441393 - 079733817 [SPARK-7093] [SQL] Using newPredicate in NestedLoopJoin to enable code generation Using newPredicate in NestedLoopJoin instead of InterpretedPredicate to make it can make use of code generation Author: scwf

spark git commit: [SPARK-5213] [SQL] Pluggable SQL Parser Support

2015-04-30 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master e991255e7 - 3ba5aaab8 [SPARK-5213] [SQL] Pluggable SQL Parser Support This PR aims to make the SQL Parser Pluggable, and user can register it's own parser via Spark SQL CLI. ``` # add the jar into the classpath

spark git commit: [SPARK-7109] [SQL] Push down left side filter for left semi join

2015-04-30 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 079733817 - a0d8a61ab [SPARK-7109] [SQL] Push down left side filter for left semi join Now in spark sql optimizer we only push down right side filter for left semi join, actually we can push down left side filter because left semi join is

spark git commit: [SPARK-7123] [SQL] support table.star in sqlcontext

2015-04-30 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 3ba5aaab8 - 473552fa5 [SPARK-7123] [SQL] support table.star in sqlcontext Run following sql get error `SELECT r.* FROM testData l join testData2 r on (l.key = r.a)` Author: scwf wangf...@huawei.com Closes #5690 from scwf/tablestar and

spark git commit: [SPARK-7055][SQL]Use correct ClassLoader for JDBC Driver in JDBCRDD.getConnector

2015-04-23 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 534f2a436 - c1213e6a9 [SPARK-7055][SQL]Use correct ClassLoader for JDBC Driver in JDBCRDD.getConnector Author: Vinod K C vinod...@huawei.com Closes #5633 from vinodkc/use_correct_classloader_driverload and squashes the following commits:

spark git commit: [SPARK-7059][SQL] Create a DataFrame join API to facilitate equijoin.

2015-04-22 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master fbe7106d7 - baf865ddc [SPARK-7059][SQL] Create a DataFrame join API to facilitate equijoin. Author: Reynold Xin r...@databricks.com Closes #5638 from rxin/joinUsing and squashes the following commits: 13e9cc9 [Reynold Xin] Code review +

spark git commit: [SPARK-6996][SQL] Support map types in java beans

2015-04-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 6265cba00 - 2a24bf92e [SPARK-6996][SQL] Support map types in java beans liancheng mengxr this is similar to #5146. Author: Punya Biswal pbis...@palantir.com Closes #5578 from punya/feature/SPARK-6996 and squashes the following commits:

spark git commit: [SPARK-7011] Build(compilation) fails with scala 2.11 option, because a protected[sql] type is accessed in ml package.

2015-04-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 45c47fa41 - 04bf34e34 [SPARK-7011] Build(compilation) fails with scala 2.11 option, because a protected[sql] type is accessed in ml package.

spark git commit: [SPARK-5817] [SQL] Fix bug of udtf with column names

2015-04-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 2a24bf92e - 7662ec23b [SPARK-5817] [SQL] Fix bug of udtf with column names It's a bug while do query like: ```sql select d from (select explode(array(1,1)) d from src limit 1) t ``` And it will throws exception like: ```

spark git commit: [SPARK-6969][SQL] Refresh the cached table when REFRESH TABLE is used

2015-04-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 03fd92167 - 6265cba00 [SPARK-6969][SQL] Refresh the cached table when REFRESH TABLE is used https://issues.apache.org/jira/browse/SPARK-6969 Author: Yin Huai yh...@databricks.com Closes #5583 from yhuai/refreshTableRefreshDataCache and

spark git commit: [SQL][minor] make it more clear that we only need to re-throw GetField exception for UnresolvedAttribute

2015-04-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 2e8c6ca47 - 03fd92167 [SQL][minor] make it more clear that we only need to re-throw GetField exception for UnresolvedAttribute For `GetField` outside `UnresolvedAttribute`, we will throw exception in `Analyzer`. Author: Wenchen Fan

spark git commit: [SPARK-6994] Allow to fetch field values by name in sql.Row

2015-04-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 04bf34e34 - 2e8c6ca47 [SPARK-6994] Allow to fetch field values by name in sql.Row It looked weird that up to now there was no way in Spark's Scala API to access fields of `DataFrame/sql.Row` by name, only by their index. This tries to

spark git commit: [SPARK-6635][SQL] DataFrame.withColumn should replace columns with identical column names

2015-04-20 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master ce7ddabbc - c736220da [SPARK-6635][SQL] DataFrame.withColumn should replace columns with identical column names JIRA https://issues.apache.org/jira/browse/SPARK-6635 Author: Liang-Chi Hsieh vii...@gmail.com Closes #5541 from

spark git commit: [SPARK-6368][SQL] Build a specialized serializer for Exchange operator.

2015-04-20 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 517bdf36a - ce7ddabbc [SPARK-6368][SQL] Build a specialized serializer for Exchange operator. JIRA: https://issues.apache.org/jira/browse/SPARK-6368 Author: Yin Huai yh...@databricks.com Closes #5497 from yhuai/serializer2 and squashes

spark git commit: [SPARK-6911] [SQL] improve accessor for nested types

2015-04-16 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 5fe434335 - 6183b5e2c [SPARK-6911] [SQL] improve accessor for nested types Support access columns by index in Python: ``` df[df[0] 3].collect() [Row(age=5, name=u'Bob')] ``` Access items in ArrayType or MapType ```

spark git commit: [SQL][Minor] Fix foreachUp of treenode

2015-04-16 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 6183b5e2c - d96608674 [SQL][Minor] Fix foreachUp of treenode `foreachUp` should runs the given function recursively on [[children]] then on this node(just like transformUp). The current implementation does not follow this. This will

spark git commit: [SPARK-6899][SQL] Fix type mismatch when using codegen with Average on DecimalType

2015-04-16 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d96608674 - 1e43851d6 [SPARK-6899][SQL] Fix type mismatch when using codegen with Average on DecimalType JIRA https://issues.apache.org/jira/browse/SPARK-6899 Author: Liang-Chi Hsieh vii...@gmail.com Closes #5517 from

spark git commit: [SPARK-6966][SQL] Use correct ClassLoader for JDBC Driver

2015-04-16 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 1e43851d6 - e5949c287 [SPARK-6966][SQL] Use correct ClassLoader for JDBC Driver Otherwise we cannot add jars with drivers after the fact. Author: Michael Armbrust mich...@databricks.com Closes #5543 from marmbrus/jdbcClassloader

spark git commit: SPARK-6927 [SQL] Sorting Error when codegen on

2015-04-16 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 04e44b37c - 5fe434335 SPARK-6927 [SQL] Sorting Error when codegen on Fix this error by adding BinaryType comparor in GenerateOrdering. JIRA https://issues.apache.org/jira/browse/SPARK-6927 Author: 云峤 chensong...@alibaba-inc.com

spark git commit: [SPARK-6898][SQL] completely support special chars in column names

2015-04-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 557a797a2 - 4754e16f4 [SPARK-6898][SQL] completely support special chars in column names Even if we wrap column names in backticks like `` `a#$b.c` ``, we still handle the . inside column name specially. I think it's fragile to use a

spark git commit: [SPARK-6730][SQL] Allow using keyword as identifier in OPTIONS

2015-04-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master f11288d52 - b75b30707 [SPARK-6730][SQL] Allow using keyword as identifier in OPTIONS JIRA: https://issues.apache.org/jira/browse/SPARK-6730 It is very possible that keyword will be used as identifier in `OPTIONS`, this pr makes it works.

spark git commit: [SPARK-6800][SQL] Update doc for JDBCRelation's columnPartition

2015-04-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 1e4b065b7 - 84878e0f0 [SPARK-6800][SQL] Update doc for JDBCRelation's columnPartition JIRA https://issues.apache.org/jira/browse/SPARK-6800 Author: Liang-Chi Hsieh vii...@gmail.com Closes #5488 from viirya/fix_jdbc_where and squashes

spark git commit: [SPARK-6887][SQL] ColumnBuilder misses FloatType

2015-04-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master e3e4e9a38 - 785f95586 [SPARK-6887][SQL] ColumnBuilder misses FloatType https://issues.apache.org/jira/browse/SPARK-6887 Author: Yin Huai yh...@databricks.com Closes #5499 from yhuai/inMemFloat and squashes the following commits: 84cba38

spark git commit: [SPARK-6844][SQL] Clean up accumulators used in InMemoryRelation when it is uncached

2015-04-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 85842760d - cf38fe04f [SPARK-6844][SQL] Clean up accumulators used in InMemoryRelation when it is uncached JIRA: https://issues.apache.org/jira/browse/SPARK-6844 Author: Liang-Chi Hsieh vii...@gmail.com Closes #5475 from

spark git commit: [SPARK-5277][SQL] - SparkSqlSerializer doesn't always register user specified KryoRegistrators

2015-04-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d5f1b9650 - 8a53de16f [SPARK-5277][SQL] - SparkSqlSerializer doesn't always register user specified KryoRegistrators [SPARK-5277][SQL] - SparkSqlSerializer doesn't always register user specified KryoRegistrators There were a few places

spark git commit: [SPARK-2213] [SQL] sort merge join for spark sql

2015-04-15 Thread marmbrus
pull request #3 from marmbrus/pr/5208 952168a [Michael Armbrust] add type 5492884 [Michael Armbrust] copy when ordering 7ddd656 [Michael Armbrust] Cleanup addition of ordering requirements b198278 [Daoyuan Wang] inherit ordering in project c8e82a3 [Daoyuan Wang] fix style 6e897dd [Daoyuan Wang] hide

spark git commit: [SPARK-6871][SQL] WITH clause in CTE can not following another WITH clause

2015-04-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 30a6e0dcc - 6be918942 [SPARK-6871][SQL] WITH clause in CTE can not following another WITH clause JIRA https://issues.apache.org/jira/browse/SPARK-6871 Author: Liang-Chi Hsieh vii...@gmail.com Closes #5480 from viirya/no_cte_after_cte and

spark git commit: [SQL][SPARK-6742]: Don't push down predicates which reference partition column(s)

2015-04-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 85ee0cabe - 3a205bbd9 [SQL][SPARK-6742]: Don't push down predicates which reference partition column(s) cc liancheng Author: Yash Datta yash.da...@guavus.com Closes #5390 from saucam/fpush and squashes the following commits: 3f026d6

spark git commit: [SPARK-5941] [SQL] Unit Test loads the table `src` twice for leftsemijoin.q

2015-04-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master e63a86abe - c5602bdc3 [SPARK-5941] [SQL] Unit Test loads the table `src` twice for leftsemijoin.q In `leftsemijoin.q`, there is a data loading command for table `sales` already, but in `TestHive`, it also created the table `sales`, which

spark git commit: [SPARK-6303][SQL] Remove unnecessary Average in GeneratedAggregate

2015-04-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d7f2c1986 - 5b8b324f3 [SPARK-6303][SQL] Remove unnecessary Average in GeneratedAggregate Because `Average` is a `PartialAggregate`, we never get a `Average` node when reaching `HashAggregation` to prepare `GeneratedAggregate`. That is

spark git commit: [SPARK-6877][SQL] Add code generation support for Min

2015-04-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 5b8b324f3 - 4898dfa46 [SPARK-6877][SQL] Add code generation support for Min Currently `min` is not supported in code generation. This pr adds the support for it. Author: Liang-Chi Hsieh vii...@gmail.com Closes #5487 from

spark git commit: [SPARK-5794] [SQL] fix add jar

2015-04-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 3782e1f2b - b45059d0d [SPARK-5794] [SQL] fix add jar Author: Daoyuan Wang daoyuan.w...@intel.com Closes #4586 from adrian-wang/addjar and squashes the following commits: efdd602 [Daoyuan Wang] move jar to another place 6c707e8 [Daoyuan

spark git commit: [SPARK-6611][SQL] Add support for INTEGER as synonym of INT.

2015-04-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 198cf2a3f - 5f7b7cdab [SPARK-6611][SQL] Add support for INTEGER as synonym of INT. https://issues.apache.org/jira/browse/SPARK-6611 Author: Santiago M. Mola santiago.m...@sap.com Closes #5271 from smola/features/integer-parse and

spark git commit: [SPARK-6858][SQL] Register Java HashMap for SparkSqlSerializer

2015-04-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 3ceb810aa - 198cf2a3f [SPARK-6858][SQL] Register Java HashMap for SparkSqlSerializer Since now kyro serializer is used for `GeneralHashedRelation` whether kyro is enabled or not, it is better to register Java `HashMap` in

spark git commit: [SPARK-6863] Fix formatting on SQL programming guide.

2015-04-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 ffc821e20 - ea13948b9 [SPARK-6863] Fix formatting on SQL programming guide. https://issues.apache.org/jira/browse/SPARK-6863 Author: Santiago M. Mola santiago.m...@sap.com Closes #5472 from smola/fix/sql-docs and squashes the

spark git commit: [SPARK-6863] Fix formatting on SQL programming guide.

2015-04-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 5f7b7cdab - 6437e7cc3 [SPARK-6863] Fix formatting on SQL programming guide. https://issues.apache.org/jira/browse/SPARK-6863 Author: Santiago M. Mola santiago.m...@sap.com Closes #5472 from smola/fix/sql-docs and squashes the following

spark git commit: [Minor][SQL] Fix typo in sql

2015-04-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 6437e7cc3 - 7dbd37160 [Minor][SQL] Fix typo in sql In this PR, analyser is changed to analyzer to keep a consistent naming. Some other typos are also fixed. Author: Guancheng (G.C.) Chen chenguanch...@gmail.com Closes #5474 from

spark git commit: [SPARK-6199] [SQL] Support CTE in HiveContext and SQLContext

2015-04-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 7dbd37160 - 2f5358873 [SPARK-6199] [SQL] Support CTE in HiveContext and SQLContext Author: haiyang huhaiy...@huawei.com Closes #4929 from haiyangsea/cte and squashes the following commits: 220b67d [haiyang] add golden files for cte test

spark git commit: [Spark-5068][SQL]Fix bug query data when path doesn't exist for HiveContext

2015-04-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 2f5358873 - 1f39a6111 [Spark-5068][SQL]Fix bug query data when path doesn't exist for HiveContext This PR follow up PR #3907 #3891 #4356. According to marmbrus liancheng 's comments, I try to use fs.globStatus to retrieve all

spark git commit: [SPARK-6179][SQL] Add token for SHOW PRINCIPALS role_name and SHOW TRANSACTIONS and SHOW COMPACTIONS

2015-04-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 1f39a6111 - 48cc84002 [SPARK-6179][SQL] Add token for SHOW PRINCIPALS role_name and SHOW TRANSACTIONS and SHOW COMPACTIONS [SHOW PRINCIPALS role_name] Lists all roles and users who belong to this role. Only the admin role has privilege

spark git commit: [SQL] Handle special characters in the authority of a Path's URI.

2015-04-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 352a5da42 - d2383fb5f [SQL] Handle special characters in the authority of a Path's URI. Author: Yin Huai yh...@databricks.com Closes #5381 from yhuai/parquetPath2 and squashes the following commits: fe296b4 [Yin Huai] Create new Path to

spark git commit: [SPARK-6367][SQL] Use the proper data type for those expressions that are hijacking existing data types.

2015-04-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d2383fb5f - 6d4e854ff [SPARK-6367][SQL] Use the proper data type for those expressions that are hijacking existing data types. This PR adds internal UDTs for expressions that are hijacking existing data types. The following UDTs are

spark git commit: [SQL][minor] move `resolveGetField` into a object

2015-04-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 6d4e854ff - 5c2844c51 [SQL][minor] move `resolveGetField` into a object The method `resolveGetField` isn't belong to `LogicalPlan` logically and didn't access any members of it. Author: Wenchen Fan cloud0...@outlook.com Closes #5435

spark git commit: [SPARK-6851][SQL] Create new instance for each converted parquet relation

2015-04-10 Thread marmbrus
. Author: Michael Armbrust mich...@databricks.com Closes #5458 from marmbrus/selfJoinParquet and squashes the following commits: 22df77c [Michael Armbrust] [SPARK-6851][SQL] Create new instance for each converted parquet relation Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit

[2/2] spark git commit: [SQL] [SPARK-6620] Speed up toDF() and rdd() functions by constructing converters in ScalaReflection

2015-04-10 Thread marmbrus
[SQL] [SPARK-6620] Speed up toDF() and rdd() functions by constructing converters in ScalaReflection cc marmbrus Author: Volodymyr Lyubinets vlyu...@gmail.com Closes #5279 from vlyubin/speedup and squashes the following commits: e75a387 [Volodymyr Lyubinets] Changes to ScalaUDF 11a20ec

[1/2] spark git commit: [SQL] [SPARK-6620] Speed up toDF() and rdd() functions by constructing converters in ScalaReflection

2015-04-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 23d5f8864 - 67d06880e http://git-wip-us.apache.org/repos/asf/spark/blob/67d06880/sql/catalyst/src/main/scala/org/apache/spark/sql/types/DataTypeConversions.scala -- diff

spark git commit: [SPARK-6451][SQL] supported code generation for CombineSum

2015-04-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 941828054 - 7d7384c78 [SPARK-6451][SQL] supported code generation for CombineSum Author: Venkata Ramana Gollamudi ramana.gollam...@huawei.com Closes #5138 from gvramana/sum_fix_codegen and squashes the following commits: 95f5fe4 [Venkata

spark git commit: [SPARK-6696] [SQL] Adds HiveContext.refreshTable to PySpark

2015-04-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 7d7384c78 - 891ada5be [SPARK-6696] [SQL] Adds HiveContext.refreshTable to PySpark !-- Reviewable:start -- [img src=https://reviewable.io/review_button.png; height=40 alt=Review on

spark git commit: [SQL][minor] remove duplicated resolveGetField and update comment

2015-04-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 55a92ef34 - 941828054 [SQL][minor] remove duplicated resolveGetField and update comment It's after https://github.com/apache/spark/pull/5189 Author: Wenchen Fan cloud0...@outlook.com Closes #5304 from cloud-fan/tmp and squashes the

spark git commit: [SPARK-6647][SQL] Make trait StringComparison as BinaryPredicate and fix unit tests of string data source Filter

2015-04-03 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 14632b794 - 26b415e15 [SPARK-6647][SQL] Make trait StringComparison as BinaryPredicate and fix unit tests of string data source Filter Now trait `StringComparison` is a `BinaryExpression`. In fact, it should be a `BinaryPredicate`. By

spark git commit: [SPARK-6243][SQL] The Operation of match did not conside the scenarios that order.dataType does not match NativeType

2015-04-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master dfd2982bc - 947802cb0 [SPARK-6243][SQL] The Operation of match did not conside the scenarios that order.dataType does not match NativeType It did not conside that order.dataType does not match NativeType. So i add case other = ... for

spark git commit: [SQL][Minor] Use analyzed logical instead of unresolved in HiveComparisonTest

2015-04-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 5db89127e - dfd2982bc [SQL][Minor] Use analyzed logical instead of unresolved in HiveComparisonTest Some internal unit test failed due to the logical plan node in pattern matching in `HiveComparisonTest`, e.g.

spark git commit: [SPARK-6686][SQL] Use resolved output instead of names for toDF rename

2015-04-02 Thread marmbrus
renames. Author: Michael Armbrust mich...@databricks.com Closes #5337 from marmbrus/toDFrename and squashes the following commits: 6a3159d [Michael Armbrust] [SPARK-6686][SQL] Use resolved output instead of names for toDF rename Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit

spark git commit: [SPARK-6686][SQL] Use resolved output instead of names for toDF rename

2015-04-02 Thread marmbrus
renames. Author: Michael Armbrust mich...@databricks.com Closes #5337 from marmbrus/toDFrename and squashes the following commits: 6a3159d [Michael Armbrust] [SPARK-6686][SQL] Use resolved output instead of names for toDF rename (cherry picked from commit 052dee0707830cfd3cd8821ecc3471a37ede294a

spark git commit: [SPARK-6655][SQL] We need to read the schema of a data source table stored in spark.sql.sources.schema property

2015-04-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 4214e50fc - 251698fb7 [SPARK-6655][SQL] We need to read the schema of a data source table stored in spark.sql.sources.schema property https://issues.apache.org/jira/browse/SPARK-6655 Author: Yin Huai yh...@databricks.com Closes #5313

spark git commit: [SPARK-6655][SQL] We need to read the schema of a data source table stored in spark.sql.sources.schema property

2015-04-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 78ba24584 - aecec07d6 [SPARK-6655][SQL] We need to read the schema of a data source table stored in spark.sql.sources.schema property https://issues.apache.org/jira/browse/SPARK-6655 Author: Yin Huai yh...@databricks.com Closes

spark git commit: [SPARK-6575][SQL] Converted Parquet Metastore tables no longer cache metadata

2015-04-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 45134ec92 - 4b82bd730 [SPARK-6575][SQL] Converted Parquet Metastore tables no longer cache metadata https://issues.apache.org/jira/browse/SPARK-6575 Author: Yin Huai yh...@databricks.com Closes #5339 from yhuai/parquetRelationCache and

spark git commit: [SPARK-6575][SQL] Converted Parquet Metastore tables no longer cache metadata

2015-04-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 0ef46b2d8 - 0c1c0fb90 [SPARK-6575][SQL] Converted Parquet Metastore tables no longer cache metadata https://issues.apache.org/jira/browse/SPARK-6575 Author: Yin Huai yh...@databricks.com Closes #5339 from yhuai/parquetRelationCache

spark git commit: [SQL] Throw UnsupportedOperationException instead of NotImplementedError

2015-04-02 Thread marmbrus
mich...@databricks.com Closes #5315 from marmbrus/throwUnsupported and squashes the following commits: c29e03b [Michael Armbrust] [SQL] Throw UnsupportedOperationException instead of NotImplementedError 052e05b [Michael Armbrust] [SQL] Throw UnsupportedOperationException instead

spark git commit: [SQL] Throw UnsupportedOperationException instead of NotImplementedError

2015-04-02 Thread marmbrus
...@databricks.com Closes #5315 from marmbrus/throwUnsupported and squashes the following commits: c29e03b [Michael Armbrust] [SQL] Throw UnsupportedOperationException instead of NotImplementedError 052e05b [Michael Armbrust] [SQL] Throw UnsupportedOperationException instead

spark git commit: [SPARK-6672][SQL] convert row to catalyst in createDataFrame(RDD[Row], ...)

2015-04-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 e6ee95cbd - c2694bba6 [SPARK-6672][SQL] convert row to catalyst in createDataFrame(RDD[Row], ...) We assume that `RDD[Row]` contains Scala types. So we need to convert them into catalyst types in createDataFrame. liancheng Author:

spark git commit: [SQL] SPARK-6658: Update DataFrame documentation to refer to correct types

2015-04-01 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 d697b764b - 223dd3f5d [SQL] SPARK-6658: Update DataFrame documentation to refer to correct types Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/223dd3f5 Tree:

spark git commit: [SPARK-6658][SQL] Update DataFrame documentation to fix type references.

2015-04-01 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 899ebcb14 - 191524e74 [SPARK-6658][SQL] Update DataFrame documentation to fix type references. First contribution here; would love to be getting some code contributions in soon. Let me know if there's anything about contribution process I

spark git commit: [SPARK-6145][SQL] fix ORDER BY on nested fields

2015-03-31 Thread marmbrus
avoid creating UnresolvedGetField expressions during resolution. Instead we either resolve GetField or we return None. This avoids us going down the wrong path early on. Author: Michael Armbrust mich...@databricks.com Closes #5189 from marmbrus/nestedOrderBy and squashes the following commits

spark git commit: [SPARK-5371][SQL] Propagate types after function conversion, before futher resolution

2015-03-31 Thread marmbrus
marmbrus/unionNull and squashes the following commits: dc3581a [Michael Armbrust] [SPARK-5371][SQL] Propogate types after function conversion / before futher resolution (cherry picked from commit beebb7ffc21c66ae3e4c61194d1e19ede1bb) Signed-off-by: Michael Armbrust mich...@databricks.com

spark git commit: [SPARK-6555] [SQL] Overrides equals() and hashCode() for MetastoreRelation

2015-03-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 fd600cec0 - 9ebefb1f1 [SPARK-6555] [SQL] Overrides equals() and hashCode() for MetastoreRelation Also removes temporary workarounds made in #5183 and #5251. !-- Reviewable:start -- [img src=https://reviewable.io/review_button.png;

spark git commit: [SPARK-6555] [SQL] Overrides equals() and hashCode() for MetastoreRelation

2015-03-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d01a6d8c3 - a7992ffaf [SPARK-6555] [SQL] Overrides equals() and hashCode() for MetastoreRelation Also removes temporary workarounds made in #5183 and #5251. !-- Reviewable:start -- [img src=https://reviewable.io/review_button.png;

spark git commit: [SPARK-6550][SQL] Use analyzed plan in DataFrame

2015-03-27 Thread marmbrus
for the left side of the join, rendering the previously returned `AttributeReference` invalid. As a fix, I propose we keep the analyzed plan instead of the unresolved plan inside of a `DataFrame`. Author: Michael Armbrust mich...@databricks.com Closes #5217 from marmbrus/preanalyzer and squashes

spark git commit: [SPARK-6550][SQL] Use analyzed plan in DataFrame

2015-03-27 Thread marmbrus
for the left side of the join, rendering the previously returned `AttributeReference` invalid. As a fix, I propose we keep the analyzed plan instead of the unresolved plan inside of a `DataFrame`. Author: Michael Armbrust mich...@databricks.com Closes #5217 from marmbrus/preanalyzer and squashes

spark git commit: [SPARK-6574] [PySpark] fix sql example

2015-03-27 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 5d9c37c23 - 887e1b72d [SPARK-6574] [PySpark] fix sql example Fix the import in sql example. Author: Davies Liu dav...@databricks.com Closes #5230 from davies/fix_sql_example and squashes the following commits: 7ecc5f4 [Davies Liu] fix

spark git commit: [SPARK-6574] [PySpark] fix sql example

2015-03-27 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 bc7518956 - b902a9552 [SPARK-6574] [PySpark] fix sql example Fix the import in sql example. Author: Davies Liu dav...@databricks.com Closes #5230 from davies/fix_sql_example and squashes the following commits: 7ecc5f4 [Davies Liu]

spark git commit: [SPARK-6554] [SQL] Don't push down predicates which reference partition column(s)

2015-03-26 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 784fcd532 - 71a0d40eb [SPARK-6554] [SQL] Don't push down predicates which reference partition column(s) There are two cases for the new Parquet data source: 1. Partition columns exist in the Parquet data files We don't need to

spark git commit: [DOCS][SQL] Fix JDBC example

2015-03-26 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 3d545782e - 54d92b542 [DOCS][SQL] Fix JDBC example Author: Michael Armbrust mich...@databricks.com Closes #5192 from marmbrus/fixJDBCDocs and squashes the following commits: b48a33d [Michael Armbrust] [DOCS][SQL] Fix JDBC example

spark git commit: [DOCS][SQL] Fix JDBC example

2015-03-26 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 71a0d40eb - aad003227 [DOCS][SQL] Fix JDBC example Author: Michael Armbrust mich...@databricks.com Closes #5192 from marmbrus/fixJDBCDocs and squashes the following commits: b48a33d [Michael Armbrust] [DOCS][SQL] Fix JDBC example

spark git commit: [SPARK-6409][SQL] It is not necessary that avoid old inteface of hive, because this will make some UDAF can not work.

2015-03-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 982952f4a - 968408b34 [SPARK-6409][SQL] It is not necessary that avoid old inteface of hive, because this will make some UDAF can not work. spark avoid old inteface of hive, then some udaf can not work like

spark git commit: [SPARK-6409][SQL] It is not necessary that avoid old inteface of hive, because this will make some UDAF can not work.

2015-03-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 6791f425d - 4efa6c5dd [SPARK-6409][SQL] It is not necessary that avoid old inteface of hive, because this will make some UDAF can not work. spark avoid old inteface of hive, then some udaf can not work like

spark git commit: [SPARK-6202] [SQL] enable variable substitution on test framework

2015-03-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 328daf65f - 5ab6e9f0c [SPARK-6202] [SQL] enable variable substitution on test framework Author: Daoyuan Wang daoyuan.w...@intel.com Closes #4930 from adrian-wang/testvs and squashes the following commits: 2ce590f [Daoyuan Wang] add

spark git commit: [SPARK-6450] [SQL] Fixes metastore Parquet table conversion

2015-03-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d44a3362e - 8c3b0052f [SPARK-6450] [SQL] Fixes metastore Parquet table conversion The `ParquetConversions` analysis rule generates a hash map, which maps from the original `MetastoreRelation` instances to the newly created

spark git commit: [SPARK-6450] [SQL] Fixes metastore Parquet table conversion

2015-03-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.3 4efa6c5dd - 0cd474893 [SPARK-6450] [SQL] Fixes metastore Parquet table conversion The `ParquetConversions` analysis rule generates a hash map, which maps from the original `MetastoreRelation` instances to the newly created

spark git commit: [SPARK-6463][SQL] AttributeSet.equal should compare size

2015-03-25 Thread marmbrus
...@huawei.com Author: Michael Armbrust mich...@databricks.com Closes #5194 from marmbrus/pr/5133 and squashes the following commits: 5ed4615 [Michael Armbrust] fix imports d4cbbc0 [Michael Armbrust] Add test cases 0a0834f [sisihj] AttributeSet.equal should compare size (cherry picked from commit

spark git commit: [SPARK-6463][SQL] AttributeSet.equal should compare size

2015-03-25 Thread marmbrus
...@huawei.com Author: Michael Armbrust mich...@databricks.com Closes #5194 from marmbrus/pr/5133 and squashes the following commits: 5ed4615 [Michael Armbrust] fix imports d4cbbc0 [Michael Armbrust] Add test cases 0a0834f [sisihj] AttributeSet.equal should compare size Project: http://git-wip

spark git commit: [SPARK-5498][SQL]fix query exception when partition schema does not match table schema

2015-03-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 8c3b0052f - e6d1406ab [SPARK-5498][SQL]fix query exception when partition schema does not match table schema In hive,the schema of partition may be difference from the table schema.When we use spark-sql to query the data of partition

spark git commit: [SPARK-6326][SQL] Improve castStruct to be faster

2015-03-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master e6d1406ab - 73d57754d [SPARK-6326][SQL] Improve castStruct to be faster Current `castStruct` should be very slow. This pr slightly improves it. Author: Liang-Chi Hsieh vii...@gmail.com Closes #5017 from viirya/faster_caststruct and

spark git commit: [SPARK-6271][SQL] Sort these tokens in alphabetic order to avoid further duplicate in HiveQl

2015-03-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 73d57754d - 328daf65f [SPARK-6271][SQL] Sort these tokens in alphabetic order to avoid further duplicate in HiveQl Author: DoingDone9 799203...@qq.com Closes #4973 from DoingDone9/sort_token and squashes the following commits: 855fa10

<    3   4   5   6   7   8   9   10   11   12   >