spark git commit: Revert "[SPARK-9372] [SQL] Filter nulls in join keys"

2015-08-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master 702aa9d7f -> b2e4b85d2 Revert "[SPARK-9372] [SQL] Filter nulls in join keys" This reverts commit 687c8c37150f4c93f8e57d86bb56321a4891286b. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/rep

spark git commit: Add a prerequisites section for building docs

2015-08-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master 13675c742 -> 7abaaad5b Add a prerequisites section for building docs This puts all the install commands that need to be run in one section instead of being spread over many paragraphs cc rxin Author: Shivaram Venkataraman Closes #7

spark git commit: [SPARK-9577][SQL] Surface concrete iterator types in various sort classes.

2015-08-03 Thread rxin
n a more concrete type, so the callsite uses that type and JIT can inline the iterator calls. Author: Reynold Xin Closes #7911 from rxin/surface-concrete-type and squashes the following commits: 0422add [Reynold Xin] [SPARK-9577][SQL] Surface concrete iterator types in various sort classes. Proj

spark git commit: [SPARK-9577][SQL] Surface concrete iterator types in various sort classes.

2015-08-03 Thread rxin
n a more concrete type, so the callsite uses that type and JIT can inline the iterator calls. Author: Reynold Xin Closes #7911 from rxin/surface-concrete-type and squashes the following commits: 0422add [Reynold Xin] [SPARK-9577][SQL] Surface concrete iterator types in various sort clas

spark git commit: [SPARK-3190] [GRAPHX] Fix VertexRDD.count() overflow regression

2015-08-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.4 f904b46c9 -> dea04bf84 [SPARK-3190] [GRAPHX] Fix VertexRDD.count() overflow regression SPARK-3190 was originally fixed by 96df92906978c5f58e0cc8ff5eebe5b35a08be3b, but a5ef58113667ff73562ce6db381cff96a0b354b0 introduced a regression du

spark git commit: [SPARK-3190] [GRAPHX] Fix VertexRDD.count() overflow regression

2015-08-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master 0afa6fbf5 -> 9e952ecbc [SPARK-3190] [GRAPHX] Fix VertexRDD.count() overflow regression SPARK-3190 was originally fixed by 96df92906978c5f58e0cc8ff5eebe5b35a08be3b, but a5ef58113667ff73562ce6db381cff96a0b354b0 introduced a regression during

spark git commit: [SPARK-3190] [GRAPHX] Fix VertexRDD.count() overflow regression

2015-08-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 1f7dbcd6f -> 29f2d5a06 [SPARK-3190] [GRAPHX] Fix VertexRDD.count() overflow regression SPARK-3190 was originally fixed by 96df92906978c5f58e0cc8ff5eebe5b35a08be3b, but a5ef58113667ff73562ce6db381cff96a0b354b0 introduced a regression du

spark git commit: [SPARK-3190] [GRAPHX] Fix VertexRDD.count() overflow regression

2015-08-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.3 265ec35bc -> cd5d1be6e [SPARK-3190] [GRAPHX] Fix VertexRDD.count() overflow regression SPARK-3190 was originally fixed by 96df92906978c5f58e0cc8ff5eebe5b35a08be3b, but a5ef58113667ff73562ce6db381cff96a0b354b0 introduced a regression du

spark git commit: [SPARK-9553][SQL] remove the no-longer-necessary createCode and createStructCode, and replace the usage

2015-08-04 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 be37b1bd3 -> 43f6b021e [SPARK-9553][SQL] remove the no-longer-necessary createCode and createStructCode, and replace the usage Author: Wenchen Fan Closes #7890 from cloud-fan/minor and squashes the following commits: c3b1be3 [Wenche

spark git commit: [SPARK-9553][SQL] remove the no-longer-necessary createCode and createStructCode, and replace the usage

2015-08-04 Thread rxin
Repository: spark Updated Branches: refs/heads/master a0cc01759 -> f4b1ac08a [SPARK-9553][SQL] remove the no-longer-necessary createCode and createStructCode, and replace the usage Author: Wenchen Fan Closes #7890 from cloud-fan/minor and squashes the following commits: c3b1be3 [Wenchen Fa

spark git commit: [SPARK-9452] [SQL] Support records larger than page size in UnsafeExternalSorter

2015-08-04 Thread rxin
Repository: spark Updated Branches: refs/heads/master f4b1ac08a -> ab8ee1a3b [SPARK-9452] [SQL] Support records larger than page size in UnsafeExternalSorter This patch extends UnsafeExternalSorter to support records larger than the page size. The basic strategy is the same as in #7762: store

spark git commit: [SPARK-9452] [SQL] Support records larger than page size in UnsafeExternalSorter

2015-08-04 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 43f6b021e -> f771a83f4 [SPARK-9452] [SQL] Support records larger than page size in UnsafeExternalSorter This patch extends UnsafeExternalSorter to support records larger than the page size. The basic strategy is the same as in #7762: s

spark git commit: [SPARK-9602] remove "Akka/Actor" words from comments

2015-08-04 Thread rxin
Repository: spark Updated Branches: refs/heads/master ab8ee1a3b -> 9d668b736 [SPARK-9602] remove "Akka/Actor" words from comments https://issues.apache.org/jira/browse/SPARK-9602 Although we have hidden Akka behind the RPC interface, I found that the Akka/Actor-related comments are still spreadi

spark git commit: [SPARK-9602] remove "Akka/Actor" words from comments

2015-08-04 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 f771a83f4 -> 560b2da78 [SPARK-9602] remove "Akka/Actor" words from comments https://issues.apache.org/jira/browse/SPARK-9602 Although we have hidden Akka behind the RPC interface, I found that the Akka/Actor-related comments are still spr

spark git commit: [SPARK-9598][SQL] do not expose generic getter in internal row

2015-08-04 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 cff0fe291 -> 1954a7bb1 [SPARK-9598][SQL] do not expose generic getter in internal row Author: Wenchen Fan Closes #7932 from cloud-fan/generic-getter and squashes the following commits: c60de4c [Wenchen Fan] do not expose generic gett

spark git commit: [SPARK-9598][SQL] do not expose generic getter in internal row

2015-08-04 Thread rxin
Repository: spark Updated Branches: refs/heads/master b77d3b968 -> 7c8fc1f7c [SPARK-9598][SQL] do not expose generic getter in internal row Author: Wenchen Fan Closes #7932 from cloud-fan/generic-getter and squashes the following commits: c60de4c [Wenchen Fan] do not expose generic getter i

spark git commit: [SPARK-9432][SQL] Audit expression unit tests to make sure we pass the proper numeric ranges

2015-08-04 Thread rxin
Repository: spark Updated Branches: refs/heads/master d92fa1417 -> a7fe48f68 [SPARK-9432][SQL] Audit expression unit tests to make sure we pass the proper numeric ranges JIRA: https://issues.apache.org/jira/browse/SPARK-9432 Author: Yijie Shen Closes #7933 from yjshen/numeric_ranges and sq

spark git commit: [SPARK-9432][SQL] Audit expression unit tests to make sure we pass the proper numeric ranges

2015-08-04 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 2237ddbe0 -> 02a6333d2 [SPARK-9432][SQL] Audit expression unit tests to make sure we pass the proper numeric ranges JIRA: https://issues.apache.org/jira/browse/SPARK-9432 Author: Yijie Shen Closes #7933 from yjshen/numeric_ranges an

spark git commit: [SPARK-9513] [SQL] [PySpark] Add python API for DataFrame functions

2015-08-04 Thread rxin
of functions (for BinaryType) does not have tests. cc rxin Author: Davies Liu Closes #7922 from davies/python_functions and squashes the following commits: 8ad942f [Davies Liu] fix test 5fb6ec3 [Davies Liu] fix bugs 3495ed3 [Davies Liu] fix issues ea5f7bb [Davies Liu] Add python API for DataFr

spark git commit: [SPARK-9513] [SQL] [PySpark] Add python API for DataFrame functions

2015-08-04 Thread rxin
of functions (for BinaryType) does not have tests. cc rxin Author: Davies Liu Closes #7922 from davies/python_functions and squashes the following commits: 8ad942f [Davies Liu] fix test 5fb6ec3 [Davies Liu] fix bugs 3495ed3 [Davies Liu] fix issues ea5f7bb [Davies Liu] Add python API for DataFr

spark git commit: Update docs/README.md to put all prereqs together.

2015-08-04 Thread rxin
Repository: spark Updated Branches: refs/heads/master d34bac0e1 -> f7abd6bec Update docs/README.md to put all prereqs together. This pull request groups all the prereq requirements into a single section. cc srowen shivaram Author: Reynold Xin Closes #7951 from rxin/readme-docs

[1/2] spark git commit: Add a prerequisites section for building docs

2015-08-04 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 864d5de6d -> b6e8446a4 Add a prerequisites section for building docs This puts all the install commands that need to be run in one section instead of being spread over many paragraphs cc rxin Author: Shivaram Venkataraman Clo

[2/2] spark git commit: Update docs/README.md to put all prereqs together.

2015-08-04 Thread rxin
Update docs/README.md to put all prereqs together. This pull request groups all the prereq requirements into a single section. cc srowen shivaram Author: Reynold Xin Closes #7951 from rxin/readme-docs and squashes the following commits: ab7ded0 [Reynold Xin] Updated docs/README.md to put all

spark git commit: [SPARK-9581][SQL] Add unit test for JSON UDT

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/master c2a71f071 -> 1d1a76c8c [SPARK-9581][SQL] Add unit test for JSON UDT This brings #7416 up-to-date by drubbo. Author: Emiliano Leporati Author: Reynold Xin Closes #7917 from rxin/udt-json-test and squashes the following commits: 93e3

spark git commit: [SPARK-9581][SQL] Add unit test for JSON UDT

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 ea23e54ff -> 57596fb8c [SPARK-9581][SQL] Add unit test for JSON UDT This brings #7416 up-to-date by drubbo. Author: Emiliano Leporati Author: Reynold Xin Closes #7917 from rxin/udt-json-test and squashes the following comm

spark git commit: Closes #7917

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/master 1d1a76c8c -> d8ef538e5 Closes #7917 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/d8ef538e Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/d8ef538e Diff: http

[1/2] spark git commit: [SPARK-8861][SPARK-8862][SQL] Add basic instrumentation to each SparkPlan operator and add a new SQL tab

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/master 1bf608b5e -> 1b0317f64 http://git-wip-us.apache.org/repos/asf/spark/blob/1b0317f6/sql/core/src/test/scala/org/apache/spark/sql/ui/SQLListenerSuite.scala -- diff --git a/sql

[2/2] spark git commit: [SPARK-8861][SPARK-8862][SQL] Add basic instrumentation to each SparkPlan operator and add a new SQL tab

2015-08-05 Thread rxin
[SPARK-8861][SPARK-8862][SQL] Add basic instrumentation to each SparkPlan operator and add a new SQL tab This PR includes the following changes: ### SPARK-8862: Add basic instrumentation to each SparkPlan operator A SparkPlan can override `def accumulators: Map[String, Accumulator[_]]` to expo

[1/2] spark git commit: [SPARK-8861][SPARK-8862][SQL] Add basic instrumentation to each SparkPlan operator and add a new SQL tab

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 6306019ff -> ebc3aad27 http://git-wip-us.apache.org/repos/asf/spark/blob/ebc3aad2/sql/core/src/test/scala/org/apache/spark/sql/ui/SQLListenerSuite.scala -- diff --git a

[2/2] spark git commit: [SPARK-8861][SPARK-8862][SQL] Add basic instrumentation to each SparkPlan operator and add a new SQL tab

2015-08-05 Thread rxin
[SPARK-8861][SPARK-8862][SQL] Add basic instrumentation to each SparkPlan operator and add a new SQL tab This PR includes the following changes: ### SPARK-8862: Add basic instrumentation to each SparkPlan operator A SparkPlan can override `def accumulators: Map[String, Accumulator[_]]` to expo

spark git commit: [SPARK-9628][SQL]Rename int to SQLDate, long to SQLTimestamp for better readability

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 ebc3aad27 -> f288cca3d [SPARK-9628][SQL]Rename int to SQLDate, long to SQLTimestamp for better readability JIRA: https://issues.apache.org/jira/browse/SPARK-9628 Author: Yijie Shen Closes #7953 from yjshen/datetime_alias and squashe

spark git commit: [SPARK-9628][SQL]Rename int to SQLDate, long to SQLTimestamp for better readability

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/master 1b0317f64 -> 84ca3183b [SPARK-9628][SQL]Rename int to SQLDate, long to SQLTimestamp for better readability JIRA: https://issues.apache.org/jira/browse/SPARK-9628 Author: Yijie Shen Closes #7953 from yjshen/datetime_alias and squashes th

spark git commit: Closes #7778 since it is done as #7893.

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/master e1e05873f -> eb5b8f4a6 Closes #7778 since it is done as #7893. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/eb5b8f4a Tree: http://git-wip-us.apache.org/repos/asf/spa

spark git commit: Closes #7474 since it's marked as won't fix.

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/master 5f0fb6466 -> f9c2a2af1 Closes #7474 since it's marked as won't fix. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/f9c2a2af Tree: http://git-wip-us.apache.org/repos/as

spark git commit: [SPARK-9651] Fix UnsafeExternalSorterSuite.

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/master 8c320e45b -> 4399b7b09 [SPARK-9651] Fix UnsafeExternalSorterSuite. First, it's probably a bad idea to call generated Scala methods from Java. In this case, the method being called wasn't actually "Utils.createTempDir()", but actually the me

spark git commit: [SPARK-9651] Fix UnsafeExternalSorterSuite.

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 5f037b3dc -> eb2229ac0 [SPARK-9651] Fix UnsafeExternalSorterSuite. First, it's probably a bad idea to call generated Scala methods from Java. In this case, the method being called wasn't actually "Utils.createTempDir()", but actually th

spark git commit: [SPARK-9674][SQL] Remove GeneratedAggregate.

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/master 119b59053 -> 9270bd06f [SPARK-9674][SQL] Remove GeneratedAggregate. The new aggregate replaces the old GeneratedAggregate. Author: Reynold Xin Closes #7983 from rxin/remove-generated-agg and squashes the following commits: 8334

spark git commit: [SPARK-9664] [SQL] Remove UDAFRegistration and add apply to UserDefinedAggregateFunction.

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/master 9270bd06f -> d5a9af323 [SPARK-9664] [SQL] Remove UDAFRegistration and add apply to UserDefinedAggregateFunction. https://issues.apache.org/jira/browse/SPARK-9664 Author: Yin Huai Closes #7982 from yhuai/udafRegister and squashes the fol

spark git commit: [SPARK-9674][SQL] Remove GeneratedAggregate.

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 f24cd8cb9 -> 252eb6193 [SPARK-9674][SQL] Remove GeneratedAggregate. The new aggregate replaces the old GeneratedAggregate. Author: Reynold Xin Closes #7983 from rxin/remove-generated-agg and squashes the following commits: 8334

spark git commit: [SPARK-9664] [SQL] Remove UDAFRegistration and add apply to UserDefinedAggregateFunction.

2015-08-05 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 252eb6193 -> 29ace3bbf [SPARK-9664] [SQL] Remove UDAFRegistration and add apply to UserDefinedAggregateFunction. https://issues.apache.org/jira/browse/SPARK-9664 Author: Yin Huai Closes #7982 from yhuai/udafRegister and squashes the

spark git commit: [SPARK-9659][SQL] Rename inSet to isin to match Pandas function.

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/master 98e69467d -> 5e1b0ef07 [SPARK-9659][SQL] Rename inSet to isin to match Pandas function. Inspiration drawn from this blog post: https://lab.getbase.com/pandarize-spark-dataframes/ Author: Reynold Xin Closes #7977 from rxin/isin

spark git commit: [SPARK-9659][SQL] Rename inSet to isin to match Pandas function.

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 78f168e97 -> 6b8d2d7ed [SPARK-9659][SQL] Rename inSet to isin to match Pandas function. Inspiration drawn from this blog post: https://lab.getbase.com/pandarize-spark-dataframes/ Author: Reynold Xin Closes #7977 from rxin/isin

spark git commit: [SPARK-9632][SQL] update InternalRow.toSeq to make it accept data type info

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 6b8d2d7ed -> 2382b483a [SPARK-9632][SQL] update InternalRow.toSeq to make it accept data type info Author: Wenchen Fan Closes #7955 from cloud-fan/toSeq and squashes the following commits: 21665e2 [Wenchen Fan] fix hive again... 4add

spark git commit: [SPARK-9632][SQL] update InternalRow.toSeq to make it accept data type info

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/master 5e1b0ef07 -> 6e009cb9c [SPARK-9632][SQL] update InternalRow.toSeq to make it accept data type info Author: Wenchen Fan Closes #7955 from cloud-fan/toSeq and squashes the following commits: 21665e2 [Wenchen Fan] fix hive again... 4addf29

spark git commit: [SPARK-9632][SQL] update InternalRow.toSeq to make it accept data type info

2015-08-06 Thread rxin
ses #8002 from rxin/InternalRow-toSeq and squashes the following commits: 332416a [Reynold Xin] Merge pull request #7955 from cloud-fan/toSeq 21665e2 [Wenchen Fan] fix hive again... 4addf29 [Wenchen Fan] fix hive bc16c59 [Wenchen Fan] minor fix 33d802c [Wenchen Fan] pass data type info

spark git commit: [SPARK-9618] [SQL] Use the specified schema when reading Parquet files

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 8b00c0690 -> d5f788121 [SPARK-9618] [SQL] Use the specified schema when reading Parquet files The user specified schema is currently ignored when loading Parquet files. One workaround is to use the `format` and `load` methods instead o

spark git commit: [SPARK-9381] [SQL] Migrate JSON data source to the new partitioning data source

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 d5f788121 -> 3d247672b [SPARK-9381] [SQL] Migrate JSON data source to the new partitioning data source Support partitioning for the JSON data source. Still 2 open issues for the `HadoopFsRelation` - `refresh()` will invoke the `discove

spark git commit: [SPARK-6923] [SPARK-7550] [SQL] Persists data source relations in Hive compatible format when possible

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 3d247672b -> 92e8acc98 [SPARK-6923] [SPARK-7550] [SQL] Persists data source relations in Hive compatible format when possible This PR is a fork of PR #5733 authored by chenghao-intel. For committers who are going to merge this PR, plea

spark git commit: [SPARK-9548][SQL] Add a destructive iterator for BytesToBytesMap

2015-08-06 Thread rxin
the effort to avoid starving when we have more than one operator that can exhaust memory. This is based on #7924, but fixes a bug there (Don't use destructive iterator in UnsafeKVExternalSorter). Closes #7924. Author: Liang-Chi Hsieh Author: Reynold Xin Closes #8003 from rxin/map-des

spark git commit: [SPARK-9548][SQL] Add a destructive iterator for BytesToBytesMap

2015-08-06 Thread rxin
ort to avoid starving when we have more than one operator that can exhaust memory. This is based on #7924, but fixes a bug there (Don't use destructive iterator in UnsafeKVExternalSorter). Closes #7924. Author: Liang-Chi Hsieh Author: Reynold Xin Closes #8003 from rxin/map-destructive-

[2/2] spark git commit: [SPARK-9630] [SQL] Clean up new aggregate operators (SPARK-9240 follow up)

2015-08-06 Thread rxin
[SPARK-9630] [SQL] Clean up new aggregate operators (SPARK-9240 follow up) This is the followup of https://github.com/apache/spark/pull/7813. It renames `HybridUnsafeAggregationIterator` to `TungstenAggregationIterator` and makes it only work with `UnsafeRow`. Also, I add a `TungstenAggregate` t

[2/2] spark git commit: [SPARK-9630] [SQL] Clean up new aggregate operators (SPARK-9240 follow up)

2015-08-06 Thread rxin
[SPARK-9630] [SQL] Clean up new aggregate operators (SPARK-9240 follow up) This is the followup of https://github.com/apache/spark/pull/7813. It renames `HybridUnsafeAggregationIterator` to `TungstenAggregationIterator` and makes it only work with `UnsafeRow`. Also, I add a `TungstenAggregate` t

[1/2] spark git commit: [SPARK-9630] [SQL] Clean up new aggregate operators (SPARK-9240 follow up)

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 980687206 -> 272e88342 http://git-wip-us.apache.org/repos/asf/spark/blob/272e8834/sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/utils.scala -- diff --

[1/2] spark git commit: [SPARK-9630] [SQL] Clean up new aggregate operators (SPARK-9240 follow up)

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/master 346209097 -> 3504bf3aa http://git-wip-us.apache.org/repos/asf/spark/blob/3504bf3a/sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/utils.scala -- diff --git

spark git commit: [SPARK-9691] [SQL] PySpark SQL rand function treats seed 0 as no seed

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 985e454cb -> 75b4e5ab3 [SPARK-9691] [SQL] PySpark SQL rand function treats seed 0 as no seed https://issues.apache.org/jira/browse/SPARK-9691 jkbradley rxin Author: Yin Huai Closes #7999 from yhuai/pythonRand and squashes

spark git commit: [SPARK-9691] [SQL] PySpark SQL rand function treats seed 0 as no seed

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/master 681e3024b -> baf4587a5 [SPARK-9691] [SQL] PySpark SQL rand function treats seed 0 as no seed https://issues.apache.org/jira/browse/SPARK-9691 jkbradley rxin Author: Yin Huai Closes #7999 from yhuai/pythonRand and squashes the follow

spark git commit: [SPARK-9691] [SQL] PySpark SQL rand function treats seed 0 as no seed

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.4 116f61187 -> e5a994f21 [SPARK-9691] [SQL] PySpark SQL rand function treats seed 0 as no seed https://issues.apache.org/jira/browse/SPARK-9691 jkbradley rxin Author: Yin Huai Closes #7999 from yhuai/pythonRand and squashes
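
The bug named in the subject above (seed 0 treated as no seed) is the kind of issue that commonly arises when an optional argument is tested for truthiness instead of being compared against `None`. The following is a minimal sketch of that pattern in plain Python. It is not the PySpark source, and `rand_buggy` and `rand_fixed` are hypothetical names used only for illustration.

```python
import random
from typing import Optional


def rand_buggy(seed: Optional[int] = None) -> float:
    """Hypothetical helper: a truthiness check silently drops seed 0."""
    # `if seed` is False for 0, so seed 0 takes the unseeded path.
    rng = random.Random(seed) if seed else random.Random()
    return rng.random()


def rand_fixed(seed: Optional[int] = None) -> float:
    """Hypothetical helper: only None means that no seed was given."""
    rng = random.Random(seed) if seed is not None else random.Random()
    return rng.random()
```

With the explicit `is not None` check, two calls with seed 0 return the same value, as they would for any other seed.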

spark git commit: [SPARK-9650][SQL] Fix quoting behavior on interpolated column names

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 b4feccf6c -> 9be9d3842 [SPARK-9650][SQL] Fix quoting behavior on interpolated column names Make sure that `$"column"` is consistent with other methods with respect to backticks. Adds a bunch of tests for various ways of constructing c

spark git commit: [SPARK-9650][SQL] Fix quoting behavior on interpolated column names

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/master 4e70e8256 -> 0867b23c7 [SPARK-9650][SQL] Fix quoting behavior on interpolated column names Make sure that `$"column"` is consistent with other methods with respect to backticks. Adds a bunch of tests for various ways of constructing colum

spark git commit: [SPARK-9692] Remove SqlNewHadoopRDD's generated Tuple2 and InterruptibleIterator.

2015-08-06 Thread rxin
don't need an extra wrapper from InterruptibleIterator. Author: Reynold Xin Closes #8000 from rxin/SPARK-9692 and squashes the following commits: 1d4d0b3 [Reynold Xin] [SPARK-9692] Remove SqlNewHadoopRDD's generated Tuple2 and InterruptibleIterator. Project: http://git-wip-us.apa

spark git commit: [SPARK-9692] Remove SqlNewHadoopRDD's generated Tuple2 and InterruptibleIterator.

2015-08-06 Thread rxin
so don't need an extra wrapper from InterruptibleIterator. Author: Reynold Xin Closes #8000 from rxin/SPARK-9692 and squashes the following commits: 1d4d0b3 [Reynold Xin] [SPARK-9692] Remove SqlNewHadoopRDD's generated Tuple2 and InterruptibleIterator. (ch

spark git commit: [SPARK-9709] [SQL] Avoid starving unsafe operators that use sort

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/master b87825310 -> 014a9f9d8 [SPARK-9709] [SQL] Avoid starving unsafe operators that use sort The issue is that a task may run multiple sorts, and the sorts run by the child operator (i.e. parent RDD) may acquire all available memory such that o

spark git commit: [SPARK-9709] [SQL] Avoid starving unsafe operators that use sort

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 37b6403cb -> 472f0dc34 [SPARK-9709] [SQL] Avoid starving unsafe operators that use sort The issue is that a task may run multiple sorts, and the sorts run by the child operator (i.e. parent RDD) may acquire all available memory such th

spark git commit: [SPARK-9228] [SQL] use tungsten.enabled in public for both of codegen/unsafe

2015-08-06 Thread rxin
rus rxin Author: Davies Liu Closes #7998 from davies/tungsten and squashes the following commits: c1c16da [Davies Liu] update doc 1a47be1 [Davies Liu] use tungsten.enabled for both of codegen/unsafe (cherry picked from commit 4e70e8256ce2f45b438642372329eac7b1e9e8cf) Signed-off-by: Reynold

spark git commit: Fix doc typo

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/master 17284db31 -> fe12277b4 Fix doc typo Straightforward fix on doc typo Author: Jeff Zhang Closes #8019 from zjffdu/master and squashes the following commits: aed6e64 [Jeff Zhang] Fix doc typo Project: http://git-wip-us.apache.org/repos/a

spark git commit: Fix doc typo

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 472f0dc34 -> 5491dfb9a Fix doc typo Straightforward fix on doc typo Author: Jeff Zhang Closes #8019 from zjffdu/master and squashes the following commits: aed6e64 [Jeff Zhang] Fix doc typo (cherry picked from commit fe12277b4008258

spark git commit: [SPARK-8057][Core]Call TaskAttemptContext.getTaskAttemptID using Reflection

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 5491dfb9a -> e902c4f26 [SPARK-8057][Core]Call TaskAttemptContext.getTaskAttemptID using Reflection Someone may use the Spark core jar in the maven repo with hadoop 1. SPARK-2075 has already resolved the compatibility issue to support i

spark git commit: [SPARK-8057][Core]Call TaskAttemptContext.getTaskAttemptID using Reflection

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/master fe12277b4 -> 672f46766 [SPARK-8057][Core]Call TaskAttemptContext.getTaskAttemptID using Reflection Someone may use the Spark core jar in the maven repo with hadoop 1. SPARK-2075 has already resolved the compatibility issue to support it. B

spark git commit: [SPARK-7550] [SQL] [MINOR] Fixes logs when persisting DataFrames

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/master 672f46766 -> f0cda587f [SPARK-7550] [SQL] [MINOR] Fixes logs when persisting DataFrames Author: Cheng Lian Closes #8021 from liancheng/spark-7550/fix-logs and squashes the following commits: b7bd0ed [Cheng Lian] Fixes logs Project: ht

spark git commit: [SPARK-7550] [SQL] [MINOR] Fixes logs when persisting DataFrames

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 e902c4f26 -> aedc8f3c3 [SPARK-7550] [SQL] [MINOR] Fixes logs when persisting DataFrames Author: Cheng Lian Closes #8021 from liancheng/spark-7550/fix-logs and squashes the following commits: b7bd0ed [Cheng Lian] Fixes logs (cherry

spark git commit: [SPARK-8862][SQL]Support multiple SQLContexts in Web UI

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/master f0cda587f -> 7aaed1b11 [SPARK-8862][SQL]Support multiple SQLContexts in Web UI This is a follow-up PR to solve the UI issue when there are multiple SQLContexts. Each SQLContext has a separate tab and contains queries which are executed by

spark git commit: [SPARK-8862][SQL]Support multiple SQLContexts in Web UI

2015-08-06 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 aedc8f3c3 -> c34fdaf55 [SPARK-8862][SQL]Support multiple SQLContexts in Web UI This is a follow-up PR to solve the UI issue when there are multiple SQLContexts. Each SQLContext has a separate tab and contains queries which are execute

spark git commit: [SPARK-9700] Pick default page size more intelligently.

2015-08-06 Thread rxin
so that the default page size, if unset by the user, is determined by the number of cores available and the total execution memory available. Author: Reynold Xin Closes #8012 from rxin/pagesize and squashes the following commits: 16f4756 [Reynold Xin] Fixed failing test. 5afd570 [Reynold

spark git commit: [SPARK-9700] Pick default page size more intelligently.

2015-08-06 Thread rxin
ges it so that the default page size, if unset by the user, is determined by the number of cores available and the total execution memory available. Author: Reynold Xin Closes #8012 from rxin/pagesize and squashes the following commits: 16f4756 [Reynold Xin] Fixed failing test. 5afd570 [Reynold
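
The snippet above says only that the default page size is derived from the number of cores and the total execution memory. One way such a heuristic could look is sketched below. The safety factor, the 1 MiB and 64 MiB bounds, and the rounding to a power of two are illustrative assumptions, not values taken from the commit, and `default_page_size` is a hypothetical name.

```python
def next_power_of_two(n: int) -> int:
    """Return the smallest power of two that is >= n (1 for n <= 1)."""
    return 1 if n <= 1 else 1 << (n - 1).bit_length()


def default_page_size(execution_memory_bytes: int, cores: int,
                      safety_factor: int = 16,      # assumed value
                      min_page: int = 1 << 20,      # assumed lower bound, 1 MiB
                      max_page: int = 64 << 20) -> int:  # assumed upper bound, 64 MiB
    """Hypothetical heuristic: split memory per core, then round and clamp."""
    per_core = execution_memory_bytes // max(cores, 1) // safety_factor
    size = next_power_of_two(per_core)
    return min(max_page, max(min_page, size))
```

Dividing by a safety factor leaves room for several pages per core, and the clamp keeps the result within a sensible range on very small or very large machines.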

spark git commit: [SPARK-9467][SQL]Add SQLMetric to specialize accumulators to avoid boxing

2015-08-07 Thread rxin
Repository: spark Updated Branches: refs/heads/master e57d6b561 -> ebfd91c54 [SPARK-9467][SQL]Add SQLMetric to specialize accumulators to avoid boxing This PR adds SQLMetric/SQLMetricParam/SQLMetricValue to specialize accumulators to avoid boxing. All SQL metrics should use these classes rath

spark git commit: [SPARK-9467][SQL]Add SQLMetric to specialize accumulators to avoid boxing

2015-08-07 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 064ba906a -> 7a6f95050 [SPARK-9467][SQL]Add SQLMetric to specialize accumulators to avoid boxing This PR adds SQLMetric/SQLMetricParam/SQLMetricValue to specialize accumulators to avoid boxing. All SQL metrics should use these classes

spark git commit: [SPARK-9674][SPARK-9667] Remove SparkSqlSerializer2

2015-08-07 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 7a6f95050 -> 6c2f30c10 [SPARK-9674][SPARK-9667] Remove SparkSqlSerializer2 It is now subsumed by various Tungsten operators. Author: Reynold Xin Closes #7981 from rxin/SPARK-9674 and squashes the following commits: 144f96e [Reyn

spark git commit: [SPARK-9674][SPARK-9667] Remove SparkSqlSerializer2

2015-08-07 Thread rxin
Repository: spark Updated Branches: refs/heads/master ebfd91c54 -> 76eaa7018 [SPARK-9674][SPARK-9667] Remove SparkSqlSerializer2 It is now subsumed by various Tungsten operators. Author: Reynold Xin Closes #7981 from rxin/SPARK-9674 and squashes the following commits: 144f96e [Reynold

spark git commit: [SPARK-9667][SQL] followup: Use GenerateUnsafeProjection.canSupport to test Exchange supported data types.

2015-08-07 Thread rxin
rom rxin/cansupport and squashes the following commits: f7302ff [Reynold Xin] Can GenerateUnsafeProjection.canSupport to test Exchange supported data types. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/aeddeafc Tree: http://

spark git commit: [SPARK-9667][SQL] followup: Use GenerateUnsafeProjection.canSupport to test Exchange supported data types.

2015-08-07 Thread rxin
rom rxin/cansupport and squashes the following commits: f7302ff [Reynold Xin] Can GenerateUnsafeProjection.canSupport to test Exchange supported data types. (cherry picked from commit aeddeafc03d77a5149d2c8f9489b0ca83e6b3e03) Signed-off-by: Reynold Xin Project: http://git-wip-us.apache.org/re

spark git commit: [SPARK-9733][SQL] Improve physical plan explain for data sources

2015-08-07 Thread rxin
tion[file:/scratch/rxin/spark/sales4][date#0,cat#1,count#2] ``` Author: Reynold Xin Closes #8024 from rxin/SPARK-9733 and squashes the following commits: 811b90e [Reynold Xin] Fixed Python test case. 52cab77 [Reynold Xin] Cast. eea9ccc [Reynold Xin] Fix test case. fcecb22 [Reynold Xin] [SPARK-

spark git commit: [SPARK-9733][SQL] Improve physical plan explain for data sources

2015-08-07 Thread rxin
CAST((CAST(count#2, IntegerType) + 1), LongType)),mode=Final,isDistinct=false)] Exchange hashpartitioning(date#0,cat#1) TungstenAggregate(key=[date#0,cat#1], value=[(sum(CAST((CAST(count#2, IntegerType) + 1), LongType)),mode=Partial,isDistinct=false)] ConvertToUnsafe Scan ParquetRelation[

spark git commit: [SPARK-9754][SQL] Remove TypeCheck in debug package.

2015-08-07 Thread rxin
Repository: spark Updated Branches: refs/heads/master 85be65b39 -> 998f4ff94 [SPARK-9754][SQL] Remove TypeCheck in debug package. TypeCheck no longer applies in the new "Tungsten" world. Author: Reynold Xin Closes #8043 from rxin/SPARK-9754 and squashes the following comm

spark git commit: [SPARK-9754][SQL] Remove TypeCheck in debug package.

2015-08-07 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 c5d43d6c8 -> 5598b6238 [SPARK-9754][SQL] Remove TypeCheck in debug package. TypeCheck no longer applies in the new "Tungsten" world. Author: Reynold Xin Closes #8043 from rxin/SPARK-9754 and squashes the following comm

spark git commit: [SPARK-9753] [SQL] TungstenAggregate should also accept InternalRow instead of just UnsafeRow

2015-08-07 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 5598b6238 -> 47e473550 [SPARK-9753] [SQL] TungstenAggregate should also accept InternalRow instead of just UnsafeRow https://issues.apache.org/jira/browse/SPARK-9753 This PR makes TungstenAggregate to accept `InternalRow` instead of j

spark git commit: [SPARK-9753] [SQL] TungstenAggregate should also accept InternalRow instead of just UnsafeRow

2015-08-07 Thread rxin
Repository: spark Updated Branches: refs/heads/master 998f4ff94 -> c564b2744 [SPARK-9753] [SQL] TungstenAggregate should also accept InternalRow instead of just UnsafeRow https://issues.apache.org/jira/browse/SPARK-9753 This PR makes TungstenAggregate to accept `InternalRow` instead of just

spark git commit: [SPARK-9728][SQL]Support CalendarIntervalType in HiveQL

2015-08-08 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 3427f5731 -> 3c438c789 [SPARK-9728][SQL]Support CalendarIntervalType in HiveQL This PR enables converting interval term in HiveQL to CalendarInterval Literal. JIRA: https://issues.apache.org/jira/browse/SPARK-9728 Author: Yijie Shen

spark git commit: [SPARK-9728][SQL]Support CalendarIntervalType in HiveQL

2015-08-08 Thread rxin
Repository: spark Updated Branches: refs/heads/master ac507a03c -> 23695f1d2 [SPARK-9728][SQL]Support CalendarIntervalType in HiveQL This PR enables converting interval term in HiveQL to CalendarInterval Literal. JIRA: https://issues.apache.org/jira/browse/SPARK-9728 Author: Yijie Shen Clo

spark git commit: [SPARK-9486][SQL] Add data source aliasing for external packages

2015-08-08 Thread rxin
Repository: spark Updated Branches: refs/heads/master 23695f1d2 -> a3aec918b [SPARK-9486][SQL] Add data source aliasing for external packages Users currently have to provide the full class name for external data sources, like: `sqlContext.read.format("com.databricks.spark.avro").load(path)`

spark git commit: [SPARK-9486][SQL] Add data source aliasing for external packages

2015-08-08 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 3c438c789 -> 06b623403 [SPARK-9486][SQL] Add data source aliasing for external packages Users currently have to provide the full class name for external data sources, like: `sqlContext.read.format("com.databricks.spark.avro").load(pat

spark git commit: [MINOR] inaccurate comments for showString()

2015-08-08 Thread rxin
Repository: spark Updated Branches: refs/heads/master a3aec918b -> 25c363e93 [MINOR] inaccurate comments for showString() Author: CodingCat Closes #8050 from CodingCat/minor and squashes the following commits: 5bc4b89 [CodingCat] inaccurate comments Project: http://git-wip-us.apache.org/r

spark git commit: [MINOR] inaccurate comments for showString()

2015-08-08 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 06b623403 -> 874b9d855 [MINOR] inaccurate comments for showString() Author: CodingCat Closes #8050 from CodingCat/minor and squashes the following commits: 5bc4b89 [CodingCat] inaccurate comments (cherry picked from commit 25c363e93

spark git commit: [SPARK-9752][SQL] Support UnsafeRow in Sample operator.

2015-08-09 Thread rxin
Repository: spark Updated Branches: refs/heads/master 3ca995b78 -> e9c36938b [SPARK-9752][SQL] Support UnsafeRow in Sample operator. In order for this to work, I had to disable gap sampling. Author: Reynold Xin Closes #8040 from rxin/SPARK-9752 and squashes the following commits: f9e2

spark git commit: [SPARK-9752][SQL] Support UnsafeRow in Sample operator.

2015-08-09 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 251d1eef4 -> b12f0737f [SPARK-9752][SQL] Support UnsafeRow in Sample operator. In order for this to work, I had to disable gap sampling. Author: Reynold Xin Closes #8040 from rxin/SPARK-9752 and squashes the following comm

spark git commit: Disable JobGeneratorSuite "Do not clear received block data too soon".

2015-08-09 Thread rxin
Repository: spark Updated Branches: refs/heads/master 86fa4ba6d -> a863348fd Disable JobGeneratorSuite "Do not clear received block data too soon". Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/a863348f Tree: http://git-

spark git commit: [CORE] [SPARK-9760] Use Option instead of Some for Ivy repos

2015-08-09 Thread rxin
Repository: spark Updated Branches: refs/heads/master 23cf5af08 -> 46025616b [CORE] [SPARK-9760] Use Option instead of Some for Ivy repos This was introduced in #7599 cc rxin brkyvz Author: Shivaram Venkataraman Closes #8055 from shivaram/spark-packages-repo-fix and squashes the follow

spark git commit: [CORE] [SPARK-9760] Use Option instead of Some for Ivy repos

2015-08-09 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 323d68606 -> 0e0471db4 [CORE] [SPARK-9760] Use Option instead of Some for Ivy repos This was introduced in #7599 cc rxin brkyvz Author: Shivaram Venkataraman Closes #8055 from shivaram/spark-packages-repo-fix and squashes

spark git commit: [SPARK-9777] [SQL] Window operator can accept UnsafeRows

2015-08-09 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.5 0e0471db4 -> f75c64b0c [SPARK-9777] [SQL] Window operator can accept UnsafeRows https://issues.apache.org/jira/browse/SPARK-9777 Author: Yin Huai Closes #8064 from yhuai/windowUnsafe and squashes the following commits: 8fb3537 [Yin

spark git commit: [SPARK-9777] [SQL] Window operator can accept UnsafeRows

2015-08-09 Thread rxin
Repository: spark Updated Branches: refs/heads/master 46025616b -> be80def0d [SPARK-9777] [SQL] Window operator can accept UnsafeRows https://issues.apache.org/jira/browse/SPARK-9777 Author: Yin Huai Closes #8064 from yhuai/windowUnsafe and squashes the following commits: 8fb3537 [Yin Huai

spark git commit: Fixed AtmoicReference<> Example

2015-08-10 Thread rxin
Repository: spark Updated Branches: refs/heads/master 00b655cce -> d28521275 Fixed AtmoicReference<> Example Author: Mahmoud Lababidi Closes #8076 from lababidi/master and squashes the following commits: af4553b [Mahmoud Lababidi] Fixed AtmoicReference<> Example Project: http://git-wip-us
