[2/2] git commit: [SPARK-3086] [SPARK-3043] [SPARK-3156] [mllib] DecisionTree aggregation improvements

2014-09-08 Thread meng
[SPARK-3086] [SPARK-3043] [SPARK-3156] [mllib] DecisionTree aggregation improvements Summary: 1. Variable numBins for each feature [SPARK-3043] 2. Reduced data reshaping in aggregation [SPARK-3043] 3. Choose ordering for ordered categorical features adaptively [SPARK-3156] 4. Changed nodes to

[1/2] [SPARK-3086] [SPARK-3043] [SPARK-3156] [mllib] DecisionTree aggregation improvements

2014-09-08 Thread meng
Repository: spark Updated Branches: refs/heads/master 0d1cc4ae4 - 711356b42 http://git-wip-us.apache.org/repos/asf/spark/blob/711356b4/mllib/src/main/scala/org/apache/spark/mllib/tree/impl/DTStatsAggregator.scala -- diff --git

git commit: Provide a default PYSPARK_PYTHON for python/run_tests

2014-09-08 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master 16a73c247 - 386bc24eb Provide a default PYSPARK_PYTHON for python/run_tests Without this the version of python used in the test is not recorded. The error is, Testing with Python version: ./run-tests: line 57: --version: command not

git commit: [SQL] Minor edits to sql programming guide.

2014-09-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 386bc24eb - 26bc7655d [SQL] Minor edits to sql programming guide. Author: Henry Cook hc...@eecs.berkeley.edu Closes #2316 from hcook/sql-docs and squashes the following commits: 373f94b [Henry Cook] Minor edits to sql programming guide.

git commit: [SQL] Minor edits to sql programming guide.

2014-09-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.1 8c6306a03 - 7a236dcf8 [SQL] Minor edits to sql programming guide. Author: Henry Cook hc...@eecs.berkeley.edu Closes #2316 from hcook/sql-docs and squashes the following commits: 373f94b [Henry Cook] Minor edits to sql programming

git commit: [SPARK-3417] Use new-style classes in PySpark

2014-09-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 26bc7655d - 939a322c8 [SPARK-3417] Use new-style classes in PySpark Tiny PR making SQLContext a new-style class. This allows various type logic to work more effectively ```Python In [1]: import pyspark In [2]:

[1/5] [SPARK-3019] Pluggable block transfer interface (BlockTransferService)

2014-09-08 Thread rxin
Repository: spark Updated Branches: refs/heads/master 939a322c8 - 08ce18881 http://git-wip-us.apache.org/repos/asf/spark/blob/08ce1888/core/src/test/scala/org/apache/spark/storage/BlockManagerSuite.scala -- diff --git

[2/5] [SPARK-3019] Pluggable block transfer interface (BlockTransferService)

2014-09-08 Thread rxin
http://git-wip-us.apache.org/repos/asf/spark/blob/08ce1888/core/src/main/scala/org/apache/spark/storage/BlockManager.scala -- diff --git a/core/src/main/scala/org/apache/spark/storage/BlockManager.scala

[3/5] [SPARK-3019] Pluggable block transfer interface (BlockTransferService)

2014-09-08 Thread rxin
http://git-wip-us.apache.org/repos/asf/spark/blob/08ce1888/core/src/main/scala/org/apache/spark/network/nio/ConnectionManager.scala -- diff --git a/core/src/main/scala/org/apache/spark/network/nio/ConnectionManager.scala

git commit: [SPARK-3349][SQL] Output partitioning of limit should not be inherited from child

2014-09-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 08ce18881 - 7db53391f [SPARK-3349][SQL] Output partitioning of limit should not be inherited from child This resolves https://issues.apache.org/jira/browse/SPARK-3349 Author: Eric Liang e...@google.com Closes #2262 from ericl/spark-3349

git commit: [SPARK-3443][MLLIB] update default values of tree:

2014-09-08 Thread meng
Repository: spark Updated Branches: refs/heads/master 7db53391f - 50a4fa774 [SPARK-3443][MLLIB] update default values of tree: Adjust the default values of decision tree, based on the memory requirement discussed in https://github.com/apache/spark/pull/2125 : 1. maxMemoryInMB: 128 - 256 2.

git commit: [SPARK-3414][SQL] Stores analyzed logical plan when registering a temp table

2014-09-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master ca0348e68 - dc1dbf206 [SPARK-3414][SQL] Stores analyzed logical plan when registering a temp table Case insensitivity breaks when unresolved relation contains attributes with uppercase letters in their names, because we store unanalyzed

git commit: [SPARK-3329][SQL] Don't depend on Hive SET pair ordering in tests.

2014-09-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master dc1dbf206 - 2b7ab814f [SPARK-3329][SQL] Don't depend on Hive SET pair ordering in tests. This fixes some possible spurious test failures in `HiveQuerySuite` by comparing sets of key-value pairs as sets, rather than as lists. Author:

git commit: SPARK-2425 Don't kill a still-running Application because of some misbehaving Executors

2014-09-08 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 2b7ab814f - 092e2f152 SPARK-2425 Don't kill a still-running Application because of some misbehaving Executors Introduces a LOADING - RUNNING ApplicationState transition and prevents Master from removing an Application with RUNNING