[SPARK-3086] [SPARK-3043] [SPARK-3156] [mllib] DecisionTree aggregation
improvements
Summary:
1. Variable numBins for each feature [SPARK-3043]
2. Reduced data reshaping in aggregation [SPARK-3043]
3. Choose ordering for ordered categorical features adaptively [SPARK-3156]
4. Changed nodes to
Repository: spark
Updated Branches:
refs/heads/master 0d1cc4ae4 - 711356b42
http://git-wip-us.apache.org/repos/asf/spark/blob/711356b4/mllib/src/main/scala/org/apache/spark/mllib/tree/impl/DTStatsAggregator.scala
--
diff --git
Repository: spark
Updated Branches:
refs/heads/master 16a73c247 - 386bc24eb
Provide a default PYSPARK_PYTHON for python/run_tests
Without this the version of python used in the test is not
recorded. The error is,
Testing with Python version:
./run-tests: line 57: --version: command not
Repository: spark
Updated Branches:
refs/heads/master 386bc24eb - 26bc7655d
[SQL] Minor edits to sql programming guide.
Author: Henry Cook hc...@eecs.berkeley.edu
Closes #2316 from hcook/sql-docs and squashes the following commits:
373f94b [Henry Cook] Minor edits to sql programming guide.
Repository: spark
Updated Branches:
refs/heads/branch-1.1 8c6306a03 - 7a236dcf8
[SQL] Minor edits to sql programming guide.
Author: Henry Cook hc...@eecs.berkeley.edu
Closes #2316 from hcook/sql-docs and squashes the following commits:
373f94b [Henry Cook] Minor edits to sql programming
Repository: spark
Updated Branches:
refs/heads/master 26bc7655d - 939a322c8
[SPARK-3417] Use new-style classes in PySpark
Tiny PR making SQLContext a new-style class. This allows various type logic to
work more effectively
```Python
In [1]: import pyspark
In [2]:
Repository: spark
Updated Branches:
refs/heads/master 939a322c8 - 08ce18881
http://git-wip-us.apache.org/repos/asf/spark/blob/08ce1888/core/src/test/scala/org/apache/spark/storage/BlockManagerSuite.scala
--
diff --git
http://git-wip-us.apache.org/repos/asf/spark/blob/08ce1888/core/src/main/scala/org/apache/spark/storage/BlockManager.scala
--
diff --git a/core/src/main/scala/org/apache/spark/storage/BlockManager.scala
http://git-wip-us.apache.org/repos/asf/spark/blob/08ce1888/core/src/main/scala/org/apache/spark/network/nio/ConnectionManager.scala
--
diff --git
a/core/src/main/scala/org/apache/spark/network/nio/ConnectionManager.scala
Repository: spark
Updated Branches:
refs/heads/master 08ce18881 - 7db53391f
[SPARK-3349][SQL] Output partitioning of limit should not be inherited from
child
This resolves https://issues.apache.org/jira/browse/SPARK-3349
Author: Eric Liang e...@google.com
Closes #2262 from ericl/spark-3349
Repository: spark
Updated Branches:
refs/heads/master 7db53391f - 50a4fa774
[SPARK-3443][MLLIB] update default values of tree:
Adjust the default values of decision tree, based on the memory requirement
discussed in https://github.com/apache/spark/pull/2125 :
1. maxMemoryInMB: 128 - 256
2.
Repository: spark
Updated Branches:
refs/heads/master ca0348e68 - dc1dbf206
[SPARK-3414][SQL] Stores analyzed logical plan when registering a temp table
Case insensitivity breaks when unresolved relation contains attributes with
uppercase letters in their names, because we store unanalyzed
Repository: spark
Updated Branches:
refs/heads/master dc1dbf206 - 2b7ab814f
[SPARK-3329][SQL] Don't depend on Hive SET pair ordering in tests.
This fixes some possible spurious test failures in `HiveQuerySuite` by
comparing sets of key-value pairs as sets, rather than as lists.
Author:
Repository: spark
Updated Branches:
refs/heads/master 2b7ab814f - 092e2f152
SPARK-2425 Don't kill a still-running Application because of some misbehaving
Executors
Introduces a LOADING - RUNNING ApplicationState transition and prevents Master
from removing an Application with RUNNING
14 matches
Mail list logo