spark git commit: [SPARK-4596][MLLib] Refactorize Normalizer to make code cleaner

2014-11-25 Thread meng
Repository: spark Updated Branches: refs/heads/master 0fe54cff1 - 89f912264 [SPARK-4596][MLLib] Refactorize Normalizer to make code cleaner In this refactoring, the performance will be slightly increased due to removing the overhead from breeze vector. The bottleneck is still in breeze norm

spark git commit: [SPARK-4596][MLLib] Refactorize Normalizer to make code cleaner

2014-11-25 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.2 1f4d1ac4b - 74571991b [SPARK-4596][MLLib] Refactorize Normalizer to make code cleaner In this refactoring, the performance will be slightly increased due to removing the overhead from breeze vector. The bottleneck is still in breeze

spark git commit: [SPARK-4526][MLLIB]GradientDescent get a wrong gradient value according to the gradient formula.

2014-11-25 Thread meng
Repository: spark Updated Branches: refs/heads/master 89f912264 - f515f9432 [SPARK-4526][MLLIB]GradientDescent get a wrong gradient value according to the gradient formula. This is caused by the miniBatchSize parameter.The number of `RDD.sample` returns is not fixed. cc mengxr Author:

spark git commit: [SPARK-4526][MLLIB]GradientDescent get a wrong gradient value according to the gradient formula.

2014-11-25 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.2 74571991b - d117f8fa4 [SPARK-4526][MLLIB]GradientDescent get a wrong gradient value according to the gradient formula. This is caused by the miniBatchSize parameter.The number of `RDD.sample` returns is not fixed. cc mengxr Author:

spark git commit: [SPARK-4535][Streaming] Fix the error in comments

2014-11-25 Thread tdas
Repository: spark Updated Branches: refs/heads/master f515f9432 - a51118a34 [SPARK-4535][Streaming] Fix the error in comments change `NetworkInputDStream` to `ReceiverInputDStream` change `ReceiverInputTracker` to `ReceiverTracker` Author: q00251598 qiyad...@huawei.com Closes #3400 from

spark git commit: [SPARK-4535][Streaming] Fix the error in comments

2014-11-25 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-1.2 d117f8fa4 - 42b9d0d31 [SPARK-4535][Streaming] Fix the error in comments change `NetworkInputDStream` to `ReceiverInputDStream` change `ReceiverInputTracker` to `ReceiverTracker` Author: q00251598 qiyad...@huawei.com Closes #3400 from

spark git commit: [SPARK-4344][DOCS] adding documentation on spark.yarn.user.classpath.first

2014-11-25 Thread tgraves
Repository: spark Updated Branches: refs/heads/master fef27b294 - d24076019 [SPARK-4344][DOCS] adding documentation on spark.yarn.user.classpath.first The documentation for the two parameters is the same with a pointer from the standalone parameter to the yarn parameter Author: arahuja

spark git commit: [SPARK-4344][DOCS] adding documentation on spark.yarn.user.classpath.first

2014-11-25 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-1.2 b026546e3 - a689ab98d [SPARK-4344][DOCS] adding documentation on spark.yarn.user.classpath.first The documentation for the two parameters is the same with a pointer from the standalone parameter to the yarn parameter Author: arahuja

spark git commit: [SPARK-4601][Streaming] Set correct call site for streaming jobs so that it is displayed correctly on the Spark UI

2014-11-25 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-1.2 a689ab98d - 96f76fc40 [SPARK-4601][Streaming] Set correct call site for streaming jobs so that it is displayed correctly on the Spark UI When running the NetworkWordCount, the description of the word count jobs are set as getCallsite

spark git commit: [SPARK-4581][MLlib] Refactorize StandardScaler to improve the transformation performance

2014-11-25 Thread meng
Repository: spark Updated Branches: refs/heads/master 69cd53eae - bf1a6aaac [SPARK-4581][MLlib] Refactorize StandardScaler to improve the transformation performance The following optimizations are done to improve the StandardScaler model transformation performance. 1) Covert Breeze dense

spark git commit: [SPARK-4581][MLlib] Refactorize StandardScaler to improve the transformation performance

2014-11-25 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.2 96f76fc40 - 1e356a8fa [SPARK-4581][MLlib] Refactorize StandardScaler to improve the transformation performance The following optimizations are done to improve the StandardScaler model transformation performance. 1) Covert Breeze

spark git commit: [SPARK-4196][SPARK-4602][Streaming] Fix serialization issue in PairDStreamFunctions.saveAsNewAPIHadoopFiles

2014-11-25 Thread tdas
Repository: spark Updated Branches: refs/heads/master bf1a6aaac - 8838ad7c1 [SPARK-4196][SPARK-4602][Streaming] Fix serialization issue in PairDStreamFunctions.saveAsNewAPIHadoopFiles Solves two JIRAs in one shot - Makes the ForechDStream created by saveAsNewAPIHadoopFiles serializable for

spark git commit: [SPARK-4196][SPARK-4602][Streaming] Fix serialization issue in PairDStreamFunctions.saveAsNewAPIHadoopFiles

2014-11-25 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-1.2 1e356a8fa - a9944c809 [SPARK-4196][SPARK-4602][Streaming] Fix serialization issue in PairDStreamFunctions.saveAsNewAPIHadoopFiles Solves two JIRAs in one shot - Makes the ForechDStream created by saveAsNewAPIHadoopFiles serializable

spark git commit: [SPARK-4196][SPARK-4602][Streaming] Fix serialization issue in PairDStreamFunctions.saveAsNewAPIHadoopFiles

2014-11-25 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-1.1 63717375d - 7aa592c74 [SPARK-4196][SPARK-4602][Streaming] Fix serialization issue in PairDStreamFunctions.saveAsNewAPIHadoopFiles Solves two JIRAs in one shot - Makes the ForechDStream created by saveAsNewAPIHadoopFiles serializable

spark git commit: [HOTFIX] Fixing broken build due to missing imports.

2014-11-25 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-1.2 a9944c809 - a2c01ae5e [HOTFIX] Fixing broken build due to missing imports. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/a2c01ae5 Tree:

spark git commit: [HOTFIX] Fixing broken build due to missing imports.

2014-11-25 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-1.1 7aa592c74 - 1a7f4144e [HOTFIX] Fixing broken build due to missing imports. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/1a7f4144 Tree:

spark git commit: [SPARK-4592] Avoid duplicate worker registrations in standalone mode

2014-11-25 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.2 a2c01ae5e - ee0317509 [SPARK-4592] Avoid duplicate worker registrations in standalone mode **Summary.** On failover, the Master may receive duplicate registrations from the same worker, causing the worker to exit. This is caused by

spark git commit: [SPARK-4546] Improve HistoryServer first time user experience

2014-11-25 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 1b2ab1cd1 - 9afcbe494 [SPARK-4546] Improve HistoryServer first time user experience The documentation points the user to run the following ``` sbin/start-history-server.sh ``` The first thing this does is throw an exception that complains

spark git commit: [SPARK-4546] Improve HistoryServer first time user experience

2014-11-25 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.2 ee0317509 - 58c840dde [SPARK-4546] Improve HistoryServer first time user experience The documentation points the user to run the following ``` sbin/start-history-server.sh ``` The first thing this does is throw an exception that

spark git commit: Fix SPARK-4471: blockManagerIdFromJson function throws exception while B...

2014-11-25 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.2 58c840dde - 93b914df1 Fix SPARK-4471: blockManagerIdFromJson function throws exception while B... Fix [SPARK-4471](https://issues.apache.org/jira/browse/SPARK-4471): blockManagerIdFromJson function throws exception while

spark git commit: Fix SPARK-4471: blockManagerIdFromJson function throws exception while B...

2014-11-25 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 9afcbe494 - 9bdf5da59 Fix SPARK-4471: blockManagerIdFromJson function throws exception while B... Fix [SPARK-4471](https://issues.apache.org/jira/browse/SPARK-4471): blockManagerIdFromJson function throws exception while BlockManagerId be

spark git commit: [Spark-4509] Revert EC2 tag-based cluster membership patch

2014-11-25 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 9bdf5da59 - 7eba0fbe4 [Spark-4509] Revert EC2 tag-based cluster membership patch This PR reverts changes related to tag-based cluster membership. As discussed in SPARK-3332, we didn't figure out a safe strategy to use tags to determine

spark git commit: [Spark-4509] Revert EC2 tag-based cluster membership patch

2014-11-25 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-1.2 93b914df1 - a48ea3cef [Spark-4509] Revert EC2 tag-based cluster membership patch This PR reverts changes related to tag-based cluster membership. As discussed in SPARK-3332, we didn't figure out a safe strategy to use tags to

spark git commit: [SPARK-4583] [mllib] LogLoss for GradientBoostedTrees fix + doc updates

2014-11-25 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.2 a48ea3cef - 6880b467f [SPARK-4583] [mllib] LogLoss for GradientBoostedTrees fix + doc updates Currently, the LogLoss used by GradientBoostedTrees has 2 issues: * the gradient (and therefore loss) does not match that used by Friedman

spark git commit: [SPARK-4583] [mllib] LogLoss for GradientBoostedTrees fix + doc updates

2014-11-25 Thread meng
Repository: spark Updated Branches: refs/heads/master 7eba0fbe4 - c251fd740 [SPARK-4583] [mllib] LogLoss for GradientBoostedTrees fix + doc updates Currently, the LogLoss used by GradientBoostedTrees has 2 issues: * the gradient (and therefore loss) does not match that used by Friedman (1999)

spark git commit: [HOTFIX]: Adding back without-hive dist

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.2 6880b467f - 37d58aaac [HOTFIX]: Adding back without-hive dist Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/37d58aaa Tree:

spark git commit: [SPARK-4604][MLLIB] make MatrixFactorizationModel public

2014-11-25 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.2 37d58aaac - 2756d0de9 [SPARK-4604][MLLIB] make MatrixFactorizationModel public User could construct an MF model directly. I added a note about the performance. Author: Xiangrui Meng m...@databricks.com Closes #3459 from

spark git commit: [SPARK-4604][MLLIB] make MatrixFactorizationModel public

2014-11-25 Thread meng
Repository: spark Updated Branches: refs/heads/master 4d95526a7 - b5fb1410c [SPARK-4604][MLLIB] make MatrixFactorizationModel public User could construct an MF model directly. I added a note about the performance. Author: Xiangrui Meng m...@databricks.com Closes #3459 from mengxr/SPARK-4604

svn commit: r7261 - in /release/spark/spark-1.1.1: spark-1.1.1-bin-hadoop2.4-without-hive.tgz spark-1.1.1-bin-hadoop2.4-without-hive.tgz.asc spark-1.1.1-bin-hadoop2.4-without-hive.tgz.md5 spark-1.1.1-

2014-11-25 Thread andrewor14
Author: andrewor14 Date: Wed Nov 26 04:18:17 2014 New Revision: 7261 Log: Remove Spark 1.1.1 *without-hive artifacts Removed: release/spark/spark-1.1.1/spark-1.1.1-bin-hadoop2.4-without-hive.tgz release/spark/spark-1.1.1/spark-1.1.1-bin-hadoop2.4-without-hive.tgz.asc

spark git commit: [SPARK-4516] Cap default number of Netty threads at 8

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/master b5fb1410c - f5f2d2738 [SPARK-4516] Cap default number of Netty threads at 8 In practice, only 2-4 cores should be required to transfer roughly 10 Gb/s, and each core that we use will have an initial overhead of roughly 32 MB of off-heap

spark git commit: [SPARK-4516] Cap default number of Netty threads at 8

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.2 2756d0de9 - 1e12f594b [SPARK-4516] Cap default number of Netty threads at 8 In practice, only 2-4 cores should be required to transfer roughly 10 Gb/s, and each core that we use will have an initial overhead of roughly 32 MB of

[1/2] spark git commit: Revert Preparing development version 1.2.1-SNAPSHOT

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.2 1e12f594b - 01271786e Revert Preparing development version 1.2.1-SNAPSHOT This reverts commit d7ac6013483e83caff8ea54c228f37aeca159db8. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

[2/2] spark git commit: Revert Preparing Spark release v1.2.0-snapshot1

2014-11-25 Thread pwendell
Revert Preparing Spark release v1.2.0-snapshot1 This reverts commit 38c1fbd9694430cefd962c90bc36b0d108c6124b. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/01271786 Tree:

Git Push Summary

2014-11-25 Thread pwendell
Repository: spark Updated Tags: refs/tags/v1.2.0-rc1 [created] db7f4a898 - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

[1/2] spark git commit: Preparing development version 1.2.1-SNAPSHOT

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.2 01271786e - d7b1ecb25 Preparing development version 1.2.1-SNAPSHOT Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/d7b1ecb2 Tree:

[2/2] spark git commit: Preparing Spark release v1.2.0-rc1

2014-11-25 Thread pwendell
Preparing Spark release v1.2.0-rc1 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/db7f4a89 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/db7f4a89 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/db7f4a89

Git Push Summary

2014-11-25 Thread pwendell
Repository: spark Updated Tags: refs/tags/v1.2.0-rc1 [deleted] db7f4a898 - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

[1/2] spark git commit: Revert Preparing development version 1.2.1-SNAPSHOT

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.2 d7b1ecb25 - ce6200b26 Revert Preparing development version 1.2.1-SNAPSHOT This reverts commit d7b1ecb25676d228deb6fe05efdb4e2ab9c3e30b. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

[2/2] spark git commit: Preparing Spark release v1.2.0-rc1

2014-11-25 Thread pwendell
Preparing Spark release v1.2.0-rc1 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/5247dd85 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/5247dd85 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/5247dd85

Git Push Summary

2014-11-25 Thread pwendell
Repository: spark Updated Tags: refs/tags/v1.2.0-rc1 [created] 5247dd859 - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

[1/2] spark git commit: Preparing development version 1.2.1-SNAPSHOT

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.2 ce6200b26 - 79df6b43a Preparing development version 1.2.1-SNAPSHOT Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/79df6b43 Tree:

[1/2] spark git commit: Revert Preparing development version 1.2.1-SNAPSHOT

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.2 79df6b43a - de8029b39 Revert Preparing development version 1.2.1-SNAPSHOT This reverts commit 79df6b43ae762263a8120f423ddb4a0811dd4b6f. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: HOTFIX: Updating additional version data

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.2 de8029b39 - dfb8c65b7 HOTFIX: Updating additional version data Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/dfb8c65b Tree:

[2/2] spark git commit: Preparing development version 1.2.1-SNAPSHOT

2014-11-25 Thread pwendell
Preparing development version 1.2.1-SNAPSHOT Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/380eba5f Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/380eba5f Diff:

[1/2] spark git commit: Preparing Spark release v1.2.0-rc1

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.2 dfb8c65b7 - 380eba5f4 Preparing Spark release v1.2.0-rc1 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/cc2c05e4 Tree:

spark git commit: [SPARK-4516] Avoid allocating Netty PooledByteBufAllocators unnecessarily

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/master f5f2d2738 - 346bc17a2 [SPARK-4516] Avoid allocating Netty PooledByteBufAllocators unnecessarily Turns out we are allocating an allocator pool for every TransportClient (which means that the number increases with the number of nodes in the

spark git commit: [SPARK-4516] Avoid allocating Netty PooledByteBufAllocators unnecessarily

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.2 380eba5f4 - c7185f0c0 [SPARK-4516] Avoid allocating Netty PooledByteBufAllocators unnecessarily Turns out we are allocating an allocator pool for every TransportClient (which means that the number increases with the number of nodes in

[2/2] spark git commit: Revert Preparing Spark release v1.2.0-rc1

2014-11-25 Thread pwendell
Revert Preparing Spark release v1.2.0-rc1 This reverts commit cc2c05e4ee81d2f34873a2ebb9a5272867cb65c2. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/8f5ebcb6 Tree:

Git Push Summary

2014-11-25 Thread pwendell
Repository: spark Updated Tags: refs/tags/v1.2.0-rc1 [deleted] cc2c05e4e - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

[1/2] spark git commit: Revert Preparing development version 1.2.1-SNAPSHOT

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.2 c7185f0c0 - 8f5ebcb63 Revert Preparing development version 1.2.1-SNAPSHOT This reverts commit 380eba5f49eca1dbd4084e6c84e19866fffd4efa. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: Revert [SPARK-4583] [mllib] LogLoss for GradientBoostedTrees fix + doc updates

2014-11-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.2 8f5ebcb63 - 17a4b8e59 Revert [SPARK-4583] [mllib] LogLoss for GradientBoostedTrees fix + doc updates This reverts commit 6880b467f66a4906161cbc343e70d975056a4f5f. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: