[1/3] [SPARK-1332] Improve Spark Streaming's Network Receiver and InputDStream API [WIP]

2014-04-21 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 a34e6fda1 - 94cbe2329 http://git-wip-us.apache.org/repos/asf/spark/blob/94cbe232/streaming/src/main/scala/org/apache/spark/streaming/scheduler/ReceiverTracker.scala --

[1/2] [SPARK-1439, SPARK-1440] Generate unified Scaladoc across projects and Javadocs

2014-04-21 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 04c37b6f7 - fc7838470 http://git-wip-us.apache.org/repos/asf/spark/blob/fc783847/project/plugins.sbt -- diff --git a/project/plugins.sbt b/project/plugins.sbt index

git commit: [SPARK-1459] Use local path (and not complete URL) when opening local lo...

2014-04-22 Thread pwendell
Repository: spark Updated Branches: refs/heads/master b3e5366f6 - ac164b79d [SPARK-1459] Use local path (and not complete URL) when opening local lo... ...g file. Author: Marcelo Vanzin van...@cloudera.com Closes #375 from vanzin/event-file and squashes the following commits: f673029

git commit: [SPARK-1459] Use local path (and not complete URL) when opening local lo...

2014-04-22 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 b0d70e40f - 0a731039e [SPARK-1459] Use local path (and not complete URL) when opening local lo... ...g file. Author: Marcelo Vanzin van...@cloudera.com Closes #375 from vanzin/event-file and squashes the following commits: f673029

git commit: version number fix

2014-04-22 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-0.9 9e8978903 - 54c3b7e3b version number fix self-explain Author: Nan Zhu coding...@users.noreply.github.com Closes #467 from CodingCat/branch-0.9 and squashes the following commits: ba36109 [Nan Zhu] remove out-dated comments 9a8810e

git commit: SPARK-1496: Have jarOfClass return Option[String]

2014-04-22 Thread pwendell
Repository: spark Updated Branches: refs/heads/master ac164b79d - 83084d3b7 SPARK-1496: Have jarOfClass return Option[String] A simple change, mostly had to change a bunch of example code. Author: Patrick Wendell pwend...@gmail.com Closes #438 from pwendell/jar-of-class and squashes

git commit: SPARK-1496: Have jarOfClass return Option[String]

2014-04-22 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 0a731039e - 72aa131cb SPARK-1496: Have jarOfClass return Option[String] A simple change, mostly had to change a bunch of example code. Author: Patrick Wendell pwend...@gmail.com Closes #438 from pwendell/jar-of-class and squashes

git commit: [HOTFIX] SPARK-1399: remove outdated comments

2014-04-22 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 83084d3b7 - 87de29084 [HOTFIX] SPARK-1399: remove outdated comments as the original PR was merged before this mistake was found, fix it here. Sorry about that @pwendell, @andrewor14, I will be more careful next time Author: CodingCat

git commit: [SPARK-1560]: Updated Pyrolite Dependency to be Java 6 compatible

2014-04-22 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 72aa131cb - 798d93f30 [SPARK-1560]: Updated Pyrolite Dependency to be Java 6 compatible Changed the Pyrolite dependency to a build which targets Java 6. Author: Ahir Reddy ahirre...@gmail.com Closes #479 from ahirreddy/java6-pyrolite

git commit: [SPARK-1560]: Updated Pyrolite Dependency to be Java 6 compatible

2014-04-22 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 87de29084 - 0f87e6ad4 [SPARK-1560]: Updated Pyrolite Dependency to be Java 6 compatible Changed the Pyrolite dependency to a build which targets Java 6. Author: Ahir Reddy ahirre...@gmail.com Closes #479 from ahirreddy/java6-pyrolite and

[2/4] git commit: [SPARK-1281] Improve partitioning in ALS

2014-04-22 Thread pwendell
[SPARK-1281] Improve partitioning in ALS ALS was using HashPartitioner and explicit uses of `%` together. Further, the naked use of `%` meant that, if the number of partitions corresponded with the stride of arithmetic progressions appearing in user and product ids, users and products could
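The skew described in this message is easy to reproduce outside Spark. The following is a minimal Python sketch, not the ALS code itself: `partition_by_mod` and `partition_by_mixed_hash` are hypothetical helpers, and the multiplicative scrambler in `mix` is an assumed stand-in for whatever hashing the patch actually applies.

```python
from collections import Counter


def partition_by_mod(ids, num_partitions):
    """Count ids per partition when the partition is simply id % num_partitions."""
    return Counter(i % num_partitions for i in ids)


def mix(i):
    """Hypothetical 32-bit multiplicative scrambler; an assumption for illustration."""
    return ((i * 2654435761) & 0xFFFFFFFF) >> 16


def partition_by_mixed_hash(ids, num_partitions):
    """Count ids per partition after scrambling each id first."""
    return Counter(mix(i) % num_partitions for i in ids)


# Ids form an arithmetic progression whose stride equals the partition count.
ids = list(range(0, 1000, 10))
skewed = partition_by_mod(ids, 10)
spread = partition_by_mixed_hash(ids, 10)

print("plain modulo:", dict(skewed))  # every id lands in partition 0
print("scrambled   :", len(spread), "partitions used")
```

With a stride of 10 and 10 partitions, the plain modulo sends all 100 ids to a single partition, which is the failure mode the message names.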

git commit: [Fix #204] Eliminate delay between binding and log checking

2014-04-22 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 898fc3480 - 54c96c234 [Fix #204] Eliminate delay between binding and log checking **Bug**: In the existing history server, there is a `spark.history.updateInterval` seconds delay before application logs show up on the UI. **Cause**:

git commit: SPARK-1119 and other build improvements

2014-04-23 Thread pwendell
to call make-distribution script. Author: Patrick Wendell pwend...@gmail.com Closes #502 from pwendell/make-distribution and squashes the following commits: 1a97f0d [Patrick Wendell] SPARK-1119 and other build improvements (cherry picked from commit cd4ed293262e2349794c13467d1737974385c019) Signed

git commit: SPARK-1488. Squash more language feature warnings in new commits by importing implicitConversion

2014-04-24 Thread pwendell
Repository: spark Updated Branches: refs/heads/master faeb761cb - 6338a93f1 SPARK-1488. Squash more language feature warnings in new commits by importing implicitConversion A recent commit reintroduced some of the same warnings that SPARK-1488 resolved. These are just a few more of the same

git commit: SPARK-1488. Squash more language feature warnings in new commits by importing implicitConversion

2014-04-24 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 563be2f03 - 8d92d93c0 SPARK-1488. Squash more language feature warnings in new commits by importing implicitConversion A recent commit reintroduced some of the same warnings that SPARK-1488 resolved. These are just a few more of the

git commit: SPARK-1611: Fix incorrect initialization order in AppendOnlyMap

2014-04-24 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 8d92d93c0 - 00a3ccc7c SPARK-1611: Fix incorrect initialization order in AppendOnlyMap JIRA: https://issues.apache.org/jira/browse/SPARK-1611 Author: zsxwing zsxw...@gmail.com Closes #534 from zsxwing/SPARK-1611 and squashes the

git commit: Spark 1490 Add kerberos support to the HistoryServer

2014-04-24 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 78a49b253 - bd375094a Spark 1490 Add kerberos support to the HistoryServer Here I've added the ability for the History server to login from a kerberos keytab file so that the history server can be run as a super user and stay up for

git commit: Spark 1489 Fix the HistoryServer view acls

2014-04-24 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 4660991e6 - 44da5ab2d Spark 1489 Fix the HistoryServer view acls This allows the view acls set by the user to be enforced by the history server. It also fixes filters being applied properly. Author: Thomas Graves tgra...@apache.org

git commit: Spark 1489 Fix the HistoryServer view acls

2014-04-24 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 2a35fba34 - c8dd13221 Spark 1489 Fix the HistoryServer view acls This allows the view acls set by the user to be enforced by the history server. It also fixes filters being applied properly. Author: Thomas Graves tgra...@apache.org

git commit: [SPARK-1615] Synchronize accesses to the LiveListenerBus' event queue

2014-04-24 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 80429f3e2 - ee6f7e22a [SPARK-1615] Synchronize accesses to the LiveListenerBus' event queue Original poster is @zsxwing, who reported this bug in #516. Much of SparkListenerSuite relies on LiveListenerBus's `waitUntilEmpty()` method. As

git commit: SPARK-1584: Upgrade Flume dependency to 1.4.0

2014-04-24 Thread pwendell
Repository: spark Updated Branches: refs/heads/master e53eb4f01 - d5c6ae6cc SPARK-1584: Upgrade Flume dependency to 1.4.0 Updated the Flume dependency in the maven pom file and the scala build file. Author: tmalaska ted.mala...@cloudera.com Closes #507 from tmalaska/master and squashes the

git commit: SPARK-1584: Upgrade Flume dependency to 1.4.0

2014-04-24 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 51a387a88 - 777a9a5a1 SPARK-1584: Upgrade Flume dependency to 1.4.0 Updated the Flume dependency in the maven pom file and the scala build file. Author: tmalaska ted.mala...@cloudera.com Closes #507 from tmalaska/master and squashes

git commit: SPARK-1632. Remove unnecessary boxing in compares in ExternalAppendOnlyM...

2014-04-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 027f1b85f - 87cf35c2d SPARK-1632. Remove unnecessary boxing in compares in ExternalAppendOnlyM... ...ap Author: Sandy Ryza sa...@cloudera.com Closes #559 from sryza/sandy-spark-1632 and squashes the following commits: a6cd352 [Sandy

git commit: SPARK-1467: Make StorageLevel.apply() factory methods Developer APIs

2014-04-26 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 8e37ed6eb - 762af4e9c SPARK-1467: Make StorageLevel.apply() factory methods Developer APIs We may want to evolve these in the future to add things like SSDs, so let's mark them as experimental for now. Long-term the right solution might

git commit: SPARK-1467: Make StorageLevel.apply() factory methods Developer APIs

2014-04-26 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 dcea67f11 - 18ecc6347 SPARK-1467: Make StorageLevel.apply() factory methods Developer APIs We may want to evolve these in the future to add things like SSDs, so let's mark them as experimental for now. Long-term the right solution

git commit: SPARK-1606: Infer user application arguments instead of requiring --arg.

2014-04-26 Thread pwendell
[options] user.jar [user options] Author: Patrick Wendell pwend...@gmail.com Closes #563 from pwendell/spark-submit and squashes the following commits: 32241fc [Patrick Wendell] Review feedback 3adfb69 [Patrick Wendell] Small fix bc48139 [Patrick Wendell] SPARK-1606: Infer user application arguments
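The `[options] user.jar [user options]` layout can be parsed without a dedicated `--arg` flag by treating the first non-option token as the jar. The sketch below is only an illustration in Python, assuming that rule; `split_submit_args` is a hypothetical helper, the three option names are a small subset chosen for the demo, and the real parser is not shown in this message.

```python
def split_submit_args(argv, options_with_value=("--master", "--class", "--name")):
    """Split argv into (options, application jar, application arguments).

    Everything before the first token that is not an option belongs to the
    launcher; that token is the jar; everything after it is passed through
    untouched, even if it looks like an option.
    """
    options = {}
    i = 0
    while i < len(argv) and argv[i].startswith("--"):
        name = argv[i]
        if name in options_with_value:
            if i + 1 >= len(argv):
                raise ValueError("option %s needs a value" % name)
            options[name] = argv[i + 1]
            i += 2
        else:
            options[name] = True  # treated as a boolean switch
            i += 1
    if i >= len(argv):
        raise ValueError("missing application jar")
    return options, argv[i], argv[i + 1:]


opts, jar, app_args = split_submit_args(
    ["--master", "local", "--class", "Main", "user.jar", "--input", "data.txt"]
)
print(opts, jar, app_args)
```

Note that `--input` after the jar is left alone, so the user application receives it unchanged.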

git commit: SPARK-1650: Correctly identify maven project version

2014-04-27 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 ce57624b8 - 98b13e023 SPARK-1650: Correctly identify maven project version Better account for various side-effect outputs while executing mvn help:evaluate -Dexpression=project.version Author: Rahul Singhal rahul.sing...@guavus.com

git commit: SPARK-1650: Correctly identify maven project version

2014-04-27 Thread pwendell
Repository: spark Updated Branches: refs/heads/master aa9a7f5db - 7b2527d74 SPARK-1650: Correctly identify maven project version Better account for various side-effect outputs while executing mvn help:evaluate -Dexpression=project.version Author: Rahul Singhal rahul.sing...@guavus.com
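One way to "account for side-effect outputs" is to discard every line that does not look like a bare value. This is a sketch in Python under that assumption, not the change made in the commit: `extract_project_version` is a hypothetical helper, the filter rules are guesses at typical Maven noise, and the sample output is made up.

```python
def extract_project_version(mvn_output):
    """Return the last output line that looks like a bare value, or None."""
    candidates = []
    for line in mvn_output.splitlines():
        line = line.strip()
        # Skip blanks, bracketed log lines such as "[INFO] ...", and download progress.
        if not line or line.startswith("[") or line.startswith("Download"):
            continue
        candidates.append(line)
    return candidates[-1] if candidates else None


# Made-up output resembling a run of mvn help:evaluate -Dexpression=project.version
sample = "\n".join([
    "[INFO] Scanning for projects...",
    "Downloading: https://repo.example.invalid/some-plugin.pom",
    "[INFO] --- maven-help-plugin:evaluate ---",
    "1.0.0-SNAPSHOT",
    "[INFO] BUILD SUCCESS",
])
print(extract_project_version(sample))
```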

git commit: SPARK-1648 Support closing JIRA's as part of merge script.

2014-04-27 Thread pwendell
disable this feature if it is not installed. It can be installed with `pip install jira-client`. Author: Patrick Wendell pwend...@gmail.com Closes #570 from pwendell/jira-pr-merge and squashes the following commits: 3022b96 [Patrick Wendell] SPARK-1648 Support closing JIRA's as part of merge
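Making a feature optional on an installed package usually comes down to a guarded import. The following is a minimal sketch of that pattern, not the merge script itself: the module name `jira.client` is my assumption about what the `jira-client` package provides, and `maybe_resolve_issue` is a hypothetical helper.

```python
try:
    import jira.client  # assumed module name for the jira-client package
    JIRA_IMPORTED = True
except ImportError:
    JIRA_IMPORTED = False


def maybe_resolve_issue(issue_id, resolve):
    """Call resolve(issue_id) only when the JIRA client could be imported."""
    if not JIRA_IMPORTED:
        print("JIRA client not installed; skipping %s" % issue_id)
        return False
    resolve(issue_id)
    return True
```

The rest of the script can then check `JIRA_IMPORTED` once instead of failing at import time.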

git commit: SPARK-1651: Delete existing deployment directory

2014-04-27 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 da26f9b8e - 3c6c6c265 SPARK-1651: Delete existing deployment directory Small bug fix to make sure the spark contents are copied to the deployment directory correctly. Author: Rahul Singhal rahul.sing...@guavus.com Closes #573 from

git commit: SPARK-1145: Memory mapping with many small blocks can cause JVM allocation failures

2014-04-27 Thread pwendell
to write that code block using Scala's `Try` but to make it easy to back port and as simple as possible, I opted for the more explicit but less pretty format. Author: Patrick Wendell pwend...@gmail.com Closes #43 from pwendell/block-iter-logging and squashes the following commits: 1cff512 [Patrick

git commit: SPARK-1145: Memory mapping with many small blocks can cause JVM allocation failures

2014-04-27 Thread pwendell
to write that code block using Scala's `Try` but to make it easy to back port and as simple as possible, I opted for the more explicit but less pretty format. Author: Patrick Wendell pwend...@gmail.com Closes #43 from pwendell/block-iter-logging and squashes the following commits: 1cff512

git commit: Fix SPARK-1609: Executor fails to start when Command.extraJavaOptions contains multiple Java options

2014-04-27 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 6b3c6e5dd - 71f4d2612 Fix SPARK-1609: Executor fails to start when Command.extraJavaOptions contains multiple Java options Author: witgo wi...@qq.com Closes #547 from witgo/SPARK-1609 and squashes the following commits: deb6a4c [witgo]

git commit: Fix SPARK-1609: Executor fails to start when Command.extraJavaOptions contains multiple Java options

2014-04-27 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 2f24159c1 - 7bbf313de Fix SPARK-1609: Executor fails to start when Command.extraJavaOptions contains multiple Java options Author: witgo wi...@qq.com Closes #547 from witgo/SPARK-1609 and squashes the following commits: deb6a4c
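The message does not state the cause, but a common reason for this kind of failure is handing the whole options string to the JVM as one argument. The Python sketch below illustrates that assumption only; `build_java_command` is a hypothetical helper and the option values are invented.

```python
import shlex


def build_java_command(extra_java_options, main_class):
    """Build an argv list, splitting the options string into separate arguments."""
    return ["java"] + shlex.split(extra_java_options) + [main_class]


options = '-Xmx1g -Dapp.name="my app"'
# Passing the whole string as one argument gives the JVM a single bogus option.
broken = ["java", options, "Main"]
fixed = build_java_command(options, "Main")
print(fixed)
```

`shlex.split` follows shell quoting rules, so the quoted value with a space stays inside one argument while the two options are separated.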

git commit: Changes to dev release script

2014-04-28 Thread pwendell
release candidate +# # Would be nice to add: # - Send output to stderr and have useful logging in stdout -# - Have this use sbt rather than Maven release plug in -GIT_USERNAME=pwendell -GIT_PASSWORD=XXX -GPG_PASSPHRASE=XXX -GIT_BRANCH=branch-0.9 -RELEASE_VERSION=0.9.0-incubating -RC_NAME=rc2

git commit: SPARK-1652: Spark submit should fail gracefully if YARN not enabled

2014-04-28 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 8421034e7 - cae054aaf SPARK-1652: Spark submit should fail gracefully if YARN not enabled Author: Patrick Wendell pwend...@gmail.com Closes #579 from pwendell/spark-submit-yarn-2 and squashes the following commits: 05e1b11 [Patrick

git commit: SPARK-1652: Spark submit should fail gracefully if YARN not enabled

2014-04-28 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 32d9db3e2 - 38bf23eae SPARK-1652: Spark submit should fail gracefully if YARN not enabled Author: Patrick Wendell pwend...@gmail.com Closes #579 from pwendell/spark-submit-yarn-2 and squashes the following commits: 05e1b11 [Patrick

git commit: SPARK-1654 and SPARK-1653: Fixes in spark-submit.

2014-04-28 Thread pwendell
submit used deprecated environment variables (SPARK_CLASSPATH) which triggered warnings. Now we use new, more narrowly scoped, variables. Author: Patrick Wendell pwend...@gmail.com Closes #576 from pwendell/spark-submit and squashes the following commits: 67004c9 [Patrick Wendell] SPARK-1654

git commit: SPARK-1654 and SPARK-1653: Fixes in spark-submit.

2014-04-28 Thread pwendell
. Spark submit used deprecated environment variables (SPARK_CLASSPATH) which triggered warnings. Now we use new, more narrowly scoped, variables. Author: Patrick Wendell pwend...@gmail.com Closes #576 from pwendell/spark-submit and squashes the following commits: 67004c9 [Patrick Wendell] SPARK

git commit: SPARK-1652: Remove incorrect deprecation warning in spark-submit

2014-04-28 Thread pwendell
pwend...@gmail.com Closes #578 from pwendell/spark-submit-yarn and squashes the following commits: 96027c7 [Patrick Wendell] Test fixes b5be173 [Patrick Wendell] Review feedback 4ac9cac [Patrick Wendell] SPARK-1652: spark-submit for yarn prints warnings even though calling as expected Project

git commit: SPARK-1652: Remove incorrect deprecation warning in spark-submit

2014-04-28 Thread pwendell
Wendell pwend...@gmail.com Closes #578 from pwendell/spark-submit-yarn and squashes the following commits: 96027c7 [Patrick Wendell] Test fixes b5be173 [Patrick Wendell] Review feedback 4ac9cac [Patrick Wendell] SPARK-1652: spark-submit for yarn prints warnings even though calling as expected

git commit: Manual revert of rc2 version changes.

2014-04-29 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 ee964609e - 28633440c Manual revert of rc2 version changes. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/28633440 Tree:

git commit: [maven-release-plugin] prepare release v1.0.0-rc3

2014-04-29 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 28633440c - d636772ea [maven-release-plugin] prepare release v1.0.0-rc3 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/d636772e Tree:

Git Push Summary

2014-04-29 Thread pwendell
Repository: spark Updated Tags: refs/tags/v1.0.0-rc3 [created] e7c46933a

git commit: Minor fix to python table caching API.

2014-04-29 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 4ed58aaa3 - 0995787ac Minor fix to python table caching API. Author: Michael Armbrust mich...@databricks.com Closes #585 from marmbrus/pythonCacheTable and squashes the following commits: 7ec1f91 [Michael Armbrust] Minor fix to

git commit: HOTFIX: minor change to release script

2014-04-29 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 838cb0ea1 - c27ce2b4f HOTFIX: minor change to release script Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/c27ce2b4 Tree:

git commit: SPARK-1588. Restore SPARK_YARN_USER_ENV and SPARK_JAVA_OPTS for YARN.

2014-04-29 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 9754d1b12 - 5f48721ec SPARK-1588. Restore SPARK_YARN_USER_ENV and SPARK_JAVA_OPTS for YARN. Author: Sandy Ryza sa...@cloudera.com Closes #586 from sryza/sandy-spark-1588 and squashes the following commits: 35eb38e [Sandy Ryza]

git commit: SPARK-1004. PySpark on YARN

2014-04-30 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 b0ded1f6f - 177361c54 SPARK-1004. PySpark on YARN This reopens https://github.com/apache/incubator-spark/pull/640 against the new repo Author: Sandy Ryza sa...@cloudera.com Closes #30 from sryza/sandy-spark-1004 and squashes the

git commit: Fix SPARK-1629: Spark should inline use of commons-lang `SystemUtils.IS_...

2014-04-30 Thread pwendell
Repository: spark Updated Branches: refs/heads/master ff5be9a41 - 55100daa6 Fix SPARK-1629: Spark should inline use of commons-lang `SystemUtils.IS_... ...OS_WINDOWS` Author: witgo wi...@qq.com Closes #569 from witgo/SPARK-1629 and squashes the following commits: 31520eb [witgo] Merge

git commit: fix the spelling mistake

2014-05-01 Thread pwendell
Repository: spark Updated Branches: refs/heads/master a43d9c14f - 55c760ff9 fix the spelling mistake Author: wangfei wangfei_he...@126.com Closes #614 from scwf/pxcw and squashes the following commits: d1016ba [wangfei] fix spelling mistake Project:

git commit: Add tests for FileLogger, EventLoggingListener, and ReplayListenerBus

2014-05-01 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 40cf6d310 - 394d8cb1c Add tests for FileLogger, EventLoggingListener, and ReplayListenerBus Modifications to Spark core are limited to exposing functionality to test files + minor style fixes. (728 / 769 lines are from tests) Author:

git commit: SPARK-1695: java8-tests compiler error: package com.google.common.co...

2014-05-02 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 394d8cb1c - f25ebed9f SPARK-1695: java8-tests compiler error: package com.google.common.co... ...llections does not exist Author: witgo wi...@qq.com Closes #611 from witgo/SPARK-1695 and squashes the following commits: d77a887 [witgo]

git commit: delete no use var

2014-05-02 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 d28c05887 - b65def770 delete no use var Author: wangfei wangfei_he...@126.com Closes #613 from scwf/masterIndex and squashes the following commits: 1463056 [wangfei] delete no use var: masterIndex (cherry picked from commit

git commit: SPARK-1492. Update Spark YARN docs to use spark-submit

2014-05-02 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 4bf24f789 - 2b961d880 SPARK-1492. Update Spark YARN docs to use spark-submit Author: Sandy Ryza sa...@cloudera.com Closes #601 from sryza/sandy-spark-1492 and squashes the following commits: 5df1634 [Sandy Ryza] Address additional

git commit: SPARK-1492. Update Spark YARN docs to use spark-submit

2014-05-02 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 b65def770 - a314342da SPARK-1492. Update Spark YARN docs to use spark-submit Author: Sandy Ryza sa...@cloudera.com Closes #601 from sryza/sandy-spark-1492 and squashes the following commits: 5df1634 [Sandy Ryza] Address additional

git commit: Update SchemaRDD.scala

2014-05-03 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 d2cbd3d76 - 34f22bcc4 Update SchemaRDD.scala Modify spelling errors Author: ArcherShao archers...@users.noreply.github.com Closes #619 from ArcherShao/patch-1 and squashes the following commits: 2957195 [ArcherShao] Update

git commit: SPARK-1663. Corrections for several compile errors in streaming code examples, and updates to follow API changes

2014-05-03 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 3d0a02dff - 11d549417 SPARK-1663. Corrections for several compile errors in streaming code examples, and updates to follow API changes I gave the Streaming code examples, both Scala and Java, a test run today. I turned up a number of

git commit: [Bugfix] Tachyon file cleanup logical error

2014-05-03 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 08c4d1112 - bc3bfeaff [Bugfix] Tachyon file cleanup logical error Should lookup `shutdownDeleteTachyonPaths` instead of `shutdownDeletePaths`. Together with a minor style clean up: `find {...}.isDefined` to `exists {...}`. Author:

git commit: SPARK-1689 AppClient should indicate app is dead() when removed

2014-05-03 Thread pwendell
Repository: spark Updated Branches: refs/heads/master ce72c72ae - 34719ba32 SPARK-1689 AppClient should indicate app is dead() when removed Previously, we indicated disconnected(), which keeps the application in a limbo state where it has no executors but thinks it will get them soon. This

git commit: SPARK-1689 AppClient should indicate app is dead() when removed

2014-05-03 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 bc3bfeaff - 36e687d90 SPARK-1689 AppClient should indicate app is dead() when removed Previously, we indicated disconnected(), which keeps the application in a limbo state where it has no executors but thinks it will get them soon.

git commit: The default version of yarn is equal to the hadoop version

2014-05-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 e88a63625 - acbf307c2 The default version of yarn is equal to the hadoop version This is a part of [PR 590](https://github.com/apache/spark/pull/590) Author: witgo wi...@qq.com Closes #626 from witgo/yarn_version and squashes the

git commit: The default version of yarn is equal to the hadoop version

2014-05-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 92b2902ca - fb0543224 The default version of yarn is equal to the hadoop version This is a part of [PR 590](https://github.com/apache/spark/pull/590) Author: witgo wi...@qq.com Closes #626 from witgo/yarn_version and squashes the

git commit: SPARK-1658: Correctly identify if maven is installed and working

2014-05-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/master fb0543224 - e97a2e671 SPARK-1658: Correctly identify if maven is installed and working The current test is checking the exit code of tail rather than mvn. This new check will make sure that mvn is installed and was able to execute the

git commit: SPARK-1658: Correctly identify if maven is installed and working

2014-05-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 acbf307c2 - 2ee5f0447 SPARK-1658: Correctly identify if maven is installed and working The current test is checking the exit code of tail rather than mvn. This new check will make sure that mvn is installed and was able to execute the
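The underlying pitfall is that a plain shell pipeline reports the exit status of its last command. A small Python sketch of the effect, assuming a POSIX `sh` with `false` and `tail` available; `false` stands in for a failing `mvn` here.

```python
import subprocess

# In a plain POSIX shell pipeline the reported status is that of the last
# command, so the failure of `false` is hidden behind a successful `tail`.
piped = subprocess.run("false | tail -n 1", shell=True)

# Running the command on its own exposes its real exit status.
direct = subprocess.run("false", shell=True)

print("pipeline status:", piped.returncode)
print("direct status  :", direct.returncode)
```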

git commit: SPARK-1663. (Addendum) Fix signature of one version of JavaPairRDDStream.reduceByKeyAndWindow()

2014-05-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/master e97a2e671 - 0088cede5 SPARK-1663. (Addendum) Fix signature of one version of JavaPairRDDStream.reduceByKeyAndWindow() It appears that one of these methods doesn't use `org.apache.spark.api.java.function.Function2` like all the others,

git commit: SPARK-1663. (Addendum) Fix signature of one version of JavaPairRDDStream.reduceByKeyAndWindow()

2014-05-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 2ee5f0447 - 40d05a48c SPARK-1663. (Addendum) Fix signature of one version of JavaPairRDDStream.reduceByKeyAndWindow() It appears that one of these methods doesn't use `org.apache.spark.api.java.function.Function2` like all the

git commit: SPARK-1703 Warn users if Spark is run on JRE6 but compiled with JDK7.

2014-05-04 Thread pwendell
: Patrick Wendell pwend...@gmail.com Closes #627 from pwendell/jdk6 and squashes the following commits: a38a958 [Patrick Wendell] Code review feedback 94e9f84 [Patrick Wendell] SPARK-1703 Warn users if Spark is run on JRE6 but compiled with JDK7. (cherry picked from commit

git commit: SPARK-1703 Warn users if Spark is run on JRE6 but compiled with JDK7.

2014-05-04 Thread pwendell
Wendell pwend...@gmail.com Closes #627 from pwendell/jdk6 and squashes the following commits: a38a958 [Patrick Wendell] Code review feedback 94e9f84 [Patrick Wendell] SPARK-1703 Warn users if Spark is run on JRE6 but compiled with JDK7. Project: http://git-wip-us.apache.org/repos/asf/spark/repo

git commit: SPARK-1693: Most of the tests throw a java.lang.SecurityException when s...

2014-05-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/master f5041579f - d940e4c16 SPARK-1693: Most of the tests throw a java.lang.SecurityException when s... ...park built for hadoop 2.3.0 , 2.4.0 Author: witgo wi...@qq.com Closes #628 from witgo/SPARK-1693_new and squashes the following commits:

git commit: EC2 script should exit with non-zero code on UsageError

2014-05-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 ec0bce10d - 7305278a0 EC2 script should exit with non-zero code on UsageError This is especially important because some ssh errors are raised as UsageError, which prevents automated use of the script from detecting the failure. Author:
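The pattern behind this fix can be sketched as a thin wrapper around the script's entry point. This is an illustration, not the EC2 script: `UsageError` is redefined locally as a stand-in, and `run` and `failing_main` are hypothetical names.

```python
import sys


class UsageError(Exception):
    """Stand-in for the script's own error type."""


def run(real_main):
    """Run real_main and turn a UsageError into a non-zero exit status."""
    try:
        real_main()
    except UsageError as err:
        print("Error: %s" % err, file=sys.stderr)
        sys.exit(1)


def failing_main():
    raise UsageError("ssh to master failed")  # made-up failure for the demo


# Catch SystemExit here only so the demo can show the status it carries.
try:
    run(failing_main)
except SystemExit as exit_info:
    print("exit status:", exit_info.code)
```

A caller such as a cron job or CI step can then rely on the process status rather than scraping the output.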

git commit: Address SPARK-1717

2014-05-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 afcb9aefc - b5c62c887 Address SPARK-1717 I tested the change locally with Spark 0.9.1, but I can't test with 1.0.0 because there was no AMI for it at the time. It's a trivial fix, so it shouldn't cause any problems. Author:

git commit: Address SPARK-1717

2014-05-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/master b48a55ae9 - bb2bb0cf6 Address SPARK-1717 I tested the change locally with Spark 0.9.1, but I can't test with 1.0.0 because there was no AMI for it at the time. It's a trivial fix, so it shouldn't cause any problems. Author:

git commit: SPARK-1721: Reset the thread classLoader in the Mesos Executor

2014-05-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 73b0cbcc2 - 3292e2a71 SPARK-1721: Reset the thread classLoader in the Mesos Executor This is because Mesos calls it with a different environment or something, the result is that the Spark jar is missing and it can't load classes. This

git commit: [SPARK-1681] Include datanucleus jars in Spark Hive distribution

2014-05-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/master a975a19f2 - cf0a8f020 [SPARK-1681] Include datanucleus jars in Spark Hive distribution This copies the datanucleus jars over from `lib_managed` into `dist/lib`, if any. The `CLASSPATH` must also be updated to reflect this change. Author:

git commit: [SPARK-1678][SPARK-1679] In-memory compression bug fix and made compression configurable, disabled by default

2014-05-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 32c960a01 - 2853e56f6 [SPARK-1678][SPARK-1679] In-memory compression bug fix and made compression configurable, disabled by default In-memory compression is now configurable in `SparkConf` by the

git commit: [SPARK-1678][SPARK-1679] In-memory compression bug fix and made compression configurable, disabled by default

2014-05-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 98750a74d - 6d721c5f7 [SPARK-1678][SPARK-1679] In-memory compression bug fix and made compression configurable, disabled by default In-memory compression is now configurable in `SparkConf` by the `spark.sql.inMemoryCompression.enabled`

git commit: [SPARK-1735] Add the missing special profiles to make-distribution.sh

2014-05-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 2853e56f6 - 4708eff67 [SPARK-1735] Add the missing special profiles to make-distribution.sh 73b0cbcc241cca3d318ff74340e80b02f884acbd introduced a few special profiles that are not covered in the `make-distribution.sh`. This affects

git commit: SPARK-1474: Spark on yarn assembly doesn't include AmIpFilter

2014-05-06 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 4ff39292c - 0c3e4150f SPARK-1474: Spark on yarn assembly doesn't include AmIpFilter We use org.apache.hadoop.yarn.server.webproxy.amfilter.AmIpFilter in spark on yarn but are not including it in the assembly jar. I tested this on yarn

git commit: SPARK-1474: Spark on yarn assembly doesn't include AmIpFilter

2014-05-06 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 0a5a46811 - 1e829905c SPARK-1474: Spark on yarn assembly doesn't include AmIpFilter We use org.apache.hadoop.yarn.server.webproxy.amfilter.AmIpFilter in spark on yarn but are not including it in the assembly jar. I tested this on yarn

git commit: Fix two download suggestions in the docs:

2014-05-06 Thread pwendell
won't. Author: Patrick Wendell pwend...@gmail.com Closes #662 from pwendell/quick-start and squashes the following commits: 0622f27 [Patrick Wendell] Fix two download suggestions in the docs: Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos

git commit: SPARK-1734: spark-submit throws an exception: Exception in thread main...

2014-05-06 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 0aaa2c6d0 - 48cc9a965 SPARK-1734: spark-submit throws an exception: Exception in thread main... ... java.lang.ClassNotFoundException: org.apache.spark.broadcast.TorrentBroadcastFactory Author: witgo wi...@qq.com Closes #665 from

git commit: SPARK-1737: Warn rather than fail when Java 7+ is used to create distributions

2014-05-06 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 d7ddb26e1 - d8f1b33f4 SPARK-1737: Warn rather than fail when Java 7+ is used to create distributions Also moves a few lines of code around in make-distribution.sh. Author: Patrick Wendell pwend...@gmail.com Closes #669 from pwendell

git commit: [SPARK-1743][MLLIB] add loadLibSVMFile and saveAsLibSVMFile to pyspark

2014-05-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 879bd - bb90e87f6 [SPARK-1743][MLLIB] add loadLibSVMFile and saveAsLibSVMFile to pyspark Make loading/saving labeled data easier for pyspark users. Also changed type check in `SparseVector` to allow numpy integers. Author:

git commit: [HOTFIX] SPARK-1637: There are some Streaming examples added after the PR #571 was last updated.

2014-05-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 51e277557 - ade47562b [HOTFIX] SPARK-1637: There are some Streaming examples added after the PR #571 was last updated. This resulted in Compilation Errors. cc @mateiz project not compiling currently. Author: Sandeep

git commit: MLlib documentation fix

2014-05-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 322b1808d - d38febee4 MLlib documentation fix Fixed the documentation for that `loadLibSVMData` is changed to `loadLibSVMFile`. Author: DB Tsai dbt...@alpinenow.com Closes #703 from dbtsai/dbtsai-docfix and squashes the following

git commit: Add Python includes to path before depickling broadcast values

2014-05-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 71ad53f81 - 2a669a70d Add Python includes to path before depickling broadcast values This fixes https://issues.apache.org/jira/browse/SPARK-1731 by adding the Python includes to the PYTHONPATH before depickling the broadcast values

git commit: Add Python includes to path before depickling broadcast values

2014-05-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/master c05d11bb3 - 3776f2f28 Add Python includes to path before depickling broadcast values This fixes https://issues.apache.org/jira/browse/SPARK-1731 by adding the Python includes to the PYTHONPATH before depickling the broadcast values

git commit: [SPARK-1778] [SQL] Add 'limit' transformation to SchemaRDD.

2014-05-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 a61b71cad - 7486474d6 [SPARK-1778] [SQL] Add 'limit' transformation to SchemaRDD. Add `limit` transformation to `SchemaRDD`. Author: Takuya UESHIN ues...@happy-camper.st Closes #711 from ueshin/issues/SPARK-1778 and squashes the

git commit: [SPARK-1690] Tolerating empty elements when saving Python RDD to text files

2014-05-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 3776f2f28 - 6c2691d0a [SPARK-1690] Tolerating empty elements when saving Python RDD to text files Tolerate empty strings in PythonRDD Author: Kan Zhang kzh...@apache.org Closes #644 from kanzhang/SPARK-1690 and squashes the following

git commit: fix broken in link in python docs

2014-05-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 9fbb22c20 - 71ad53f81 fix broken in link in python docs Author: Andy Konwinski andykonwin...@gmail.com Closes #650 from andyk/python-docs-link-fix and squashes the following commits: a1f9d51 [Andy Konwinski] fix broken in link in

git commit: Revert Enabled incremental build that comes with sbt 0.13.2

2014-05-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 71ce7eb0e - 758e5439f Revert Enabled incremental build that comes with sbt 0.13.2 This reverts commit 71ce7eb0e5878f0bafd64bdd201ae257a3bfe106. I meant only to merge this into master. It's an experimental build feature. Project:

git commit: SPARK-1708. Add a ClassTag on Serializer and things that depend on it

2014-05-11 Thread pwendell
-- Accumulator is passed as part of a bigger Task or TaskResult object via the closure serializer so it doesn't seem super useful to add the ClassTag there. Broadcast and AccumulableCollection in contrast were being serialized directly. CC @rxin, @pwendell, @heathermiller Author: Matei Zaharia ma

git commit: [SPARK-1157][MLlib] Bug fix: lossHistory should exclude rejection steps, and remove miniBatch

2014-05-11 Thread pwendell
Repository: spark Updated Branches: refs/heads/master d38febee4 - 910a13b3c [SPARK-1157][MLlib] Bug fix: lossHistory should exclude rejection steps, and remove miniBatch Getting the lossHistory from Breeze's API which already excludes the rejection steps in line search. Also, remove the

git commit: MINOR: Removing dead code.

2014-05-12 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 5c8e8de99 - 1d56cd544 MINOR: Removing dead code. Meant to do this when patching up the last merge. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/1d56cd54 Tree:

git commit: Revert SPARK-1786: Edge Partition Serialization

2014-05-12 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 09e7aa4ee - 19ccf2093 Revert SPARK-1786: Edge Partition Serialization This reverts commit 09e7aa4eed8834b446c0f59ebfc1034e1f109ed6. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

git commit: SPARK-1772 Stop catching Throwable, let Executors die

2014-05-12 Thread pwendell
Repository: spark Updated Branches: refs/heads/master af15c82bf - 3af1f3864 SPARK-1772 Stop catching Throwable, let Executors die The main issue this patch fixes is [SPARK-1772](https://issues.apache.org/jira/browse/SPARK-1772), in which Executors may not die when fatal exceptions (e.g.,

git commit: SPARK-1772 Stop catching Throwable, let Executors die

2014-05-12 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 19ccf2093 - c88adbb17 SPARK-1772 Stop catching Throwable, let Executors die The main issue this patch fixes is [SPARK-1772](https://issues.apache.org/jira/browse/SPARK-1772), in which Executors may not die when fatal exceptions

git commit: SPARK-1806: Upgrade Mesos dependency to 0.18.1

2014-05-12 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 c88adbb17 - 0be8b45d7 SPARK-1806: Upgrade Mesos dependency to 0.18.1 Enabled Mesos (0.18.1) dependency with shaded protobuf Why is this needed? Avoids any protobuf version collision between Mesos and any other dependency in Spark e.g.

git commit: SPARK-1786: Reopening PR 724

2014-05-12 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 0be8b45d7 - 642ad497d SPARK-1786: Reopening PR 724 Addressing issue in MimaBuild.scala. Author: Ankur Dave ankurd...@gmail.com Author: Joseph E. Gonzalez joseph.e.gonza...@gmail.com Closes #742 from

git commit: BUILD: Include Hive with default packages when creating a release

2014-05-12 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 0e2bde203 - 1e4a65e69 BUILD: Include Hive with default packages when creating a release Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/1e4a65e6 Tree:
