git commit: [SPARK-1824] Remove master from Python examples

2014-05-16 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 318739a07 - 03b424263 [SPARK-1824] Remove master from Python examples A recent PR (#552) fixed this for all Scala / Java examples. We need to do it for python too. Note that this blocks on #799, which makes `bin/pyspark` go through

git commit: Version bump of spark-ec2 scripts

2014-05-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 a16a19fbd - 9cd12f33d Version bump of spark-ec2 scripts This will allow us to change things in spark-ec2 related to the 1.0 release. Author: Patrick Wendell pwend...@gmail.com Closes #809 from pwendell/spark-ec2 and squashes

git commit: [SPARK-1824] Remove master from Python examples

2014-05-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 4b8ec6fcf - cf6cbe9f7 [SPARK-1824] Remove master from Python examples A recent PR (#552) fixed this for all Scala / Java examples. We need to do it for python too. Note that this blocks on #799, which makes `bin/pyspark` go through Spark

[1/2] git commit: Revert [maven-release-plugin] prepare for next development iteration

2014-05-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 3b3d7c8ec - 80889110a Revert [maven-release-plugin] prepare for next development iteration This reverts commit e5436b8c1a79ce108f3af402455ac5f6dc5d1eb3. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

git commit: [maven-release-plugin] prepare release v1.0.0-rc9

2014-05-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 80889110a - 920f947eb [maven-release-plugin] prepare release v1.0.0-rc9 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/920f947e Tree:

git commit: [maven-release-plugin] prepare for next development iteration

2014-05-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 920f947eb - f8e611955 [maven-release-plugin] prepare for next development iteration Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/f8e61195 Tree:

Git Push Summary

2014-05-17 Thread pwendell
Repository: spark Updated Tags: refs/tags/v1.0.0-rc9 [created] 093aeb40f

[2/2] git commit: Revert [maven-release-plugin] prepare release v1.0.0-rc8

2014-05-17 Thread pwendell
Revert [maven-release-plugin] prepare release v1.0.0-rc8 This reverts commit 80eea0f111c06260ffaa780d2f3f7facd09c17bc. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/80889110 Tree:

git commit: Make deprecation warning less severe

2014-05-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/master cf6cbe9f7 - 442808a74 Make deprecation warning less severe Just a small change. I think it's good not to scare people who are using the old options. Author: Patrick Wendell pwend...@gmail.com Closes #810 from pwendell/warnings

git commit: Fix spark-submit path in spark-shell pyspark

2014-05-18 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 442808a74 - ebcd2d688 Fix spark-submit path in spark-shell pyspark Author: Neville Li nevi...@spotify.com Closes #812 from nevillelyh/neville/v1.0 and squashes the following commits: 0dc33ed [Neville Li] Fix spark-submit path in pyspark

svn commit: r5420 - /release/incubator/spark/spark-0.8.1-incubating/

2014-05-26 Thread pwendell
Author: pwendell Date: Mon May 26 19:46:30 2014 New Revision: 5420 Log: Moving Spark 0.8.1 release to our new dist location. Removed: release/incubator/spark/spark-0.8.1-incubating/

svn commit: r5420 - /release/spark/spark-0.8.1-incubating/

2014-05-26 Thread pwendell
Author: pwendell Date: Mon May 26 19:46:30 2014 New Revision: 5420 Log: Moving Spark 0.8.1 release to our new dist location. Added: release/spark/spark-0.8.1-incubating/ - copied from r5419, release/incubator/spark/spark-0.8.1-incubating/

svn commit: r5421 - /release/incubator/spark/spark-0.8.0-incubating/

2014-05-26 Thread pwendell
Author: pwendell Date: Mon May 26 19:47:24 2014 New Revision: 5421 Log: Moving Spark 0.8.0 release to our new dist location. Removed: release/incubator/spark/spark-0.8.0-incubating/

svn commit: r5421 - /release/spark/spark-0.8.0-incubating/

2014-05-26 Thread pwendell
Author: pwendell Date: Mon May 26 19:47:24 2014 New Revision: 5421 Log: Moving Spark 0.8.0 release to our new dist location. Added: release/spark/spark-0.8.0-incubating/ - copied from r5420, release/incubator/spark/spark-0.8.0-incubating/

git commit: Fix doc about NetworkWordCount/JavaNetworkWordCount usage of spark streaming

2014-05-27 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 9df86835b - 82eadc3b0 Fix doc about NetworkWordCount/JavaNetworkWordCount usage of spark streaming Usage: NetworkWordCount master hostname port -- Usage: NetworkWordCount hostname port Usage: JavaNetworkWordCount master hostname port --

git commit: Fix doc about NetworkWordCount/JavaNetworkWordCount usage of spark streaming

2014-05-27 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 24a1cac4e - 3669bb8e6 Fix doc about NetworkWordCount/JavaNetworkWordCount usage of spark streaming Usage: NetworkWordCount master hostname port -- Usage: NetworkWordCount hostname port Usage: JavaNetworkWordCount master hostname port

git commit: Spark 1916

2014-05-28 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 7801d44fd - 4312cf0ba Spark 1916 The changes could be ported back to 0.9 as well. Changing in.read to in.readFully to read the whole input stream rather than the first 1020 bytes. This should ok considering that Flume caps the body size

git commit: Spark 1916

2014-05-28 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-0.9 763394951 - 234a3786a Spark 1916 The changes could be ported back to 0.9 as well. Changing in.read to in.readFully to read the whole input stream rather than the first 1020 bytes. This should ok considering that Flume caps the body

Git Push Summary

2014-05-29 Thread pwendell
Repository: spark Updated Tags: refs/tags/v1.0.0-rc11 [deleted] f39d38eae

Git Push Summary

2014-05-29 Thread pwendell
Repository: spark Updated Tags: refs/tags/v1.0.0 [created] 2f1dc868e

Git Push Summary

2014-05-29 Thread pwendell
Repository: spark Updated Tags: refs/tags/v1.0.0-rc8 [deleted] 9abe0192a

Git Push Summary

2014-05-29 Thread pwendell
Repository: spark Updated Tags: refs/tags/v1.0.0-rc9 [deleted] 093aeb40f

Git Push Summary

2014-05-29 Thread pwendell
Repository: spark Updated Tags: refs/tags/v1.0.0-rc10 [deleted] eafc6f6f4

git commit: [SPARK-1820] Make GenerateMimaIgnore @DeveloperApi annotation aware.

2014-05-30 Thread pwendell
Repository: spark Updated Branches: refs/heads/master b7e28fa45 - 978a3 [SPARK-1820] Make GenerateMimaIgnore @DeveloperApi annotation aware. We add all the classes annotated as `DeveloperApi` to `~/.mima-excludes`. Author: Prashant Sharma prashan...@imaginea.com Author: nikhil7sh

[3/5] [SPARK-1566] consolidate programming guide, and general doc updates

2014-05-30 Thread pwendell
http://git-wip-us.apache.org/repos/asf/spark/blob/c8bf4131/docs/graphx-programming-guide.md -- diff --git a/docs/graphx-programming-guide.md b/docs/graphx-programming-guide.md index 42ab27b..fdb9f98 100644 ---

[5/5] git commit: [SPARK-1566] consolidate programming guide, and general doc updates

2014-05-30 Thread pwendell
[SPARK-1566] consolidate programming guide, and general doc updates This is a fairly large PR to clean up and update the docs for 1.0. The major changes are: * A unified programming guide for all languages replaces language-specific ones and shows language-specific info in tabs * New

[5/5] git commit: [SPARK-1566] consolidate programming guide, and general doc updates

2014-05-30 Thread pwendell
[SPARK-1566] consolidate programming guide, and general doc updates This is a fairly large PR to clean up and update the docs for 1.0. The major changes are: * A unified programming guide for all languages replaces language-specific ones and shows language-specific info in tabs * New

svn commit: r1598518 - in /spark: releases/_posts/2014-05-30-spark-release-1-0-0.md site/releases/spark-release-1-0-0.html

2014-05-30 Thread pwendell
Author: pwendell Date: Fri May 30 09:29:14 2014 New Revision: 1598518 URL: http://svn.apache.org/r1598518 Log: Adding release notes for Spark 1.0.0 Added: spark/releases/_posts/2014-05-30-spark-release-1-0-0.md spark/site/releases/spark-release-1-0-0.html Added: spark/releases/_posts

svn commit: r1598533 - in /spark: releases/_posts/2014-05-30-spark-release-1-0-0.md site/releases/spark-release-1-0-0.html

2014-05-30 Thread pwendell
Author: pwendell Date: Fri May 30 10:15:40 2014 New Revision: 1598533 URL: http://svn.apache.org/r1598533 Log: Fixing links in release post. Modified: spark/releases/_posts/2014-05-30-spark-release-1-0-0.md spark/site/releases/spark-release-1-0-0.html Modified: spark/releases/_posts

svn commit: r1598548 - in /spark: downloads.md site/downloads.html

2014-05-30 Thread pwendell
Author: pwendell Date: Fri May 30 10:54:11 2014 New Revision: 1598548 URL: http://svn.apache.org/r1598548 Log: Fixing link on downloads doc. Modified: spark/downloads.md spark/site/downloads.html Modified: spark/downloads.md URL: http://svn.apache.org/viewvc/spark/downloads.md?rev

git commit: Fixed a typo

2014-06-03 Thread pwendell
Repository: spark Updated Branches: refs/heads/master b1feb6020 - f4dd665c8 Fixed a typo in RowMatrix.scala Author: DB Tsai dbt...@dbtsai.com Closes #959 from dbtsai/dbtsai-typo and squashes the following commits: fab0e0e [DB Tsai] Fixed typo Project:

git commit: Update spark-ec2 scripts for 1.0.0 on master

2014-06-03 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 5284ca78d - ab7c62d57 Update spark-ec2 scripts for 1.0.0 on master The change was previously committed only to branch-1.0 as part of https://github.com/apache/spark/commit/a34e6fda1d6fb8e769c21db70845f1a6dde968d8 Author: Aaron Davidson

git commit: SPARK-1806 (addendum) Use non-deprecated methods in Mesos 0.18

2014-06-03 Thread pwendell
Repository: spark Updated Branches: refs/heads/master ab7c62d57 - 4ca062566 SPARK-1806 (addendum) Use non-deprecated methods in Mesos 0.18 The update to Mesos 0.18 caused some deprecation warnings in the build. The change to the non-deprecated version is straightforward as it emulates what

git commit: SPARK-1518: FileLogger: Fix compile against Hadoop trunk

2014-06-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 189df165b - 1765c8d0d SPARK-1518: FileLogger: Fix compile against Hadoop trunk In Hadoop trunk (currently Hadoop 3.0.0), the deprecated FSDataOutputStream#sync() method has been removed. Instead, we should call FSDataOutputStream#hflush,

git commit: SPARK-1518: FileLogger: Fix compile against Hadoop trunk

2014-06-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 d96794132 - 3df55cb69 SPARK-1518: FileLogger: Fix compile against Hadoop trunk In Hadoop trunk (currently Hadoop 3.0.0), the deprecated FSDataOutputStream#sync() method has been removed. Instead, we should call

git commit: SPARK-1790: Update EC2 scripts to support r3 instance types

2014-06-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 1765c8d0d - 11ded3f66 SPARK-1790: Update EC2 scripts to support r3 instance types Author: Varakhedi Sujeet svarakh...@gopivotal.com Closes #960 from sujeetv/ec2-r3 and squashes the following commits: 3cb9fd5 [Varakhedi Sujeet]

git commit: SPARK-1790: Update EC2 scripts to support r3 instance types

2014-06-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 3df55cb69 - 914d98ecd SPARK-1790: Update EC2 scripts to support r3 instance types Author: Varakhedi Sujeet svarakh...@gopivotal.com Closes #960 from sujeetv/ec2-r3 and squashes the following commits: 3cb9fd5 [Varakhedi Sujeet]

git commit: SPARK-1790: Update EC2 scripts to support r3 instance types

2014-06-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-0.9 41e7853fc - 6634a348a SPARK-1790: Update EC2 scripts to support r3 instance types Author: Varakhedi Sujeet svarakh...@gopivotal.com Closes #960 from sujeetv/ec2-r3 and squashes the following commits: 3cb9fd5 [Varakhedi Sujeet]

git commit: Fix issue in ReplSuite with hadoop-provided profile.

2014-06-04 Thread pwendell
Repository: spark Updated Branches: refs/heads/master abea2d4ff - b77c19be0 Fix issue in ReplSuite with hadoop-provided profile. When building the assembly with the maven hadoop-provided profile, the executors were failing to come up because Hadoop classes were not found in the classpath

git commit: [SPARK-2029] Bump pom.xml version number of master branch to 1.1.0-SNAPSHOT.

2014-06-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/master b77c19be0 - 7c160293d [SPARK-2029] Bump pom.xml version number of master branch to 1.1.0-SNAPSHOT. Author: Takuya UESHIN ues...@happy-camper.st Closes #974 from ueshin/issues/SPARK-2029 and squashes the following commits: e19e8f4 [Takuya

git commit: [SPARK-2030] Bump SparkBuild.scala version number of branch-1.0 to 1.0.1-SNAPSHOT.

2014-06-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 914d98ecd - d365fbf94 [SPARK-2030] Bump SparkBuild.scala version number of branch-1.0 to 1.0.1-SNAPSHOT. Author: Takuya UESHIN ues...@happy-camper.st Closes #975 from ueshin/issues/SPARK-2030 and squashes the following commits:

git commit: SPARK-1677: allow user to disable output dir existence checking

2014-06-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 7c160293d - 89cdbb087 SPARK-1677: allow user to disable output dir existence checking https://issues.apache.org/jira/browse/SPARK-1677 For compatibility with older versions of Spark it would be nice to have an option

git commit: SPARK-1677: allow user to disable output dir existence checking

2014-06-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 d365fbf94 - 8100cbdb7 SPARK-1677: allow user to disable output dir existence checking https://issues.apache.org/jira/browse/SPARK-1677 For compatibility with older versions of Spark it would be nice to have an option

git commit: sbt 0.13.X should be using sbt-assembly 0.11.X

2014-06-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/master f6143f127 - 5473aa7c0 sbt 0.13.X should be using sbt-assembly 0.11.X https://github.com/sbt/sbt-assembly/blob/master/README.md Author: Kalpit Shah shahkalpi...@gmail.com Closes #555 from kalpit/upgrade/sbtassembly and squashes the

git commit: Remove compile-scoped junit dependency.

2014-06-05 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 5473aa7c0 - 668cb1def Remove compile-scoped junit dependency. This avoids having junit classes showing up in the assembly jar. I verified that only test classes in the jtransforms package use junit. Author: Marcelo Vanzin

git commit: [SPARK-1841]: update scalatest to version 2.1.5

2014-06-06 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 8d210560b - 41c4a3310 [SPARK-1841]: update scalatest to version 2.1.5 Author: witgo wi...@qq.com Closes #713 from witgo/scalatest and squashes the following commits: b627a6a [witgo] merge master 51fb3d6 [witgo] merge master 3771474

git commit: HOTFIX: Support empty body in merge script

2014-06-07 Thread pwendell
Repository: spark Updated Branches: refs/heads/master a6c72ab16 - 3ace10dc9 HOTFIX: Support empty body in merge script Discovered in #992 Author: Patrick Wendell pwend...@gmail.com Closes #1007 from pwendell/hotfix and squashes the following commits: af90aa0 [Patrick Wendell] HOTFIX

git commit: SPARK-2056 Set RDD name to input path

2014-06-07 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 3ace10dc9 - 7b877b270 SPARK-2056 Set RDD name to input path Author: Neville Li nevi...@spotify.com Closes #992 from nevillelyh/master and squashes the following commits: 3011739 [Neville Li] [SPARK-2056] Set RDD name to input path

git commit: SPARK-2026: Maven Hadoop Profiles Should Set The Hadoop Version

2014-06-08 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 7b877b270 - a338834f9 SPARK-2026: Maven Hadoop Profiles Should Set The Hadoop Version The Maven Profiles that refer to hadoopX, e.g. `hadoop2.4`, should set the expected `hadoop.version` and `yarn.version`. e.g. ``` profile

git commit: SPARK-1898: In deploy.yarn.Client, use YarnClient not YarnClientImpl

2014-06-08 Thread pwendell
Repository: spark Updated Branches: refs/heads/master a338834f9 - ee96e9406 SPARK-1898: In deploy.yarn.Client, use YarnClient not YarnClientImpl https://issues.apache.org/jira/browse/SPARK-1898 Author: Colin Patrick McCabe cmcc...@cloudera.com Closes #850 from cmccabe/master and squashes

git commit: Update run-example

2014-06-08 Thread pwendell
Repository: spark Updated Branches: refs/heads/master a71c6d1cf - e9261d086 Update run-example Old code can only be ran under spark_home and use bin/run-example. Error ./run-example: line 55: ./bin/spark-submit: No such file or directory appears when running in other place. So change this

git commit: [SPARK-2067] use relative path for Spark logo in UI

2014-06-09 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 3f8450ec6 - 502a8f795 [SPARK-2067] use relative path for Spark logo in UI Author: Neville Li nevi...@spotify.com Closes #1006 from nevillelyh/gh/SPARK-2067 and squashes the following commits: 9ee64cf [Neville Li] [SPARK-2067] use

git commit: HOTFIX: Fix Python tests on Jenkins.

2014-06-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 35894af8c - 1d9b7651e HOTFIX: Fix Python tests on Jenkins. Author: Patrick Wendell pwend...@gmail.com Closes #1036 from pwendell/jenkins-test and squashes the following commits: 9c99856 [Patrick Wendell] Better output during tests

git commit: HOTFIX: Increase time limit for Bagel test

2014-06-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/master fb499be1a - 55a0e87ee HOTFIX: Increase time limit for Bagel test The test was timing out on some slow EC2 workers. Author: Ankur Dave ankurd...@gmail.com Closes #1037 from ankurdave/bagel-test-time-limit and squashes the following

git commit: HOTFIX: Increase time limit for Bagel test

2014-06-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 1d9b7651e - 5bc186dc7 HOTFIX: Increase time limit for Bagel test The test was timing out on some slow EC2 workers. Author: Ankur Dave ankurd...@gmail.com Closes #1037 from ankurdave/bagel-test-time-limit and squashes the following

git commit: [SPARK-1998] SparkFlumeEvent with body bigger than 1020 bytes are not re...

2014-06-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 86c4a79dc - ac8c27bdf [SPARK-1998] SparkFlumeEvent with body bigger than 1020 bytes are not re... flume event sent to Spark will fail if the body is too large and numHeaders is greater than zero Author: joyyoj suns...@gmail.com

git commit: [SPARK-1998] SparkFlumeEvent with body bigger than 1020 bytes are not re...

2014-06-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 1abbde0e8 - 296604430 [SPARK-1998] SparkFlumeEvent with body bigger than 1020 bytes are not re... flume event sent to Spark will fail if the body is too large and numHeaders is greater than zero Author: joyyoj suns...@gmail.com Closes

git commit: [SPARK-1940] Enabling rolling of executor logs, and automatic cleanup of old executor logs

2014-06-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 296604430 - 4823bf470 [SPARK-1940] Enabling rolling of executor logs, and automatic cleanup of old executor logs Currently, in the default log4j configuration, all the executor logs get sent to the file

git commit: Resolve scalatest warnings during build

2014-06-10 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 4823bf470 - c48b6222e Resolve scalatest warnings during build Author: witgo wi...@qq.com Closes #1032 from witgo/ShouldMatchers and squashes the following commits: 7ebf34c [witgo] Resolve scalatest warnings during build Project:

git commit: [SPARK-2108] Mark SparkContext methods that return block information as developer API's

2014-06-11 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 54ff00547 - 9ef076510 [SPARK-2108] Mark SparkContext methods that return block information as developer API's Author: Prashant Sharma prashan...@imaginea.com Closes #1047 from ScrapCodes/SPARK-2108/mark-as-dev-api and squashes the

git commit: [SPARK-2108] Mark SparkContext methods that return block information as developer API's

2014-06-11 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 5b754b45f - e508f599f [SPARK-2108] Mark SparkContext methods that return block information as developer API's Author: Prashant Sharma prashan...@imaginea.com Closes #1047 from ScrapCodes/SPARK-2108/mark-as-dev-api and squashes the

git commit: HOTFIX: PySpark tests should be order insensitive.

2014-06-11 Thread pwendell
Repository: spark Updated Branches: refs/heads/master fe78b8b6f - 14e6dc94f HOTFIX: PySpark tests should be order insensitive. This has been messing up the SQL PySpark tests on Jenkins. Author: Patrick Wendell pwend...@gmail.com Closes #1054 from pwendell/pyspark and squashes the following

git commit: HOTFIX: Forgot to remove false change in previous commit

2014-06-11 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 14e6dc94f - d45e0c6b9 HOTFIX: Forgot to remove false change in previous commit Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/d45e0c6b Tree:

git commit: HOTFIX: PySpark tests should be order insensitive.

2014-06-11 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 597c7efdc - 81049eb54 HOTFIX: PySpark tests should be order insensitive. This has been messing up the SQL PySpark tests on Jenkins. Author: Patrick Wendell pwend...@gmail.com Closes #1054 from pwendell/pyspark and squashes

git commit: fixed typo in docstring for min()

2014-06-12 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 4d8ae709f - 43d53d51c fixed typo in docstring for min() Hi, I found this typo while learning spark and thought I'd do a pull request. Author: Jeff Thompson jeffreykeatingthomp...@gmail.com Closes #1065 from

git commit: SPARK-554. Add aggregateByKey.

2014-06-12 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 43d53d51c - ce92a9c18 SPARK-554. Add aggregateByKey. Author: Sandy Ryza sa...@cloudera.com Closes #705 from sryza/sandy-spark-554 and squashes the following commits: 2302b8f [Sandy Ryza] Add MIMA exclude f52e0ad [Sandy Ryza] Fix Python

git commit: SPARK-1843: Replace assemble-deps with env variable.

2014-06-12 Thread pwendell
using compiled classes $ unset SPARK_PREPEND_CLASSES $ ./bin/spark-shell # Back to normal ``` Author: Patrick Wendell pwend...@gmail.com Closes #877 from pwendell/assemble-deps and squashes the following commits: 8a11345 [Patrick Wendell] Merge remote-tracking branch 'apache/master

git commit: SPARK-1999: StorageLevel in storage tab and RDD Storage Info never changes

2014-06-16 Thread pwendell
Repository: spark Updated Branches: refs/heads/master ca5d9d43b - a63aa1adb SPARK-1999: StorageLevel in storage tab and RDD Storage Info never changes StorageLevel in 'storage tab' and 'RDD Storage Info' never changes even if you call rdd.unpersist() and then you give the rdd another

git commit: SPARK-1999: StorageLevel in storage tab and RDD Storage Info never changes

2014-06-16 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 609e5ff20 - 5e85f87ac SPARK-1999: StorageLevel in storage tab and RDD Storage Info never changes StorageLevel in 'storage tab' and 'RDD Storage Info' never changes even if you call rdd.unpersist() and then you give the rdd another

git commit: SPARK-2148 Add link to requirements for custom equals() and hashcode() methods

2014-06-16 Thread pwendell
Repository: spark Updated Branches: refs/heads/master a63aa1adb - 9672ee07f SPARK-2148 Add link to requirements for custom equals() and hashcode() methods https://issues.apache.org/jira/browse/SPARK-2148 Author: Andrew Ash and...@andrewash.com Closes #1092 from ash211/SPARK-2148 and

git commit: Updating docs to include missing information about reducers and clarify ...

2014-06-16 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 9672ee07f - 119b06a04 Updating docs to include missing information about reducers and clarify ... ...how the OFFHEAP storage level works (there has been confusion around this). Author: Ali Ghodsi a...@cs.berkeley.edu Closes #1089 from

git commit: SPARK-2039: apply output dir existence checking for all output formats

2014-06-16 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 119b06a04 - 716c88aa1 SPARK-2039: apply output dir existence checking for all output formats https://issues.apache.org/jira/browse/SPARK-2039 apply output dir existence checking for all output formats Author: CodingCat

git commit: [SPARK-2130] End-user friendly String repr for StorageLevel in Python

2014-06-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 7afa912e7 - d81c08bac [SPARK-2130] End-user friendly String repr for StorageLevel in Python JIRA issue https://issues.apache.org/jira/browse/SPARK-2130 This PR adds an end-user friendly String representation for StorageLevel in Python,

git commit: SPARK-1990: added compatibility for python 2.6 for ssh_read command

2014-06-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/master d81c08bac - 8cd04c3ee SPARK-1990: added compatibility for python 2.6 for ssh_read command https://issues.apache.org/jira/browse/SPARK-1990 There were some posts on the lists that spark-ec2 does not work with Python 2.6. In addition, we

git commit: [SPARK-2144] ExecutorsPage reports incorrect # of RDD blocks

2014-06-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 6b5f64aaf - 3d4fa2dab [SPARK-2144] ExecutorsPage reports incorrect # of RDD blocks This is reproducible whenever we drop a block because of memory pressure. This is because StorageStatusListener actually never removes anything from

git commit: [SPARK-2144] ExecutorsPage reports incorrect # of RDD blocks

2014-06-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 23a12ce20 - 09deb3eee [SPARK-2144] ExecutorsPage reports incorrect # of RDD blocks This is reproducible whenever we drop a block because of memory pressure. This is because StorageStatusListener actually never removes anything from the

git commit: SPARK-2146. Fix takeOrdered doc

2014-06-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 6ece39bd6 - 7f5df6a17 SPARK-2146. Fix takeOrdered doc Removes Python syntax in Scaladoc, corrects result in Scaladoc, and removes irrelevant cache() call in Python doc. Author: Sandy Ryza sa...@cloudera.com Closes #1086 from

git commit: Revert SPARK-2038: rename conf parameters in the saveAsHadoop functions

2014-06-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/master d2f4f30b1 - 9e4b4bd08 Revert SPARK-2038: rename conf parameters in the saveAsHadoop functions This reverts commit 443f5e1bbcf9ec55e5ce6e4f738a002a47818100. This commit unfortunately would break source compatibility if users have named the

git commit: [STREAMING] SPARK-2009 Key not found exception when slow receiver starts

2014-06-17 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 9e4b4bd08 - 889f7b762 [STREAMING] SPARK-2009 Key not found exception when slow receiver starts I got java.util.NoSuchElementException: key not found: 1401756085000 ms exception when using kafka stream and 1 sec batchPeriod. Investigation

git commit: Minor fix

2014-06-19 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 640c29436 - 67fca189c Minor fix The value env is never used in SparkContext.scala. Add detailed comment for method setDelaySeconds in MetadataCleaner.scala instead of the unsure one. Author: WangTao barneystin...@aliyun.com Closes #1105

git commit: HOTFIX: SPARK-2208 local metrics tests can fail on fast machines

2014-06-19 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 5464e7917 - e5514790d HOTFIX: SPARK-2208 local metrics tests can fail on fast machines Author: Patrick Wendell pwend...@gmail.com Closes #1141 from pwendell/hotfix and squashes the following commits: 83e4c79 [Patrick Wendell] HOTFIX

git commit: SPARK-1902 Silence stacktrace from logs when doing port failover to port n+1

2014-06-20 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 204478491 - 08d0aca78 SPARK-1902 Silence stacktrace from logs when doing port failover to port n+1 Before: ``` 14/06/08 23:58:23 WARN AbstractLifeCycle: FAILED SelectChannelConnector@0.0.0.0:4040: java.net.BindException: Address already

git commit: [SPARK-2061] Made splits deprecated in JavaRDDLike

2014-06-20 Thread pwendell
Repository: spark Updated Branches: refs/heads/master a67864249 - 010c460d6 [SPARK-2061] Made splits deprecated in JavaRDDLike The jira for the issue can be found at: https://issues.apache.org/jira/browse/SPARK-2061 Most of spark has used over to consistently using `partitions` instead of

git commit: HOTFIX: Fix missing MIMA ignore

2014-06-21 Thread pwendell
Repository: spark Updated Branches: refs/heads/master ec935abce - 0a432d6a0 HOTFIX: Fix missing MIMA ignore Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/0a432d6a Tree:

git commit: SPARK-2034. KafkaInputDStream doesn't close resources and may prevent JVM shutdown

2014-06-22 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 58b32f347 - 476581e8c SPARK-2034. KafkaInputDStream doesn't close resources and may prevent JVM shutdown Tobias noted today on the mailing list: I am trying to use Spark Streaming with Kafka, which works like a charm – except

git commit: SPARK-2034. KafkaInputDStream doesn't close resources and may prevent JVM shutdown

2014-06-22 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 4881fc62d - 64316af5a SPARK-2034. KafkaInputDStream doesn't close resources and may prevent JVM shutdown Tobias noted today on the mailing list: I am trying to use Spark Streaming with Kafka, which works like a charm –

git commit: SPARK-2241: quote command line args in ec2 script

2014-06-22 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 67bffd3c7 - dedd70903 SPARK-2241: quote command line args in ec2 script To preserve quoted command line args (in case options have space in them). Author: Ori Kremer ori.kre...@gmail.com Closes #1169 from

git commit: [SPARK-1768] History server enhancements.

2014-06-23 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 6dc6722a6 - 21ddd7d1e [SPARK-1768] History server enhancements. Two improvements to the history server: - Separate the HTTP handling from history fetching, so that it's easy to add new backends later (thinking about SPARK-1537 in the

git commit: [SPARK-2264][SQL] Fix failing CachedTableSuite

2014-06-24 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 1978a9033 - a162c9b33 [SPARK-2264][SQL] Fix failing CachedTableSuite Author: Michael Armbrust mich...@databricks.com Closes #1201 from marmbrus/fixCacheTests and squashes the following commits: 9d87ed1 [Michael Armbrust] Use analyzer

git commit: [SPARK-1112, 2156] Bootstrap to fetch the driver's Spark properties.

2014-06-24 Thread pwendell
executor's spark properties. This should be better than #1124. @pwendell Are there spark properties that might be different on the driver and on the executors? Author: Xiangrui Meng m...@databricks.com Closes #1132 from mengxr/akka-bootstrap and squashes the following commits: 77ff32d

git commit: Autodetect JAVA_HOME on RPM-based systems

2014-06-24 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 133495d82 - 54055fb2b Autodetect JAVA_HOME on RPM-based systems Author: Matthew Farrellee m...@redhat.com Closes #1185 from mattf/master-1 and squashes the following commits: 42150fc [Matthew Farrellee] Autodetect JAVA_HOME on RPM-based

git commit: SPARK-2248: spark.default.parallelism does not apply in local mode

2014-06-24 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 2714968e1 - b6b44853c SPARK-2248: spark.default.parallelism does not apply in local mode Author: witgo wi...@qq.com Closes #1194 from witgo/SPARK-2248 and squashes the following commits: 6ac950b [witgo] spark.default.parallelism does not

git commit: [SPARK-2258 / 2266] Fix a few worker UI bugs

2014-06-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 5603e4c47 - 9aa603296 [SPARK-2258 / 2266] Fix a few worker UI bugs **SPARK-2258.** Worker UI displays zombie processes if the executor throws an exception before a process is launched. This is because we only inform the Worker of the

git commit: [SPARK-2258 / 2266] Fix a few worker UI bugs

2014-06-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 731a788eb - bb0b1645d [SPARK-2258 / 2266] Fix a few worker UI bugs **SPARK-2258.** Worker UI displays zombie processes if the executor throws an exception before a process is launched. This is because we only inform the Worker of the

git commit: [SPARK-2270] Kryo cannot serialize results returned by asJavaIterable

2014-06-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 bb0b1645d - 15fd9f2bb [SPARK-2270] Kryo cannot serialize results returned by asJavaIterable and thus groupBy/cogroup are broken in Java APIs when Kryo is used). @pwendell this should be merged into 1.0.1. Thanks @sorenmacbeth

git commit: [SPARK-2270] Kryo cannot serialize results returned by asJavaIterable

2014-06-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 9aa603296 - 7ff2c754f [SPARK-2270] Kryo cannot serialize results returned by asJavaIterable and thus groupBy/cogroup are broken in Java APIs when Kryo is used). @pwendell this should be merged into 1.0.1. Thanks @sorenmacbeth

git commit: [SPARK-2204] Launch tasks on the proper executors in mesos fine-grained mode

2014-06-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 7ff2c754f - 1132e472e [SPARK-2204] Launch tasks on the proper executors in mesos fine-grained mode The scheduler for Mesos in fine-grained mode launches tasks on the wrong executors. `MesosSchedulerBackend.resourceOffers(SchedulerDriver,

git commit: [SPARK-1912] fix compress memory issue during reduce

2014-06-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-0.9 ef8501d33 - 950981971 [SPARK-1912] fix compress memory issue during reduce When we need to read a compressed block, we will first create a compress stream instance(LZF or Snappy) and use it to wrap that block. Let's say a reducer task

git commit: [SPARK-1749] Job cancellation when SchedulerBackend does not implement killTask

2014-06-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 7f196b009 - b88a59a66 [SPARK-1749] Job cancellation when SchedulerBackend does not implement killTask This is a fixed up version of #686 (cc @markhamstra @pwendell). The last commit (the only one I authored) reflects the changes I made

git commit: [SPARK-1749] Job cancellation when SchedulerBackend does not implement killTask

2014-06-25 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 5869f8bf1 - 92b012502 [SPARK-1749] Job cancellation when SchedulerBackend does not implement killTask This is a fixed up version of #686 (cc @markhamstra @pwendell). The last commit (the only one I authored) reflects the changes I

<    1   2   3   4   5   6   7   8   9   10   >