spark git commit: [SPARK-9236] [CORE] Make defaultPartitioner not reuse a parent RDD's partitioner if it has 0 partitions

2015-07-24 Thread srowen
Repository: spark Updated Branches: refs/heads/master 6a7e537f3 -> 6cd28cc21 [SPARK-9236] [CORE] Make defaultPartitioner not reuse a parent RDD's partitioner if it has 0 partitions See also comments on https://issues.apache.org/jira/browse/SPARK-9236 Author: François Garillot Closes #7616

spark git commit: [SPARK-9236] [CORE] Make defaultPartitioner not reuse a parent RDD's partitioner if it has 0 partitions

2015-07-24 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.3 b20a9abc3 -> f941482b0 [SPARK-9236] [CORE] Make defaultPartitioner not reuse a parent RDD's partitioner if it has 0 partitions See also comments on https://issues.apache.org/jira/browse/SPARK-9236 Author: François Garillot Closes #

spark git commit: [SPARK-9236] [CORE] Make defaultPartitioner not reuse a parent RDD's partitioner if it has 0 partitions

2015-07-24 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.4 07f778978 -> a0b047bf2 [SPARK-9236] [CORE] Make defaultPartitioner not reuse a parent RDD's partitioner if it has 0 partitions See also comments on https://issues.apache.org/jira/browse/SPARK-9236 Author: François Garillot Closes #

spark git commit: [SPARK-9250] Make change-scala-version more helpful w.r.t. valid Scala versions

2015-07-24 Thread srowen
Repository: spark Updated Branches: refs/heads/master 846cf4628 -> 428cde5d1 [SPARK-9250] Make change-scala-version more helpful w.r.t. valid Scala versions Author: François Garillot Closes #7595 from huitseeker/issue/SPARK-9250 and squashes the following commits: 80a0218 [François Garil

spark git commit: [SPARK-9304] [BUILD] Improve backwards compatibility of SPARK-8401

2015-07-25 Thread srowen
Repository: spark Updated Branches: refs/heads/master 215713e19 -> c980e20cf [SPARK-9304] [BUILD] Improve backwards compatibility of SPARK-8401 Add back change-version-to-X.sh scripts, as wrappers for new script, for backwards compatibility Author: Sean Owen Closes #7639 from srowen/SP

spark git commit: [SPARK-9337] [MLLIB] Add an ut for Word2Vec to verify the empty vocabulary check

2015-07-26 Thread srowen
Repository: spark Updated Branches: refs/heads/master 4a01bfc2a -> b79bf1df6 [SPARK-9337] [MLLIB] Add an ut for Word2Vec to verify the empty vocabulary check jira: https://issues.apache.org/jira/browse/SPARK-9337 Word2Vec should throw exception when vocabulary is empty Author: Yuhao Yang C

spark git commit: [EC2] Cosmetic fix for usage of spark-ec2 --ebs-vol-num option

2015-07-28 Thread srowen
Repository: spark Updated Branches: refs/heads/master 15724fac5 -> ac8c549e2 [EC2] Cosmetic fix for usage of spark-ec2 --ebs-vol-num option The last line of the usage seems ugly. ``` $ spark-ec2 --help --ebs-vol-num=EBS_VOL_NUM Number of EBS volumes to attach to eac

spark git commit: [SPARK-9267] [CORE] Retire stringify(Partial)?Value from Accumulators

2015-07-30 Thread srowen
Repository: spark Updated Branches: refs/heads/master c0cc0eaec -> 7bbf02f0b [SPARK-9267] [CORE] Retire stringify(Partial)?Value from Accumulators cc srowen Author: François Garillot Closes #7678 from huitseeker/master and squashes the following commits: 5e99f57 [François Garil

spark git commit: [SPARK-9446] Clear Active SparkContext in stop() method

2015-07-31 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.4 3d6a9214e -> 5ad9f950c [SPARK-9446] Clear Active SparkContext in stop() method In thread 'stopped SparkContext remaining active' on mailing list, Andres observed the following in driver log: ``` 15/07/29 15:17:09 WARN YarnSchedulerBack

spark git commit: [SPARK-9446] Clear Active SparkContext in stop() method

2015-07-31 Thread srowen
Repository: spark Updated Branches: refs/heads/master 04a49edfd -> 27ae851ce [SPARK-9446] Clear Active SparkContext in stop() method In thread 'stopped SparkContext remaining active' on mailing list, Andres observed the following in driver log: ``` 15/07/29 15:17:09 WARN YarnSchedulerBackend$

spark git commit: [SPARK-9202] capping maximum number of executor&driver information kept in Worker

2015-07-31 Thread srowen
Repository: spark Updated Branches: refs/heads/master a8340fa7d -> c0686668a [SPARK-9202] capping maximum number of executor&driver information kept in Worker https://issues.apache.org/jira/browse/SPARK-9202 Author: CodingCat Closes #7714 from CodingCat/SPARK-9202 and squashes the followin

spark git commit: [SPARK-9507] [BUILD] Remove dependency reduced POM hack now that shade plugin is updated

2015-07-31 Thread srowen
ease? Author: Sean Owen Closes #7826 from srowen/SPARK-9507 and squashes the following commits: e0b0fd2 [Sean Owen] Update to shade plugin 2.4.1, which removes the need for the dependency-reduced-POM workaround and the 'release' profile. Fix management of shade plugin version so ch

spark git commit: [SPARK-9507] [BUILD] Remove dependency reduced POM hack now that shade plugin is updated

2015-07-31 Thread srowen
ease? Author: Sean Owen Closes #7826 from srowen/SPARK-9507 and squashes the following commits: e0b0fd2 [Sean Owen] Update to shade plugin 2.4.1, which removes the need for the dependency-reduced-POM workaround and the 'release' profile. Fix management of shade plugin version so ch

spark git commit: [SPARK-9507] [BUILD] Remove dependency reduced POM hack now that shade plugin is updated

2015-07-31 Thread srowen
ease? Author: Sean Owen Closes #7826 from srowen/SPARK-9507 and squashes the following commits: e0b0fd2 [Sean Owen] Update to shade plugin 2.4.1, which removes the need for the dependency-reduced-POM workaround and the 'release' profile. Fix management of shade plugin version so ch

spark git commit: [SPARK-9521] [BUILD] Require Maven 3.3.3+ in the build

2015-08-02 Thread srowen
Repository: spark Updated Branches: refs/heads/master 16b928c54 -> 9d1c02526 [SPARK-9521] [BUILD] Require Maven 3.3.3+ in the build Enforce Maven 3.3.3+ in the build. (Also update the scala compiler plugin while we're at it.) Author: Sean Owen Closes #7852 from srowen/SPARK-

spark git commit: [SPARK-9149] [ML] [EXAMPLES] Add an example of spark.ml KMeans

2015-08-02 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9d1c02526 -> 244016a95 [SPARK-9149] [ML] [EXAMPLES] Add an example of spark.ml KMeans [SPARK-9149] Add an example of spark.ml KMeans - ASF JIRA https://issues.apache.org/jira/browse/SPARK-9149 jkbradley Should we support other data format

spark git commit: [SPARK-9254] [BUILD] [HOTFIX] sbt-launch-lib.bash should support HTTP/HTTPS redirection

2015-08-02 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.3 047a61365 -> cc5f711c0 [SPARK-9254] [BUILD] [HOTFIX] sbt-launch-lib.bash should support HTTP/HTTPS redirection Target file(s) can be hosted on CDN nodes. HTTP/HTTPS redirection must be supported to download these files. Author: Cheng

spark git commit: [SPARK-7563] (backport for 1.3) OutputCommitCoordinator.stop() should only run on the driver

2015-08-03 Thread srowen
or: Sean Owen Closes #7865 from srowen/SPARK-7563-1.3 and squashes the following commits: f4479bc [Sean Owen] Backport of "[SPARK-7563] OutputCommitCoordinator.stop() should only run on the driver" for 1.3 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-

spark git commit: [SPARK-9521] [DOCS] Addendum. Require Maven 3.3.3+ in the build

2015-08-03 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 ebe42b98c -> 1f7dbcd6f [SPARK-9521] [DOCS] Addendum. Require Maven 3.3.3+ in the build Follow on for #7852: Building Spark doc needs to refer to new Maven requirement too Author: Sean Owen Closes #7905 from srowen/SPARK-9521.2

spark git commit: [SPARK-9534] [BUILD] Enable javac lint for scalac parity; fix a lot of build warnings, 1.5.0 edition

2015-08-04 Thread srowen
the process. I'll explain several of the changes inline in comments. Author: Sean Owen Closes #7862 from srowen/SPARK-9534 and squashes the following commits: ea51618 [Sean Owen] Enable most javac lint warnings; fix a lot of build warnings. In a few cases, touch up surrounding code in the

spark git commit: [SPARK-9534] [BUILD] Enable javac lint for scalac parity; fix a lot of build warnings, 1.5.0 edition

2015-08-04 Thread srowen
in the process. I'll explain several of the changes inline in comments. Author: Sean Owen Closes #7862 from srowen/SPARK-9534 and squashes the following commits: ea51618 [Sean Owen] Enable most javac lint warnings; fix a lot of build warnings. In a few cases, touch up surrounding cod

spark git commit: [SPARK-8064] [BUILD] Follow-up. Undo change from SPARK-9507 that was accidentally reverted

2015-08-04 Thread srowen
Repository: spark Updated Branches: refs/heads/master 76d74090d -> b211cbc73 [SPARK-8064] [BUILD] Follow-up. Undo change from SPARK-9507 that was accidentally reverted This PR removes the dependency reduced POM hack brought back by #7191 Author: tedyu Closes #7919 from tedyu/master and squ

spark git commit: [SPARK-8064] [BUILD] Follow-up. Undo change from SPARK-9507 that was accidentally reverted

2015-08-04 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 5ae675360 -> bd9b75213 [SPARK-8064] [BUILD] Follow-up. Undo change from SPARK-9507 that was accidentally reverted This PR removes the dependency reduced POM hack brought back by #7191 Author: tedyu Closes #7919 from tedyu/master and

spark git commit: [SPARK-9607] [SPARK-9608] fix zinc-port handling in build/mvn

2015-08-05 Thread srowen
Repository: spark Updated Branches: refs/heads/master 26b06f1c4 -> e27a8c4cb [SPARK-9607] [SPARK-9608] fix zinc-port handling in build/mvn - pass `$ZINC_PORT` to zinc status/shutdown commands - fix path check that sets `$ZINC_INSTALL_FLAG`, which was incorrectly causing zinc to be shutdown a

spark git commit: [SPARK-9607] [SPARK-9608] fix zinc-port handling in build/mvn

2015-08-05 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 93c166a91 -> 350006497 [SPARK-9607] [SPARK-9608] fix zinc-port handling in build/mvn - pass `$ZINC_PORT` to zinc status/shutdown commands - fix path check that sets `$ZINC_INSTALL_FLAG`, which was incorrectly causing zinc to be shutdo

spark git commit: [SPARK-9607] [SPARK-9608] fix zinc-port handling in build/mvn

2015-08-05 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.3 cd5d1be6e -> 384793dff [SPARK-9607] [SPARK-9608] fix zinc-port handling in build/mvn - pass `$ZINC_PORT` to zinc status/shutdown commands - fix path check that sets `$ZINC_INSTALL_FLAG`, which was incorrectly causing zinc to be shutdo

spark git commit: [SPARK-9607] [SPARK-9608] fix zinc-port handling in build/mvn

2015-08-05 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.4 dea04bf84 -> 369510c5a [SPARK-9607] [SPARK-9608] fix zinc-port handling in build/mvn - pass `$ZINC_PORT` to zinc status/shutdown commands - fix path check that sets `$ZINC_INSTALL_FLAG`, which was incorrectly causing zinc to be shutdo

spark git commit: [SPARK-9641] [DOCS] spark.shuffle.service.port is not documented

2015-08-06 Thread srowen
hor: Sean Owen Closes #7991 from srowen/SPARK-9641 and squashes the following commits: 3bb946e [Sean Owen] Add link to docs for setup and config of external shuffle service 2302e01 [Sean Owen] Document spark.shuffle.service.{enabled,port} (cherry picked from com

spark git commit: [SPARK-9641] [DOCS] spark.shuffle.service.port is not documented

2015-08-06 Thread srowen
ean Owen Closes #7991 from srowen/SPARK-9641 and squashes the following commits: 3bb946e [Sean Owen] Add link to docs for setup and config of external shuffle service 2302e01 [Sean Owen] Document spark.shuffle.service.{enabled,port} Project: http://git-wip-us.apache.org/repos/asf/spark/repo Com

spark git commit: [SPARK-9633] [BUILD] SBT download locations outdated; need an update

2015-08-06 Thread srowen
ull/7792 Author: Sean Owen Closes #7956 from srowen/SPARK-9633 and squashes the following commits: caa40bd [Sean Owen] Remove 2 defunct SBT download URLs and replace with the 1 known download URL. Also, use https. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-

spark git commit: [SPARK-9633] [BUILD] SBT download locations outdated; need an update

2015-08-06 Thread srowen
ark/pull/7792 Author: Sean Owen Closes #7956 from srowen/SPARK-9633 and squashes the following commits: caa40bd [Sean Owen] Remove 2 defunct SBT download URLs and replace with the 1 known download URL. Also, use https. (cherry picked from commit 681e3024b6c2fcb54b42180d94d3ba3eed52a2d4) Signed-

spark git commit: [SPARK-9633] [BUILD] SBT download locations outdated; need an update

2015-08-06 Thread srowen
ark/pull/7792 Author: Sean Owen Closes #7956 from srowen/SPARK-9633 and squashes the following commits: caa40bd [Sean Owen] Remove 2 defunct SBT download URLs and replace with the 1 known download URL. Also, use https. (cherry picked from commit 681e3024b6c2fcb54b42180d94d3ba3eed52a2d4) Signed-

spark git commit: [SPARK-9633] [BUILD] SBT download locations outdated; need an update

2015-08-06 Thread srowen
ark/pull/7792 Author: Sean Owen Closes #7956 from srowen/SPARK-9633 and squashes the following commits: caa40bd [Sean Owen] Remove 2 defunct SBT download URLs and replace with the 1 known download URL. Also, use https. (cherry picked from commit 681e3024b6c2fcb54b42180d94d3ba3eed52a2d4) Signed-

spark git commit: [SPARK-9737] [YARN] Add the suggested configuration when required executor memory is above the max threshold of this cluster on YARN mode

2015-08-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master 68ccc6e18 -> 86fa4ba6d [SPARK-9737] [YARN] Add the suggested configuration when required executor memory is above the max threshold of this cluster on YARN mode Author: Yadong Qi Closes #8028 from watermen/SPARK-9737 and squashes the fol

spark git commit: [SPARK-9923] [CORE] ShuffleMapStage.numAvailableOutputs should be an Int instead of Long

2015-08-14 Thread srowen
Repository: spark Updated Branches: refs/heads/master 34d610be8 -> 57c2d0880 [SPARK-9923] [CORE] ShuffleMapStage.numAvailableOutputs should be an Int instead of Long Modified type of ShuffleMapStage.numAvailableOutputs from Long to Int Author: Neelesh Srinivas Salian Closes #8183 from nssa

spark git commit: [SPARK-9980] [BUILD] Fix SBT publishLocal error due to invalid characters in doc

2015-08-15 Thread srowen
Repository: spark Updated Branches: refs/heads/master 7c1e56825 -> a85fb6c07 [SPARK-9980] [BUILD] Fix SBT publishLocal error due to invalid characters in doc Tiny modification to a few comments ```sbt publishLocal``` work again. Author: Herman van Hovell Closes #8209 from hvanhovell/SPARK-9

spark git commit: [SPARK-9980] [BUILD] Fix SBT publishLocal error due to invalid characters in doc

2015-08-15 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 d97af68af -> 1a6f0af9f [SPARK-9980] [BUILD] Fix SBT publishLocal error due to invalid characters in doc Tiny modification to a few comments ```sbt publishLocal``` work again. Author: Herman van Hovell Closes #8209 from hvanhovell/SPA

spark git commit: [SPARK-9199] [CORE] Upgrade Tachyon version from 0.7.0 -> 0.7.1.

2015-08-17 Thread srowen
Repository: spark Updated Branches: refs/heads/master 26e760581 -> 3ff81ad2d [SPARK-9199] [CORE] Upgrade Tachyon version from 0.7.0 -> 0.7.1. Updates the tachyon-client version to the latest release. The main difference between 0.7.0 and 0.7.1 on the client side is to support running Tachyon

spark git commit: [MINOR] fix the comments in IndexShuffleBlockResolver

2015-08-18 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 40b89c38a -> 42a0b4890 [MINOR] fix the comments in IndexShuffleBlockResolver it might be a typo introduced at the first moment or some leftover after some renaming.. the name of the method accessing the index file is called `getB

spark git commit: [MINOR] fix the comments in IndexShuffleBlockResolver

2015-08-18 Thread srowen
Repository: spark Updated Branches: refs/heads/master dd0614fd6 -> c34e9ff0e [MINOR] fix the comments in IndexShuffleBlockResolver it might be a typo introduced at the first moment or some leftover after some renaming.. the name of the method accessing the index file is called `getBlock

spark git commit: [SPARK-10070] [DOCS] Remove Guava dependencies in user guides

2015-08-19 Thread srowen
source code too? this method isn't useful in Java 7 and beyond. Author: Sean Owen Closes #8272 from srowen/SPARK-10070. (cherry picked from commit f141efeafb42b14b5fcfd9aa8c5275162042349f) Signed-off-by: Sean Owen Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: htt

spark git commit: [SPARK-10070] [DOCS] Remove Guava dependencies in user guides

2015-08-19 Thread srowen
code too? this method isn't useful in Java 7 and beyond. Author: Sean Owen Closes #8272 from srowen/SPARK-10070. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/f141efea Tree: http://git-wip-us.apache.org/repos/asf

spark git commit: [DOCS] [SQL] [PYSPARK] Fix typo in ntile function

2015-08-19 Thread srowen
Repository: spark Updated Branches: refs/heads/master f141efeaf -> 865a3df3d [DOCS] [SQL] [PYSPARK] Fix typo in ntile function Fix typo in ntile function. Author: Moussa Taifi Closes #8261 from moutai/patch-2. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-w

spark git commit: [DOCS] [SQL] [PYSPARK] Fix typo in ntile function

2015-08-19 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 561390dbc -> e56bcc638 [DOCS] [SQL] [PYSPARK] Fix typo in ntile function Fix typo in ntile function. Author: Moussa Taifi Closes #8261 from moutai/patch-2. (cherry picked from commit 865a3df3d578c0442c97d749c81f554b560da406) Signed-

spark git commit: [SPARK-9977] [DOCS] Update documentation for StringIndexer

2015-08-19 Thread srowen
Repository: spark Updated Branches: refs/heads/master 865a3df3d -> ba2a07e2b [SPARK-9977] [DOCS] Update documentation for StringIndexer By using `StringIndexer`, we can obtain indexed label on new column. So a following estimator should use this new column through pipeline if it wants to use

spark git commit: [SPARK-9977] [DOCS] Update documentation for StringIndexer

2015-08-19 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 e56bcc638 -> 5553f02be [SPARK-9977] [DOCS] Update documentation for StringIndexer By using `StringIndexer`, we can obtain indexed label on new column. So a following estimator should use this new column through pipeline if it wants to

spark git commit: [SPARK-8949] Print warnings when using preferred locations feature

2015-08-19 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 5553f02be -> 522b0b646 [SPARK-8949] Print warnings when using preferred locations feature Add warnings according to SPARK-8949 in `SparkContext` - warnings in scaladoc - log warnings when preferred locations feature is used through

spark git commit: [SPARK-8949] Print warnings when using preferred locations feature

2015-08-19 Thread srowen
Repository: spark Updated Branches: refs/heads/master ba2a07e2b -> 3d16a5450 [SPARK-8949] Print warnings when using preferred locations feature Add warnings according to SPARK-8949 in `SparkContext` - warnings in scaladoc - log warnings when preferred locations feature is used through `Sp

spark git commit: [SPARK-8889] [CORE] Fix for OOM for graph creation

2015-08-19 Thread srowen
Repository: spark Updated Branches: refs/heads/master 5b62bef8c -> f3391ff2b [SPARK-8889] [CORE] Fix for OOM for graph creation Fix for OOM for graph creation Author: Joshi Author: Rekha Joshi Closes #7602 from rekhajoshm/SPARK-8889. Project: http://git-wip-us.apache.org/repos/asf/spark/

spark git commit: [SPARK-15430][SQL] Fix potential ConcurrentModificationException for ListAccumulator

2016-05-22 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1ffa608ba -> 7920296bf [SPARK-15430][SQL] Fix potential ConcurrentModificationException for ListAccumulator ## What changes were proposed in this pull request? In `ListAccumulator` we create an unmodifiable view for underlying list. Howe

spark git commit: [SPARK-15430][SQL] Fix potential ConcurrentModificationException for ListAccumulator

2016-05-22 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 24ea16598 -> 831c7c085 [SPARK-15430][SQL] Fix potential ConcurrentModificationException for ListAccumulator ## What changes were proposed in this pull request? In `ListAccumulator` we create an unmodifiable view for underlying list.

spark git commit: [SPARK-15468][SQL] fix some typos

2016-05-22 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 831c7c085 -> d39eb4bc3 [SPARK-15468][SQL] fix some typos ## What changes were proposed in this pull request? Fix some typos while browsing the codes. ## How was this patch tested? None and obvious. Author: Bo Meng Author: bomeng

spark git commit: [SPARK-15468][SQL] fix some typos

2016-05-22 Thread srowen
Repository: spark Updated Branches: refs/heads/master 7920296bf -> 72288fd67 [SPARK-15468][SQL] fix some typos ## What changes were proposed in this pull request? Fix some typos while browsing the codes. ## How was this patch tested? None and obvious. Author: Bo Meng Author: bomeng Clos

spark git commit: [MINOR] More than 100 chars in line in SparkSubmitCommandBuilderSuite

2016-05-22 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 d39eb4bc3 -> 92c87f74d [MINOR] More than 100 chars in line in SparkSubmitCommandBuilderSuite ## What changes were proposed in this pull request? More than 100 chars in line. ## How was this patch tested? Author: Sandeep Singh Closes

spark git commit: [MINOR] More than 100 chars in line in SparkSubmitCommandBuilderSuite

2016-05-22 Thread srowen
Repository: spark Updated Branches: refs/heads/master 72288fd67 -> 3eff65f82 [MINOR] More than 100 chars in line in SparkSubmitCommandBuilderSuite ## What changes were proposed in this pull request? More than 100 chars in line. ## How was this patch tested? Author: Sandeep Singh Closes #13

spark git commit: Log warnings for numIterations * miniBatchFraction < 1.0

2016-05-25 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9c297df3d -> 589cce93c Log warnings for numIterations * miniBatchFraction < 1.0 ## What changes were proposed in this pull request? Add a warning log for the case that `numIterations * miniBatchFraction <1.0` during gradient descent. If t

spark git commit: Log warnings for numIterations * miniBatchFraction < 1.0

2016-05-25 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 f7158c482 -> 0064a4dcb Log warnings for numIterations * miniBatchFraction < 1.0 ## What changes were proposed in this pull request? Add a warning log for the case that `numIterations * miniBatchFraction <1.0` during gradient descent.

spark git commit: [SPARK-15523][ML][MLLIB] Update JPMML to 1.2.15

2016-05-26 Thread srowen
Repository: spark Updated Branches: refs/heads/master e451f7f0c -> 6d506c9ae [SPARK-15523][ML][MLLIB] Update JPMML to 1.2.15 ## What changes were proposed in this pull request? See https://issues.apache.org/jira/browse/SPARK-15523 This PR replaces PR #13293. It's isolated to a new branch, an

spark git commit: [SPARK-15523][ML][MLLIB] Update JPMML to 1.2.15

2016-05-26 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 c54a07348 -> 87374de43 [SPARK-15523][ML][MLLIB] Update JPMML to 1.2.15 ## What changes were proposed in this pull request? See https://issues.apache.org/jira/browse/SPARK-15523 This PR replaces PR #13293. It's isolated to a new branch

spark git commit: [SPARK-10722] RDDBlockId not found in driver-heartbeater

2016-05-26 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 9a18115a8 -> 5cc1e2cec [SPARK-10722] RDDBlockId not found in driver-heartbeater ## What changes were proposed in this pull request? To ensure that the deserialization of TaskMetrics uses a ClassLoader that knows about RDDBlockIds. The

spark git commit: [SPARK-15542][SPARKR] Make error message clear for script './R/install-dev.sh' when R is missing on Mac

2016-05-26 Thread srowen
Repository: spark Updated Branches: refs/heads/master 008a5377d -> 6ab973ec5 [SPARK-15542][SPARKR] Make error message clear for script './R/install-dev.sh' when R is missing on Mac https://issues.apache.org/jira/browse/SPARK-15542 ## What changes were proposed in this pull request? When run

spark git commit: [SPARK-15542][SPARKR] Make error message clear for script './R/install-dev.sh' when R is missing on Mac

2016-05-26 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 c99ad9215 -> 702755f92 [SPARK-15542][SPARKR] Make error message clear for script './R/install-dev.sh' when R is missing on Mac https://issues.apache.org/jira/browse/SPARK-15542 ## What changes were proposed in this pull request? When

spark git commit: [SPARK-15449][MLLIB][EXAMPLE] Wrong Data Format - Documentation Issue

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/master 4a2fb8b87 -> 5d4dafe8f [SPARK-15449][MLLIB][EXAMPLE] Wrong Data Format - Documentation Issue ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) In the MLLib naivebayes example, scala and pytho

spark git commit: [SPARK-15449][MLLIB][EXAMPLE] Wrong Data Format - Documentation Issue

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 80a40e8e2 -> 8467e2102 [SPARK-15449][MLLIB][EXAMPLE] Wrong Data Format - Documentation Issue ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) In the MLLib naivebayes example, scala and p

spark git commit: [SPARK-15562][ML] Delete temp directory after program exit in DataFrameExample

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 8467e2102 -> 6d82e0c1b [SPARK-15562][ML] Delete temp directory after program exit in DataFrameExample ## What changes were proposed in this pull request? Temp directory used to save records is not deleted after program exit in DataFram

spark git commit: [SPARK-15562][ML] Delete temp directory after program exit in DataFrameExample

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/master 5d4dafe8f -> 88c9c467a [SPARK-15562][ML] Delete temp directory after program exit in DataFrameExample ## What changes were proposed in this pull request? Temp directory used to save records is not deleted after program exit in DataFrameExa

spark git commit: [SPARK-15610][ML] update error message for k in pca

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 6d82e0c1b -> 3801fb4f3 [SPARK-15610][ML] update error message for k in pca ## What changes were proposed in this pull request? Fix the wrong bound of `k` in `PCA` `require(k <= sources.first().size, ...` -> `require(k < sources.first(

spark git commit: [SPARK-15610][ML] update error message for k in pca

2016-05-27 Thread srowen
Repository: spark Updated Branches: refs/heads/master 88c9c467a -> 9893dc975 [SPARK-15610][ML] update error message for k in pca ## What changes were proposed in this pull request? Fix the wrong bound of `k` in `PCA` `require(k <= sources.first().size, ...` -> `require(k < sources.first().si

spark git commit: [MINOR] Resolve a number of miscellaneous build warnings

2016-05-29 Thread srowen
s a large number of deprecation warnings, especially related to the Accumulator API. That will happen separately. ## How was this patch tested? Jenkins Author: Sean Owen Closes #13377 from srowen/BuildWarnings. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-

spark git commit: [MINOR] Resolve a number of miscellaneous build warnings

2016-05-29 Thread srowen
lly related to the Accumulator API. That will happen separately. Jenkins Author: Sean Owen Closes #13377 from srowen/BuildWarnings. (cherry picked from commit ce1572d16f03d383071bcc1f30ede551e8ded49f) Signed-off-by: Sean Owen Project: http://git-wip-us.apache.org/repos/asf/spark/repo Com

spark git commit: [SPARK-15645][STREAMING] Fix some typos of Streaming module

2016-05-30 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 6dae027a6 -> 7076b3707 [SPARK-15645][STREAMING] Fix some typos of Streaming module ## What changes were proposed in this pull request? No code change, just some typo fixing. ## How was this patch tested? Manually run project build wi

spark git commit: [SPARK-15645][STREAMING] Fix some typos of Streaming module

2016-05-30 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1360a6d63 -> 5728aa558 [SPARK-15645][STREAMING] Fix some typos of Streaming module ## What changes were proposed in this pull request? No code change, just some typo fixing. ## How was this patch tested? Manually run project build with t

spark git commit: [DOCS] fix example code issues in documentation

2016-05-30 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 7076b3707 -> f208a9dcd [DOCS] fix example code issues in documentation ## What changes were proposed in this pull request? Fixed broken java code examples in streaming documentation Attn: tdas Author: Matthew Wise Closes #13388 fro

spark git commit: [DOCS] fix example code issues in documentation

2016-05-30 Thread srowen
Repository: spark Updated Branches: refs/heads/master 5728aa558 -> 2d34183b2 [DOCS] fix example code issues in documentation ## What changes were proposed in this pull request? Fixed broken java code examples in streaming documentation Attn: tdas Author: Matthew Wise Closes #13388 from ma

spark git commit: [SPARK-15641] HistoryServer to not show invalid date for incomplete application

2016-05-31 Thread srowen
Repository: spark Updated Branches: refs/heads/master 675921040 -> 6878f3e2e [SPARK-15641] HistoryServer to not show invalid date for incomplete application ## What changes were proposed in this pull request? For incomplete applications in HistoryServer, the complete column will show "-" inst

spark git commit: [SPARK-15641] HistoryServer to not show invalid date for incomplete application

2016-05-31 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 2b35ce818 -> 3d00c7015 [SPARK-15641] HistoryServer to not show invalid date for incomplete application ## What changes were proposed in this pull request? For incomplete applications in HistoryServer, the complete column will show "-"

spark git commit: [SPARK-15528][SQL] Fix race condition in NumberConverter

2016-05-31 Thread srowen
Repository: spark Updated Branches: refs/heads/master 6878f3e2e -> 95db8a44f [SPARK-15528][SQL] Fix race condition in NumberConverter ## What changes were proposed in this pull request? A local variable in NumberConverter is wrongly shared between threads. This pr fixes the race condition. ##

spark git commit: [SPARK-15528][SQL] Fix race condition in NumberConverter

2016-05-31 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 3d00c7015 -> 2e3ead20c [SPARK-15528][SQL] Fix race condition in NumberConverter ## What changes were proposed in this pull request? A local variable in NumberConverter is wrongly shared between threads. This pr fixes the race condition.

spark git commit: [SPARK-15528][SQL] Fix race condition in NumberConverter

2016-05-31 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 c53c83ce8 -> ea84b3373 [SPARK-15528][SQL] Fix race condition in NumberConverter ## What changes were proposed in this pull request? A local variable in NumberConverter is wrongly shared between threads. This pr fixes the race condition.

spark git commit: [CORE][MINOR][DOC] Removing incorrect scaladoc

2016-05-31 Thread srowen
Repository: spark Updated Branches: refs/heads/master 57adb77e6 -> 695470429 [CORE][MINOR][DOC] Removing incorrect scaladoc ## What changes were proposed in this pull request? I don't think the method will ever throw an exception so removing a false comment. Sorry srowen and rxin ag

spark git commit: [CORE][MINOR][DOC] Removing incorrect scaladoc

2016-05-31 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 ef8d16ec4 -> 1f4c4def9 [CORE][MINOR][DOC] Removing incorrect scaladoc ## What changes were proposed in this pull request? I don't think the method will ever throw an exception so removing a false comment. Sorry srowen and rx

spark git commit: [SPARK-15601][CORE] CircularBuffer's toString() to print only the contents written if buffer isn't full

2016-05-31 Thread srowen
Repository: spark Updated Branches: refs/heads/master 04f925ede -> ac38bdc75 [SPARK-15601][CORE] CircularBuffer's toString() to print only the contents written if buffer isn't full ## What changes were proposed in this pull request? 1. The class allocated 4x space than needed as it was using

spark git commit: [SPARK-15601][CORE] CircularBuffer's toString() to print only the contents written if buffer isn't full

2016-05-31 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 db96f398b -> d34c0fc10 [SPARK-15601][CORE] CircularBuffer's toString() to print only the contents written if buffer isn't full ## What changes were proposed in this pull request? 1. The class allocated 4x space than needed as it was u

spark git commit: [SPARK-15601][CORE] CircularBuffer's toString() to print only the contents written if buffer isn't full

2016-05-31 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 ea84b3373 -> 714f4d78a [SPARK-15601][CORE] CircularBuffer's toString() to print only the contents written if buffer isn't full 1. The class allocated 4x space than needed as it was using `Int` to store the `Byte` values 2. If Circula

spark git commit: [SPARK-15659][SQL] Ensure FileSystem is gotten from path

2016-06-01 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1dd925644 -> e4ce1bc4f [SPARK-15659][SQL] Ensure FileSystem is gotten from path ## What changes were proposed in this pull request? Currently `spark.sql.warehouse.dir` is pointed to local dir by default, which will throw exception when HA

spark git commit: [SPARK-15659][SQL] Ensure FileSystem is gotten from path

2016-06-01 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 0ade44cc4 -> 29a1cdfc4 [SPARK-15659][SQL] Ensure FileSystem is gotten from path ## What changes were proposed in this pull request? Currently `spark.sql.warehouse.dir` is pointed to local dir by default, which will throw exception whe

spark git commit: [SPARK-15664][MLLIB] Replace FileSystem.get(conf) with path.getFileSystem(conf) when removing CheckpointFile in MLlib

2016-06-01 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 29a1cdfc4 -> 8fb125bdf [SPARK-15664][MLLIB] Replace FileSystem.get(conf) with path.getFileSystem(conf) when removing CheckpointFile in MLlib ## What changes were proposed in this pull request? if sparkContext.set CheckpointDir to anoth

spark git commit: [SPARK-15664][MLLIB] Replace FileSystem.get(conf) with path.getFileSystem(conf) when removing CheckpointFile in MLlib

2016-06-01 Thread srowen
Repository: spark Updated Branches: refs/heads/master e4ce1bc4f -> 6563d72b1 [SPARK-15664][MLLIB] Replace FileSystem.get(conf) with path.getFileSystem(conf) when removing CheckpointFile in MLlib ## What changes were proposed in this pull request? if sparkContext.set CheckpointDir to another D

spark git commit: [SPARK-15208][WIP][CORE][STREAMING][DOCS] Update Spark examples with AccumulatorV2

2016-06-02 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 22eb08369 -> 8f98eef69 [SPARK-15208][WIP][CORE][STREAMING][DOCS] Update Spark examples with AccumulatorV2 ## What changes were proposed in this pull request? The patch updates the codes & docs in the example module as well as the rela

spark git commit: [SPARK-15208][WIP][CORE][STREAMING][DOCS] Update Spark examples with AccumulatorV2

2016-06-02 Thread srowen
Repository: spark Updated Branches: refs/heads/master 5eea33230 -> a0eec8e8f [SPARK-15208][WIP][CORE][STREAMING][DOCS] Update Spark examples with AccumulatorV2 ## What changes were proposed in this pull request? The patch updates the codes & docs in the example module as well as the related

spark git commit: [SPARK-15605][ML][EXAMPLES] Fix broken ML JavaDeveloperApiExample.

2016-06-02 Thread srowen
Repository: spark Updated Branches: refs/heads/master a0eec8e8f -> 4fe7c7bd1 [SPARK-15605][ML][EXAMPLES] Fix broken ML JavaDeveloperApiExample. ## What changes were proposed in this pull request? See [SPARK-15605](https://issues.apache.org/jira/browse/SPARK-15605) for the detail of this bug.

spark git commit: [SPARK-15605][ML][EXAMPLES] Fix broken ML JavaDeveloperApiExample.

2016-06-02 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 8f98eef69 -> b78352fc3 [SPARK-15605][ML][EXAMPLES] Fix broken ML JavaDeveloperApiExample. ## What changes were proposed in this pull request? See [SPARK-15605](https://issues.apache.org/jira/browse/SPARK-15605) for the detail of this b

spark git commit: [SPARK-15709][SQL] Prevent `freqItems` from raising `UnsupportedOperationException: empty.min`

2016-06-02 Thread srowen
Repository: spark Updated Branches: refs/heads/master 4fe7c7bd1 -> b85d18f3b [SPARK-15709][SQL] Prevent `freqItems` from raising `UnsupportedOperationException: empty.min` ## What changes were proposed in this pull request? Currently, `freqItems` raises `UnsupportedOperationException` on `em

spark git commit: [SPARK-15709][SQL] Prevent `freqItems` from raising `UnsupportedOperationException: empty.min`

2016-06-02 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 b78352fc3 -> a5cb7fc6d [SPARK-15709][SQL] Prevent `freqItems` from raising `UnsupportedOperationException: empty.min` ## What changes were proposed in this pull request? Currently, `freqItems` raises `UnsupportedOperationException` on

spark git commit: [SPARK-15322][SQL][FOLLOWUP] Use the new long accumulator for old int accumulators.

2016-06-02 Thread srowen
Repository: spark Updated Branches: refs/heads/master b85d18f3b -> 252417fa2 [SPARK-15322][SQL][FOLLOWUP] Use the new long accumulator for old int accumulators. ## What changes were proposed in this pull request? This PR corrects the remaining cases for using old accumulators. This does not

spark git commit: [SPARK-15322][SQL][FOLLOWUP] Use the new long accumulator for old int accumulators.

2016-06-02 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 a5cb7fc6d -> fe639adea [SPARK-15322][SQL][FOLLOWUP] Use the new long accumulator for old int accumulators. ## What changes were proposed in this pull request? This PR corrects the remaining cases for using old accumulators. This does

spark git commit: [SPARK-14204][SQL] register driverClass rather than user-specified class

2016-06-02 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 714f4d78a -> 0a13e4c07 [SPARK-14204][SQL] register driverClass rather than user-specified class This pull request fixes an issue in which cluster-mode executors fail to properly register a JDBC driver when the driver is provided in a j

spark git commit: [SPARK-15617][ML][DOC] Clarify that fMeasure in MulticlassMetrics is "micro" f1_score

2016-06-04 Thread srowen
Repository: spark Updated Branches: refs/heads/master 2ca563cc4 -> 2099e05f9 [SPARK-15617][ML][DOC] Clarify that fMeasure in MulticlassMetrics is "micro" f1_score ## What changes were proposed in this pull request? 1, del precision,recall in `ml.MulticlassClassificationEvaluator` 2, update u

spark git commit: [SPARK-15617][ML][DOC] Clarify that fMeasure in MulticlassMetrics is "micro" f1_score

2016-06-04 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 cf8782116 -> 729730159 [SPARK-15617][ML][DOC] Clarify that fMeasure in MulticlassMetrics is "micro" f1_score ## What changes were proposed in this pull request? 1, del precision,recall in `ml.MulticlassClassificationEvaluator` 2, upda

spark git commit: [MINOR][BUILD] Add modernizr MIT license; specify "2014 and onwards" in license copyright

2016-06-04 Thread srowen
ense. Specify "2014 and onwards" in copyright statement. ## How was this patch tested? (none required) Author: Sean Owen Closes #13510 from srowen/ModernizrLicense. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/68

  1   2   3   4   5   6   7   8   9   10   >