git commit: SPARK-1426: Make MLlib work with NumPy versions older than 1.7

2014-04-15 Thread matei
Repository: spark Updated Branches: refs/heads/master c99bcb7fe - df3609179 SPARK-1426: Make MLlib work with NumPy versions older than 1.7 Currently it requires NumPy 1.7 due to using the copyto method (http://docs.scipy.org/doc/numpy/reference/generated/numpy.copyto.html) for extracting

git commit: SPARK-1501: Ensure assertions in Graph.apply are asserted.

2014-04-15 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.0 1491b2a0d - 692dd6936 SPARK-1501: Ensure assertions in Graph.apply are asserted. The Graph.apply test in GraphSuite had some assertions in a closure in a graph transformation. As a consequence, these assertions never actually executed.

git commit: [SPARK-1157][MLlib] L-BFGS Optimizer based on Breeze's implementation.

2014-04-15 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 692dd6936 - 5812472c1 [SPARK-1157][MLlib] L-BFGS Optimizer based on Breeze's implementation. This PR uses Breeze's L-BFGS implement, and Breeze dependency has already been introduced by Xiangrui's sparse input format work in

[2/2] git commit: Decision Tree documentation for MLlib programming guide

2014-04-15 Thread pwendell
Decision Tree documentation for MLlib programming guide Added documentation for user to use the decision tree algorithms for classification and regression in Spark 1.0 release. Apart from a general review, I need specific input on the following: * I had to move a lot of the existing

git commit: SPARK-1455: Better isolation for unit tests.

2014-04-15 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 194ed067b - 110e825aa SPARK-1455: Better isolation for unit tests. This is a simple first step towards avoiding running the Hive tests whenever possible. Author: Patrick Wendell pwend...@gmail.com Closes #420 from

git commit: [FIX] update sbt-idea to version 1.6.0

2014-04-15 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-1.0 110e825aa - 33d6e37cd [FIX] update sbt-idea to version 1.6.0 I saw `No scala-library*.jar in Scala compiler library` error in IDEA. It seems upgrading `sbt-idea` to 1.6.0 fixed the problem. Author: Xiangrui Meng m...@databricks.com

git commit: [FIX] update sbt-idea to version 1.6.0

2014-04-15 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 5aaf9836f - 8517911ef [FIX] update sbt-idea to version 1.6.0 I saw `No scala-library*.jar in Scala compiler library` error in IDEA. It seems upgrading `sbt-idea` to 1.6.0 fixed the problem. Author: Xiangrui Meng m...@databricks.com

[1/2] [WIP] SPARK-1430: Support sparse data in Python MLlib

2014-04-15 Thread pwendell
Repository: spark Updated Branches: refs/heads/master 8517911ef - 63ca581d9 http://git-wip-us.apache.org/repos/asf/spark/blob/63ca581d/python/pyspark/mllib/tests.py -- diff --git a/python/pyspark/mllib/tests.py

git commit: [SQL] SPARK-1424 Generalize insertIntoTable functions on SchemaRDDs

2014-04-15 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-1.0 95647fad1 - e5130d978 [SQL] SPARK-1424 Generalize insertIntoTable functions on SchemaRDDs This makes it possible to create tables and insert into them using the DSL and SQL for the scala and java apis. Author: Michael Armbrust