git commit: [SPARK-4108][SQL] Fixed usage of deprecated in sql/catalyst/types/datatypes

2014-10-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master ad3bd0dff - d31517a3c [SPARK-4108][SQL] Fixed usage of deprecated in sql/catalyst/types/datatypes Fixed usage of deprecated in sql/catalyst/types/datatypes to have versio...n parameter Author: Anant anant.a...@gmail.com Closes #2970

git commit: [SPARK-4143] [SQL] Move inner class DeferredObjectAdapter to top level

2014-10-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d31517a3c - 58a6077e5 [SPARK-4143] [SQL] Move inner class DeferredObjectAdapter to top level The class DeferredObjectAdapter is an inner class of HiveGenericUdf, which may add overhead to closure serialization and deserialization. Move it to top level.

git commit: SPARK-3837. Warn when YARN kills containers for exceeding memory limits

2014-10-31 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 58a6077e5 - acd4ac7c9 SPARK-3837. Warn when YARN kills containers for exceeding memory limits I triggered the issue and verified the message gets printed on a pseudo-distributed cluster. Author: Sandy Ryza sa...@cloudera.com Closes

git commit: [SPARK-4016] Allow user to show/hide UI metrics.

2014-10-31 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master acd4ac7c9 - adb6415c1 [SPARK-4016] Allow user to show/hide UI metrics. This commit adds a set of checkboxes to the stage detail page that the user can use to show additional task metrics, including the GC time, result serialization time,

git commit: [SPARK-3826][SQL]enable hive-thriftserver to support hive-0.13.1

2014-10-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master adb6415c1 - 7c41d1357 [SPARK-3826][SQL]enable hive-thriftserver to support hive-0.13.1 In #2241, hive-thriftserver is not enabled. This patch enables hive-thriftserver to support hive-0.13.1 by using a shim layer; refer to #2241. 1 A

git commit: [SPARK-4077][SQL] Spark SQL return wrong values for valid string timestamp values

2014-10-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 7c41d1357 - fa712b309 [SPARK-4077][SQL] Spark SQL return wrong values for valid string timestamp values In org.apache.hadoop.hive.serde2.io.TimestampWritable.set, if the next entry is null, then the current timestamp object is reset.

git commit: [SPARK-4154][SQL] Query does not work if it has not between in Spark SQL and HQL

2014-10-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master fa712b309 - ea465af12 [SPARK-4154][SQL] Query does not work if it has not between in Spark SQL and HQL A query that contains `not between` does not work, for example: SELECT * FROM src where key not between 10 and 20 Author: ravipesala
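
For readers unfamiliar with the predicate, the following is a minimal sketch of what `NOT BETWEEN` is expected to return. It uses Python's built-in `sqlite3` module as a stand-in engine, which is an assumption made purely for illustration; it does not reproduce the Spark SQL or HQL parser change in this commit. The helper name `compare_not_between` and the sample keys are hypothetical.

```python
import sqlite3


def compare_not_between(keys, low, high):
    """Run NOT BETWEEN and its expanded form against an in-memory table.

    SQLite is only a stand-in here (an assumption for illustration);
    the commit itself concerns Spark SQL and HQL.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute('CREATE TABLE src ("key" INTEGER)')
        conn.executemany('INSERT INTO src ("key") VALUES (?)', [(k,) for k in keys])

        # BETWEEN is inclusive on both ends, so NOT BETWEEN excludes low and high.
        with_not_between = [
            row[0]
            for row in conn.execute(
                'SELECT "key" FROM src WHERE "key" NOT BETWEEN ? AND ? ORDER BY "key"',
                (low, high),
            )
        ]

        # The same filter written with plain comparisons.
        with_comparisons = [
            row[0]
            for row in conn.execute(
                'SELECT "key" FROM src WHERE "key" < ? OR "key" > ? ORDER BY "key"',
                (low, high),
            )
        ]
        return with_not_between, with_comparisons
    finally:
        conn.close()
```

Calling `compare_not_between([5, 10, 15, 20, 25], 10, 20)` should yield the same list from both queries, with the boundary values 10 and 20 filtered out.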

git commit: [SPARK-2220][SQL] Fixes remaining Hive commands

2014-10-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master ea465af12 - 23468e7e9 [SPARK-2220][SQL] Fixes remaining Hive commands This PR adds support for the `ADD FILE` Hive command, and removes `ShellCommand` and `SourceCommand`. The reason is described in [this SPARK-2220

git commit: [SPARK-4141] Hide Accumulators column on stage page when no accumulators exist

2014-10-31 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master 23468e7e9 - a68ecf328 [SPARK-4141] Hide Accumulators column on stage page when no accumulators exist WebUI Author: Mark Mims mark.m...@canonical.com This patch had conflicts when merged, resolved by Committer: Josh Rosen

git commit: [SPARK-4150][PySpark] return self in rdd.setName

2014-10-31 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master a68ecf328 - f1e7361f6 [SPARK-4150][PySpark] return self in rdd.setName Then we can do `rdd.setName('abc').cache().count()`. Author: Xiangrui Meng m...@databricks.com Closes #3011 from mengxr/rdd-setname and squashes the following
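
The chaining pattern mentioned above can be sketched without a Spark installation. The class below is a hypothetical stand-in, not PySpark's `RDD`; the names `NamedDataset` and `set_name` are assumptions for illustration. It only shows why returning `self` from a setter makes a call like `rdd.setName('abc').cache().count()` possible.

```python
class NamedDataset:
    """Hypothetical stand-in for an RDD-like object (not the PySpark class)."""

    def __init__(self, items):
        self.items = list(items)
        self.name = None
        self.cached = False

    def set_name(self, name):
        # Returning self (instead of None) is what allows further calls
        # to be chained onto the result of the setter.
        self.name = name
        return self

    def cache(self):
        # Mark the dataset as cached and keep the chain going.
        self.cached = True
        return self

    def count(self):
        # Terminal operation: returns a plain value and ends the chain.
        return len(self.items)
```

With this shape, `NamedDataset([1, 2, 3]).set_name('abc').cache().count()` evaluates in one expression. If `set_name` returned `None`, the `.cache()` call would raise an `AttributeError`.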

git commit: [SPARK-3870] EOL character enforcement

2014-10-31 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master f1e7361f6 - 55ab77707 [SPARK-3870] EOL character enforcement We have shell scripts and Windows batch files, so we should enforce proper EOL character. Author: Kousuke Saruta saru...@oss.nttdata.co.jp Closes #2726 from

git commit: [HOT FIX] Yarn stable tests don't compile

2014-10-31 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 55ab77707 - 087e31a7a [HOT FIX] Yarn stable tests don't compile This is caused by this commit: acd4ac7c9a503445e27739708cf36e19119b8ddc Author: andrewor14 and...@databricks.com Author: Andrew Or and...@databricks.com Closes #3041 from

git commit: [MLLIB] SPARK-2329 Add multi-label evaluation metrics

2014-10-31 Thread meng
Repository: spark Updated Branches: refs/heads/master 23f73f525 - 62d01d255 [MLLIB] SPARK-2329 Add multi-label evaluation metrics Implementation of various multi-label classification measures, including: Hamming-loss, strict and default Accuracy, macro-averaged Precision, Recall and
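
As a rough illustration of two of the measures named above, here is a minimal sketch of Hamming loss and strict (exact-match) accuracy over label sets. It follows the common textbook definitions and is an assumption about what the measures mean, not a copy of the MLlib implementation; the function names and the `num_labels` parameter are hypothetical.

```python
def hamming_loss(predictions, labels, num_labels):
    """Average fraction of label slots that are wrong per document.

    predictions, labels: equal-length sequences of label sets.
    num_labels: total number of distinct labels (assumed known up front).
    """
    # The symmetric difference counts labels predicted but not true,
    # plus labels true but not predicted.
    mismatches = sum(len(set(p) ^ set(t)) for p, t in zip(predictions, labels))
    return mismatches / (len(predictions) * num_labels)


def strict_accuracy(predictions, labels):
    """Fraction of documents whose predicted label set matches exactly."""
    exact = sum(1 for p, t in zip(predictions, labels) if set(p) == set(t))
    return exact / len(predictions)
```

For instance, with predictions `[{0, 1}, {2}]`, true labels `[{0}, {2}]`, and three possible labels, one label slot out of six is wrong and one of the two documents matches exactly.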

[2/2] git commit: [MLLIB] SPARK-1547: Add Gradient Boosting to MLlib

2014-10-31 Thread meng
[MLLIB] SPARK-1547: Add Gradient Boosting to MLlib Given the popular demand for gradient boosting and AdaBoost in MLlib, I am creating a WIP branch for early feedback on gradient boosting with AdaBoost to follow soon after this PR is accepted. This is based on work done along with hirakendu

[1/2] [MLLIB] SPARK-1547: Add Gradient Boosting to MLlib

2014-10-31 Thread meng
Repository: spark Updated Branches: refs/heads/master e07fb6a41 - 860219551 http://git-wip-us.apache.org/repos/asf/spark/blob/86021955/mllib/src/test/scala/org/apache/spark/mllib/tree/RandomForestSuite.scala -- diff --git