[GitHub] spark pull request: [WIP] [SPARK-3996]: Shade Jetty in Spark deliv...

2015-01-28 Thread vanzin
Github user vanzin commented on the pull request: https://github.com/apache/spark/pull/4252#issuecomment-71927775 Hey @pwendell, Just remembered that there's an extra bit that needs to be taken care of when shading. `core/pom.xml` has a `copy-dependencies` task to copy

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23728761 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PIClustering.scala --- @@ -0,0 +1,433 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23728918 --- Diff: mllib/src/test/resources/log4j.mllib.properties --- @@ -0,0 +1,41 @@ +# --- End diff -- OK --- If your project is set up for it,

[GitHub] spark pull request: [SPARK-1405] [mllib] Latent Dirichlet Allocati...

2015-01-28 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4047#discussion_r23729109 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/LDA.scala --- @@ -0,0 +1,472 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] spark pull request: [SPARK-4964] [Streaming] Exactly-once semantic...

2015-01-28 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/3798#discussion_r23729113 --- Diff: external/kafka/src/main/scala/org/apache/spark/streaming/kafka/KafkaUtils.scala --- @@ -144,4 +150,116 @@ object KafkaUtils {

[GitHub] spark pull request: [SPARK-5311][core] Corrected EventLoggingListe...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4120#issuecomment-71929366 [Test build #26259 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26259/consoleFull) for PR 4120 at commit

[GitHub] spark pull request: [SPARK-5341] Use maven coordinates as dependen...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4215#issuecomment-71929299 [Test build #26258 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26258/consoleFull) for PR 4215 at commit

[GitHub] spark pull request: [SPARK-4969][STREAMING][PYTHON] Add binaryReco...

2015-01-28 Thread freeman-lab
Github user freeman-lab commented on the pull request: https://github.com/apache/spark/pull/3803#issuecomment-71929328 Great thanks @JoshRosen will finish this up ASAP! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If

[GitHub] spark pull request: Spark 3883: SSL support for HttpServer and Akk...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3571#issuecomment-71929324 [Test build #26260 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26260/consoleFull) for PR 3571 at commit

[GitHub] spark pull request: [SPARK-5176] The thrift server does not suppor...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4137#issuecomment-71933280 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-5176] The thrift server does not suppor...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4137#issuecomment-71933270 [Test build #26249 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26249/consoleFull) for PR 4137 at commit

[GitHub] spark pull request: [SPARK-1405] [mllib] Latent Dirichlet Allocati...

2015-01-28 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/4047#issuecomment-71934257 @EntilZha I'll see about merging my testing branch back into this now, though it's not quite ready yet b/c of issues with checkpointing:

[GitHub] spark pull request: [SPARK-5097][SQL] Address DataFrame code revie...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4241#issuecomment-71934459 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-5097][SQL] Address DataFrame code revie...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4241#issuecomment-71934451 [Test build #26251 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26251/consoleFull) for PR 4241 at commit

[GitHub] spark pull request: [SPARK-5341] Use maven coordinates as dependen...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4215#issuecomment-71935867 [Test build #26255 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26255/consoleFull) for PR 4215 at commit

[GitHub] spark pull request: [SPARK-4989][CORE] backport for branch-1.1 cat...

2015-01-28 Thread liyezhang556520
Github user liyezhang556520 closed the pull request at: https://github.com/apache/spark/pull/3970 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark pull request: [SPARK-5430] move treeReduce and treeAggregate...

2015-01-28 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/4228#discussion_r23740428 --- Diff: core/src/main/scala/org/apache/spark/rdd/RDD.scala --- @@ -901,6 +901,38 @@ abstract class RDD[T: ClassTag]( } /** + *

[GitHub] spark pull request: [SPARK-4859][Core][Streaming] Refactor LiveLis...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4006#issuecomment-71955607 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4859][Core][Streaming] Refactor LiveLis...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4006#issuecomment-71955606 [Test build #26279 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26279/consoleFull) for PR 4006 at commit

[GitHub] spark pull request: [SPARK-4859][Core][Streaming] Refactor LiveLis...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4006#issuecomment-71955528 [Test build #26279 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26279/consoleFull) for PR 4006 at commit

[GitHub] spark pull request: [SPARK-5212][SQL] Add support of schema-less, ...

2015-01-28 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/4014#issuecomment-71956530 Can you explain in the PR what is schema-less delimiter? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If

[GitHub] spark pull request: [SPARK-4859][Core][Streaming] Refactor LiveLis...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4006#issuecomment-71956401 [Test build #26280 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26280/consoleFull) for PR 4006 at commit

[GitHub] spark pull request: [SPARK-5430] move treeReduce and treeAggregate...

2015-01-28 Thread sryza
Github user sryza commented on the pull request: https://github.com/apache/spark/pull/4228#issuecomment-71957158 @mengxr makes sense --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request: [SPARK-4964] [Streaming] Exactly-once semantic...

2015-01-28 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/3798#discussion_r23743753 --- Diff: external/kafka/src/main/scala/org/apache/spark/rdd/kafka/KafkaCluster.scala --- @@ -0,0 +1,320 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-5253] [ML] LinearRegression with L1/L2 ...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4259#issuecomment-71963887 [Test build #26283 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26283/consoleFull) for PR 4259 at commit

[GitHub] spark pull request: [SQL] Various DataFrame DSL update.

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4260#issuecomment-71963926 [Test build #26285 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26285/consoleFull) for PR 4260 at commit

[GitHub] spark pull request: A JDBC driver for Spark SQL.

2015-01-28 Thread tmyklebu
GitHub user tmyklebu opened a pull request: https://github.com/apache/spark/pull/4261 A JDBC driver for Spark SQL. This pull request contains a Spark SQL data source that can pull data from, and can put data into, a JDBC database. I have tested both read and write support

[GitHub] spark pull request: A JDBC driver for Spark SQL.

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4261#issuecomment-71964607 [Test build #26286 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26286/consoleFull) for PR 4261 at commit

[GitHub] spark pull request: A JDBC driver for Spark SQL.

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4261#issuecomment-71964610 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: A JDBC driver for Spark SQL.

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4261#issuecomment-71964600 [Test build #26286 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26286/consoleFull) for PR 4261 at commit

[GitHub] spark pull request: [SQL] Various DataFrame DSL update.

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4260#issuecomment-71963928 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-5470][Core]use defaultClassLoader to lo...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4258#issuecomment-71964178 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71965181 [Test build #26282 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26282/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71965185 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SQL] Various DataFrame DSL update.

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4260#issuecomment-71965288 [Test build #26287 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26287/consoleFull) for PR 4260 at commit

[GitHub] spark pull request: [SPARK-5472][SQL] A JDBC driver for Spark SQL.

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4261#issuecomment-71966019 [Test build #26288 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26288/consoleFull) for PR 4261 at commit

[GitHub] spark pull request: [SPARK-3872][streaming], Rewrite the test for ...

2015-01-28 Thread ScrapCodes
Github user ScrapCodes commented on the pull request: https://github.com/apache/spark/pull/4248#issuecomment-71966684 Jenkins, retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark pull request: [SPARK-3872][streaming], Rewrite the test for ...

2015-01-28 Thread ScrapCodes
Github user ScrapCodes commented on the pull request: https://github.com/apache/spark/pull/4248#issuecomment-7199 I was confused here whether to use manual clock or system clock ? Since system clock made the test considerably faster. P.S. Test failures seems to be

[GitHub] spark pull request: [SPARK-3872][streaming], Rewrite the test for ...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4248#issuecomment-71966791 [Test build #26289 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26289/consoleFull) for PR 4248 at commit

[GitHub] spark pull request: [SPARK-5262] [SPARK-5244] [SQL] add coalesce i...

2015-01-28 Thread adrian-wang
Github user adrian-wang commented on the pull request: https://github.com/apache/spark/pull/4057#issuecomment-71967465 @yhuai Thanks for your comments! I'll modify this PR accordingly. --- If your project is set up for it, you can reply to this email and have your reply appear on

[GitHub] spark pull request: [SPARK-5278][SQL] complete the check of ambigu...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4068#issuecomment-71967503 [Test build #26290 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26290/consoleFull) for PR 4068 at commit

[GitHub] spark pull request: [SPARK-4964] [Streaming] Exactly-once semantic...

2015-01-28 Thread koeninger
Github user koeninger commented on a diff in the pull request: https://github.com/apache/spark/pull/3798#discussion_r23746235 --- Diff: external/kafka/src/main/scala/org/apache/spark/rdd/kafka/OffsetRange.scala --- @@ -0,0 +1,70 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-4964] [Streaming] Exactly-once semantic...

2015-01-28 Thread koeninger
Github user koeninger commented on a diff in the pull request: https://github.com/apache/spark/pull/3798#discussion_r23746262 --- Diff: external/kafka/src/main/scala/org/apache/spark/streaming/kafka/KafkaUtils.scala --- @@ -144,4 +150,116 @@ object KafkaUtils {

[GitHub] spark pull request: [SPARK-3977] Conversion methods for BlockMatri...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4256#issuecomment-71953603 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-3977] Conversion methods for BlockMatri...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4256#issuecomment-71953597 [Test build #26270 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26270/consoleFull) for PR 4256 at commit

[GitHub] spark pull request: SPARK-3290 [GRAPHX] No unpersist callls in SVD...

2015-01-28 Thread srowen
Github user srowen commented on the pull request: https://github.com/apache/spark/pull/4234#issuecomment-71953824 OK, like add some `count()` calls to make it persist the RDDs? or just treat this as wont-fix? --- If your project is set up for it, you can reply to this email and have

[GitHub] spark pull request: Add a config option to print DAG.

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4257#issuecomment-71953996 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: SPARK-3290 [GRAPHX] No unpersist callls in SVD...

2015-01-28 Thread ankurdave
Github user ankurdave commented on the pull request: https://github.com/apache/spark/pull/4234#issuecomment-71953568 @srowen Caching here is necessary since we do reuse some uncached datasets, so I don't think there's a good solution for uncaching in this kind of situation.

[GitHub] spark pull request: [SPARK-3977] Conversion methods for BlockMatri...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4256#issuecomment-71953573 [Test build #26276 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26276/consoleFull) for PR 4256 at commit

[GitHub] spark pull request: Add a config option to print DAG.

2015-01-28 Thread KaiXinXiaoLei
GitHub user KaiXinXiaoLei opened a pull request: https://github.com/apache/spark/pull/4257 Add a config option to print DAG. Add a config option spark.rddDebug.enable to check whether to print DAG info. When spark.rddDebug.enable is true, it will print information about DAG in the

[GitHub] spark pull request: [SPARK-5341] Use maven coordinates as dependen...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4215#issuecomment-71955530 [Test build #26277 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26277/consoleFull) for PR 4215 at commit

[GitHub] spark pull request: [SPARK-3977] Conversion methods for BlockMatri...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4256#issuecomment-71955527 [Test build #26278 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26278/consoleFull) for PR 4256 at commit

[GitHub] spark pull request: Replace mapToPair - reduceByKey with flatMap ...

2015-01-28 Thread tdas
Github user tdas commented on the pull request: https://github.com/apache/spark/pull/4198#issuecomment-71957049 Thanks @smarthi ! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request: [SPARK-5212][SQL] Add support of schema-less, ...

2015-01-28 Thread viirya
Github user viirya commented on the pull request: https://github.com/apache/spark/pull/4014#issuecomment-71957621 [Schema-less Map-reduce Scripts](https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Transform#LanguageManualTransform-Schema-lessMap-reduceScripts) is a

[GitHub] spark pull request: [MLLIB] SPARK-4846: throw a RuntimeException a...

2015-01-28 Thread jinntrance
Github user jinntrance commented on a diff in the pull request: https://github.com/apache/spark/pull/4247#discussion_r23744568 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala --- @@ -290,111 +290,126 @@ class Word2Vec extends Serializable with Logging

[GitHub] spark pull request: [SPARK-4859][Core][Streaming] Refactor LiveLis...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4006#issuecomment-71962481 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4859][Core][Streaming] Refactor LiveLis...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4006#issuecomment-71962472 [Test build #26280 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26280/consoleFull) for PR 4006 at commit

[GitHub] spark pull request: [SPARK-5429][SQL] Use javaXML plan serializati...

2015-01-28 Thread marmbrus
Github user marmbrus commented on the pull request: https://github.com/apache/spark/pull/4223#issuecomment-71963599 The root cause is likely you aren't using the right version of hive. Where right means our hacked version that has a version of kryo that doesn't conflict with spark.

[GitHub] spark pull request: [SQL] Various DataFrame DSL update.

2015-01-28 Thread rxin
GitHub user rxin opened a pull request: https://github.com/apache/spark/pull/4260 [SQL] Various DataFrame DSL update. 1. Added foreach, foreachPartition, flatMap to DataFrame. 2. Added col() in dsl. 3. Support renaming columns in toDataFrame. 4. Support type inference on

[GitHub] spark pull request: [SPARK-5253] [ML] LinearRegression with L1/L2 ...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4259#issuecomment-71961789 [Test build #26284 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26284/consoleFull) for PR 4259 at commit

[GitHub] spark pull request: [SPARK-2309][MLlib] Multinomial Logistic Regre...

2015-01-28 Thread dbtsai
Github user dbtsai commented on a diff in the pull request: https://github.com/apache/spark/pull/3833#discussion_r23743161 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/optimization/Gradient.scala --- @@ -55,24 +57,79 @@ abstract class Gradient extends Serializable {

[GitHub] spark pull request: [SPARK-3977] Conversion methods for BlockMatri...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4256#issuecomment-71959946 [Test build #26276 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26276/consoleFull) for PR 4256 at commit

[GitHub] spark pull request: [SPARK-5445][SQL] Made DataFrame dsl usable in...

2015-01-28 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/4241 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request: [SPARK-5470][Core]use defaultClassLoader to lo...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4258#issuecomment-71958741 [Test build #26281 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26281/consoleFull) for PR 4258 at commit

[GitHub] spark pull request: [SPARK-3977] Conversion methods for BlockMatri...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4256#issuecomment-71961728 [Test build #26278 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26278/consoleFull) for PR 4256 at commit

[GitHub] spark pull request: [SPARK-5253] [ML] LinearRegression with L1/L2 ...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4259#issuecomment-71960804 [Test build #26283 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26283/consoleFull) for PR 4259 at commit

[GitHub] spark pull request: [SPARK-5445][SQL] Made DataFrame dsl usable in...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4241#issuecomment-71961604 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SQL] Various DataFrame DSL update.

2015-01-28 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/4260#issuecomment-71963705 cc @mengxr for mllib changes --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-5429][SQL] Use javaXML plan serializati...

2015-01-28 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/4223#issuecomment-71963496 I am going to let @marmbrus to comment on this one ... --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If

[GitHub] spark pull request: [SPARK-5253] [ML] LinearRegression with L1/L2 ...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4259#issuecomment-71961315 [Test build #26284 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26284/consoleFull) for PR 4259 at commit

[GitHub] spark pull request: [SPARK-5445][SQL] Made DataFrame dsl usable in...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4241#issuecomment-71961595 [Test build #26275 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26275/consoleFull) for PR 4241 at commit

[GitHub] spark pull request: [SPARK-5470][Core]use defaultClassLoader to lo...

2015-01-28 Thread zzcclp
Github user zzcclp commented on the pull request: https://github.com/apache/spark/pull/4258#issuecomment-71959063 LGTM, I test this with spark-branch 1.2, it works correctly. :) --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as

[GitHub] spark pull request: [SPARK-5470][Core]use defaultClassLoader to lo...

2015-01-28 Thread lianhuiwang
GitHub user lianhuiwang opened a pull request: https://github.com/apache/spark/pull/4258 [SPARK-5470][Core]use defaultClassLoader to load classes of classesToRegister in KryoSeria... Now KryoSerializer load classes of classesToRegister at the time of its initialization. when we

[GitHub] spark pull request: [SPARK-5253] [ML] LinearRegression with L1/L2 ...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4259#issuecomment-71961793 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-3977] Conversion methods for BlockMatri...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4256#issuecomment-71959952 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4964] [Streaming] Exactly-once semantic...

2015-01-28 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/3798#discussion_r23742746 --- Diff: external/kafka/src/main/scala/org/apache/spark/streaming/kafka/KafkaUtils.scala --- @@ -144,4 +150,116 @@ object KafkaUtils {

[GitHub] spark pull request: [SQL] Various DataFrame DSL update.

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4260#issuecomment-71963853 [Test build #26285 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26285/consoleFull) for PR 4260 at commit

[GitHub] spark pull request: [SPARK-5253] [ML] LinearRegression with L1/L2 ...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4259#issuecomment-71963892 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-5445][SQL] Made DataFrame dsl usable in...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4241#issuecomment-71960238 [Test build #26274 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26274/consoleFull) for PR 4241 at commit

[GitHub] spark pull request: [SPARK-5445][SQL] Made DataFrame dsl usable in...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4241#issuecomment-71960243 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-3977] Conversion methods for BlockMatri...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4256#issuecomment-71961733 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4964] [Streaming] Exactly-once semantic...

2015-01-28 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/3798#discussion_r23743523 --- Diff: external/kafka/src/test/scala/org/apache/spark/streaming/kafka/KafkaStreamSuite.scala --- @@ -130,7 +130,7 @@ abstract class KafkaStreamSuiteBase

[GitHub] spark pull request: [SPARK-4964] [Streaming] Exactly-once semantic...

2015-01-28 Thread tdas
Github user tdas commented on the pull request: https://github.com/apache/spark/pull/3798#issuecomment-71960076 At a high level these are the high level design issues that are still pending - Packaging - I think all the classes should be org.apache.spark.streaming.kafka

[GitHub] spark pull request: [SPARK-5253] [ML] LinearRegression with L1/L2 ...

2015-01-28 Thread dbtsai
GitHub user dbtsai opened a pull request: https://github.com/apache/spark/pull/4259 [SPARK-5253] [ML] LinearRegression with L1/L2 (ElasticNet) using OWLQN You can merge this pull request into a Git repository by running: $ git pull https://github.com/AlpineNow/spark lir

[GitHub] spark pull request: [SPARK-5341] Use maven coordinates as dependen...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4215#issuecomment-71960138 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71959963 [Test build #26282 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26282/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-2309][MLlib] Multinomial Logistic Regre...

2015-01-28 Thread dbtsai
Github user dbtsai commented on a diff in the pull request: https://github.com/apache/spark/pull/3833#discussion_r23743197 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/optimization/Gradient.scala --- @@ -55,24 +57,79 @@ abstract class Gradient extends Serializable {

[GitHub] spark pull request: [SPARK-4879] Use the Spark driver to authorize...

2015-01-28 Thread mccheah
Github user mccheah commented on a diff in the pull request: https://github.com/apache/spark/pull/4155#discussion_r23743012 --- Diff: core/src/main/scala/org/apache/spark/scheduler/OutputCommitCoordinator.scala --- @@ -0,0 +1,258 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-5470][Core]use defaultClassLoader to lo...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4258#issuecomment-71964173 [Test build #26281 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26281/consoleFull) for PR 4258 at commit

[GitHub] spark pull request: [SPARK-4879] Use the Spark driver to authorize...

2015-01-28 Thread JoshRosen
Github user JoshRosen commented on a diff in the pull request: https://github.com/apache/spark/pull/4155#discussion_r23742870 --- Diff: core/src/main/scala/org/apache/spark/scheduler/OutputCommitCoordinator.scala --- @@ -0,0 +1,258 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-5429][SQL] Use javaXML plan serializati...

2015-01-28 Thread viirya
Github user viirya commented on the pull request: https://github.com/apache/spark/pull/4223#issuecomment-71958179 @rxin This is a quick fix for the problem when generating Hive golden answer files on Hive 0.13.1. Do not know if it is useful for others. It would be great if you can

[GitHub] spark pull request: [SPARK-4964] [Streaming] Exactly-once semantic...

2015-01-28 Thread tdas
Github user tdas commented on the pull request: https://github.com/apache/spark/pull/3798#issuecomment-71960172 Jenkins, this is okay to test. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-5341] Use maven coordinates as dependen...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4215#issuecomment-71960123 [Test build #26277 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26277/consoleFull) for PR 4215 at commit

[GitHub] spark pull request: [SPARK-5445][SQL] Made DataFrame dsl usable in...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4241#issuecomment-71951941 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-5445][SQL] Made DataFrame dsl usable in...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4241#issuecomment-71952029 [Test build #26274 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26274/consoleFull) for PR 4241 at commit

[GitHub] spark pull request: add support for zipping a sequence of RDDs

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/2429#issuecomment-71951971 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-5445][SQL] Made DataFrame dsl usable in...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4241#issuecomment-71951933 [Test build #26264 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26264/consoleFull) for PR 4241 at commit

[GitHub] spark pull request: [SPARK-5430] move treeReduce and treeAggregate...

2015-01-28 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/4228 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request: [SPARK-5430] move treeReduce and treeAggregate...

2015-01-28 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/4228#issuecomment-71952501 Merged into master. @sryza I kept the current `treeAggregate`/`treeReduce` names. Please check my inline comments about overloading `aggregate`. --- If your project is

[GitHub] spark pull request: [SPARK-5445][SQL] Made DataFrame dsl usable in...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4241#issuecomment-71952548 [Test build #26275 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26275/consoleFull) for PR 4241 at commit

<    1   2   3   4   5   6   7   8   >