[GitHub] spark pull request: [SPARK-5124][Core] A standard RPC interface an...

2015-03-10 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/4588#discussion_r26100551 --- Diff: core/src/main/scala/org/apache/spark/rpc/RpcEnv.scala --- @@ -0,0 +1,370 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[GitHub] spark pull request: [SPARK-5205][Streaming]:Inconsistent behaviour...

2015-03-10 Thread uncleGen
Github user uncleGen commented on the pull request: https://github.com/apache/spark/pull/4135#issuecomment-78000658 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request: [SPARK-5205][Streaming]:Inconsistent behaviour...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4135#issuecomment-78000862 [Test build #28423 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28423/consoleFull) for PR 4135 at commit

[GitHub] spark pull request: [SQL] [WIP] Blacklists several Hive 0.13.1 spe...

2015-03-10 Thread liancheng
Github user liancheng commented on the pull request: https://github.com/apache/spark/pull/4851#issuecomment-78001073 The Hive shim layer shirinking part in #4107 is not that essential, but hard to review and touches lots of files. However, the Hive 0.12.0 tests part is essential.

[GitHub] spark pull request: [SPARK-5124][Core] A standard RPC interface an...

2015-03-10 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/4588#discussion_r26108854 --- Diff: core/src/main/scala/org/apache/spark/rpc/RpcEnv.scala --- @@ -0,0 +1,370 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[GitHub] spark pull request: [SPARK-6244] Implement VectorSpace to easy cre...

2015-03-10 Thread catap
GitHub user catap opened a pull request: https://github.com/apache/spark/pull/4962 [SPARK-6244] Implement VectorSpace to easy create a complicated feature vector VectorSpace is wrapper what implement three operation: - concat -- concat all vectors to single vector - sum

[GitHub] spark pull request: [SPARK-2199] [mllib] topic modeling

2015-03-10 Thread renchengchang
Github user renchengchang commented on the pull request: https://github.com/apache/spark/pull/1269#issuecomment-78023208 @akopich how to assign document id? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-2199] [mllib] topic modeling

2015-03-10 Thread renchengchang
Github user renchengchang commented on the pull request: https://github.com/apache/spark/pull/1269#issuecomment-78023104 how to assign document id? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark pull request: [SPARK-6211][Streaming] Add Python Kafka API u...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4961#issuecomment-78027047 [Test build #28427 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28427/consoleFull) for PR 4961 at commit

[GitHub] spark pull request: Minor doc: Remove the extra blank line in data...

2015-03-10 Thread srowen
Github user srowen commented on the pull request: https://github.com/apache/spark/pull/4955#issuecomment-78028700 Obviously, LGTM, go ahead if you want to make this change. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well.

[GitHub] spark pull request: [SPARK-5124][Core] A standard RPC interface an...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4588#issuecomment-78034996 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-5124][Core] A standard RPC interface an...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4588#issuecomment-78034988 [Test build #28428 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28428/consoleFull) for PR 4588 at commit

[GitHub] spark pull request: [SPARK-6244] Implement VectorSpace to easy cre...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4962#issuecomment-78040426 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-6177][MLlib]Add note in LDA example to ...

2015-03-10 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/4899 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request: Spark 5659 Flaky test: o.a.s.streaming.Receive...

2015-03-10 Thread kevinpetersavage
Github user kevinpetersavage commented on a diff in the pull request: https://github.com/apache/spark/pull/4957#discussion_r26113197 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/ReceiverSuite.scala --- @@ -161,50 +161,27 @@ class ReceiverSuite extends

[GitHub] spark pull request: [SPARK-5124][Core] A standard RPC interface an...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4588#issuecomment-78024140 [Test build #28428 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28428/consoleFull) for PR 4588 at commit

[GitHub] spark pull request: Spark 5659 Flaky test: o.a.s.streaming.Receive...

2015-03-10 Thread kevinpetersavage
Github user kevinpetersavage commented on a diff in the pull request: https://github.com/apache/spark/pull/4957#discussion_r26113001 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/ReceiverSuite.scala --- @@ -161,50 +161,27 @@ class ReceiverSuite extends

[GitHub] spark pull request: [SPARK-5124][Core] A standard RPC interface an...

2015-03-10 Thread zsxwing
Github user zsxwing commented on a diff in the pull request: https://github.com/apache/spark/pull/4588#discussion_r26109964 --- Diff: core/src/main/scala/org/apache/spark/rpc/RpcEnv.scala --- @@ -0,0 +1,370 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[GitHub] spark pull request: [SPARK-6087][CORE] Provide actionable exceptio...

2015-03-10 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/4947 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request: [SPARK-6186] [EC2] Make Tachyon version config...

2015-03-10 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/4901 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request: [SPARK-6191] [EC2] Generalize ability to downl...

2015-03-10 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/4919 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request: [SPARK-6211][Streaming] Add Python Kafka API u...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4961#issuecomment-78027054 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-6211][Streaming] Add Python Kafka API u...

2015-03-10 Thread jerryshao
Github user jerryshao commented on the pull request: https://github.com/apache/spark/pull/4961#issuecomment-78060991 Jenkins, retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark pull request: [SPARK-6211][Streaming] Add Python Kafka API u...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4961#issuecomment-78061865 [Test build #28431 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28431/consoleFull) for PR 4961 at commit

[GitHub] spark pull request: [SPARK-6051][Streaming] Add ZooKeeper offest p...

2015-03-10 Thread koeninger
Github user koeninger commented on a diff in the pull request: https://github.com/apache/spark/pull/4805#discussion_r26120488 --- Diff: external/kafka/src/main/scala/org/apache/spark/streaming/kafka/DirectKafkaInputDStream.scala --- @@ -118,6 +123,7 @@ class

[GitHub] spark pull request: [SPARK-2033] Automatically cleanup checkpoint

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/855#issuecomment-78057557 [Test build #28430 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28430/consoleFull) for PR 855 at commit

[GitHub] spark pull request: [SPARK-2199] [mllib] topic modeling

2015-03-10 Thread akopich
Github user akopich commented on the pull request: https://github.com/apache/spark/pull/1269#issuecomment-78050367 @renchengchang 1. Hi. 2. Don't use code from this PR. Use either LDA (which is merged with mllib) or https://github.com/akopich/dplsa which is a further

[GitHub] spark pull request: [SPARK-6051][Streaming] Add ZooKeeper offest p...

2015-03-10 Thread koeninger
Github user koeninger commented on the pull request: https://github.com/apache/spark/pull/4805#issuecomment-78053440 I am now confused about what the purpose of this PR is. The jira seemed to indicate that the problem was several third-party offset monitoring tools fail to monitor

[GitHub] spark pull request: [SPARK-5929] Pyspark: Register a pip requireme...

2015-03-10 Thread buckheroux
Github user buckheroux commented on the pull request: https://github.com/apache/spark/pull/4897#issuecomment-78065029 Great, will get to that this week and add tests. Do you think bundling importlib with pyspark is reasonable? Or is finding another way to track down the local

[GitHub] spark pull request: [SPARK-6051][Streaming] Add ZooKeeper offest p...

2015-03-10 Thread koeninger
Github user koeninger commented on a diff in the pull request: https://github.com/apache/spark/pull/4805#discussion_r26120607 --- Diff: external/kafka/src/main/scala/org/apache/spark/streaming/kafka/DirectKafkaInputDStream.scala --- @@ -84,6 +83,11 @@ class

[GitHub] spark pull request: [SPARK-6211][Streaming] Add Python Kafka API u...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4961#issuecomment-78058217 [Test build #28429 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28429/consoleFull) for PR 4961 at commit

[GitHub] spark pull request: [SPARK-6211][Streaming] Add Python Kafka API u...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4961#issuecomment-78046862 [Test build #28429 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28429/consoleFull) for PR 4961 at commit

[GitHub] spark pull request: [SPARK-6211][Streaming] Add Python Kafka API u...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4961#issuecomment-78058237 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-5183][SQL] Update SQL Docs with JDBC an...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4958#issuecomment-78180500 [Test build #28451 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28451/consoleFull) for PR 4958 at commit

[GitHub] spark pull request: [SPARK-4924] Add a library for launching Spark...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3916#issuecomment-78182670 [Test build #28447 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28447/consoleFull) for PR 3916 at commit

[GitHub] spark pull request: [SPARK-4924] Add a library for launching Spark...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/3916#issuecomment-78182678 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: SPARK-6245 [SQL] jsonRDD() of empty RDD result...

2015-03-10 Thread marmbrus
Github user marmbrus commented on the pull request: https://github.com/apache/spark/pull/4971#issuecomment-78183462 LGTM, thanks @srowen --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request: [SPARK-5986][MLLib] Add save/load for k-means

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4951#issuecomment-78183339 [Test build #28453 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28453/consoleFull) for PR 4951 at commit

[GitHub] spark pull request: [SPARK-6269] Use a different implementation of...

2015-03-10 Thread mccheah
GitHub user mccheah opened a pull request: https://github.com/apache/spark/pull/4972 [SPARK-6269] Use a different implementation of java.lang.reflect.Array This patch uses a different implementation of java.lang.reflect.Array. The code is copied and pasted from

[GitHub] spark pull request: [SPARK-6243][SQL] The Operation of match did n...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4959#issuecomment-78183487 [Test build #28448 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28448/consoleFull) for PR 4959 at commit

[GitHub] spark pull request: [SPARK-2199] [mllib] topic modeling

2015-03-10 Thread renchengchang
Github user renchengchang commented on the pull request: https://github.com/apache/spark/pull/1269#issuecomment-78184357 Thanks. I have a question: if there is not document id ,how can I know the relation between topic vector and raw text? 发件人: Avanesov Valeriy

[GitHub] spark pull request: [SPARK-5124][Core] A standard RPC interface an...

2015-03-10 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/4588#discussion_r26183675 --- Diff: core/src/main/scala/org/apache/spark/rpc/RpcEnv.scala --- @@ -0,0 +1,370 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[GitHub] spark pull request: [SPARK-6211][Streaming] Add Python Kafka API u...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4961#issuecomment-78186540 [Test build #28458 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28458/consoleFull) for PR 4961 at commit

[GitHub] spark pull request: [SPARK-5183][SQL] Update SQL Docs with JDBC an...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4958#issuecomment-78188394 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-5183][SQL] Update SQL Docs with JDBC an...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4958#issuecomment-78188389 [Test build #28452 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28452/consoleFull) for PR 4958 at commit

[GitHub] spark pull request: [SPARK-6185][SQL] Deltele repeated TOKEN. TOK...

2015-03-10 Thread marmbrus
Github user marmbrus commented on the pull request: https://github.com/apache/spark/pull/4907#issuecomment-78188942 +1 to sorting. On Mar 10, 2015 7:26 PM, Daoyuan Wang notificati...@github.com wrote: Again, I think we'd better sort these tokens in alphabetic order to

[GitHub] spark pull request: [SPARK-6185][SQL] Deltele repeated TOKEN. TOK...

2015-03-10 Thread DoingDone9
Github user DoingDone9 commented on the pull request: https://github.com/apache/spark/pull/4907#issuecomment-78190562 It is a good idea that to sorting , i will do it @adrian-wang @marmbrus @chenghao-intel --- If your project is set up for it, you can reply to this email and

[GitHub] spark pull request: [SPARK-5986][MLLib] Add save/load for k-means

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4951#issuecomment-78190589 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-2199] [mllib] topic modeling

2015-03-10 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/1269#issuecomment-78190574 @akopich Since this is no longer an active PR, could you please close it? It was very helpful to have this PR as a major basis for the initial LDA PR. If you

[GitHub] spark pull request: [SPARK-1503][MLLIB] Initial AcceleratedGradien...

2015-03-10 Thread staple
Github user staple commented on the pull request: https://github.com/apache/spark/pull/4934#issuecomment-78192837 Hi, replying to some of the statements above: It seems @staple has already implemented backtracking (because he has results in the JIRA), but kept them out of

[GitHub] spark pull request: [SPARK-5205][Streaming]:Inconsistent behaviour...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4135#issuecomment-78201355 [Test build #28459 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28459/consoleFull) for PR 4135 at commit

[GitHub] spark pull request: [SQL][Minor] fix typo in comments

2015-03-10 Thread liuhb86
GitHub user liuhb86 opened a pull request: https://github.com/apache/spark/pull/4976 [SQL][Minor] fix typo in comments Removed an repeated from in the comments. You can merge this pull request into a Git repository by running: $ git pull https://github.com/liuhb86/spark mine

[GitHub] spark pull request: [SPARK-6274][Streaming][Examples] Added exampl...

2015-03-10 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/4975#issuecomment-78208286 lgtm -- assuming it compiles --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-6274][Streaming][Examples] Added exampl...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4975#issuecomment-78208900 [Test build #28462 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28462/consoleFull) for PR 4975 at commit

[GitHub] spark pull request: [SPARK-6274][Streaming][Examples] Added exampl...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4975#issuecomment-78208901 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: SPARK-6245 [SQL] jsonRDD() of empty RDD result...

2015-03-10 Thread srowen
GitHub user srowen opened a pull request: https://github.com/apache/spark/pull/4971 SPARK-6245 [SQL] jsonRDD() of empty RDD results in exception Avoid `UnsupportedOperationException` from JsonRDD.inferSchema on empty RDD. Not sure if this is supposed to be an error (but a

[GitHub] spark pull request: SPARK-6225 [CORE] [SQL] [STREAMING] Resolve mo...

2015-03-10 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/4950#discussion_r26181023 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/recommendation/MatrixFactorizationModel.scala --- @@ -199,12 +199,12 @@ object

[GitHub] spark pull request: [SPARK-6269] Use a different implementation of...

2015-03-10 Thread mccheah
Github user mccheah commented on the pull request: https://github.com/apache/spark/pull/4972#issuecomment-78184492 I believe we use Array.get in the visitArray method in SizeEstimator, so there's that as well. --- If your project is set up for it, you can reply to this email and

[GitHub] spark pull request: [SPARK-5183][SQL] Update SQL Docs with JDBC an...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4958#issuecomment-78184673 [Test build #28456 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28456/consoleFull) for PR 4958 at commit

[GitHub] spark pull request: [SPARK-6185][SQL] Deltele repeated TOKEN. TOK...

2015-03-10 Thread adrian-wang
Github user adrian-wang commented on the pull request: https://github.com/apache/spark/pull/4907#issuecomment-78188846 Again, I think we'd better sort these tokens in alphabetic order to avoid further duplicate --- If your project is set up for it, you can reply to this email and

[GitHub] spark pull request: [SPARK-5556][MLLib][WIP] Gibbs LDA, Refactor L...

2015-03-10 Thread EntilZha
Github user EntilZha commented on a diff in the pull request: https://github.com/apache/spark/pull/4807#discussion_r26189097 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/LDA.scala --- @@ -311,165 +319,319 @@ private[clustering] object LDA {

[GitHub] spark pull request: [SPARK-5556][MLLib][WIP] Gibbs LDA, Refactor L...

2015-03-10 Thread witgo
Github user witgo commented on a diff in the pull request: https://github.com/apache/spark/pull/4807#discussion_r26189615 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/LDA.scala --- @@ -311,165 +319,319 @@ private[clustering] object LDA {

[GitHub] spark pull request: SPARK-6245 [SQL] jsonRDD() of empty RDD result...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4971#issuecomment-78179942 [Test build #28450 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28450/consoleFull) for PR 4971 at commit

[GitHub] spark pull request: [SPARK-6198][SQL] Support select current_data...

2015-03-10 Thread DoingDone9
Github user DoingDone9 commented on the pull request: https://github.com/apache/spark/pull/4926#issuecomment-78182986 could you test it @marmbrus --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark pull request: [SPARK-6269] Use a different implementation of...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4972#issuecomment-78183787 [Test build #28454 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28454/consoleFull) for PR 4972 at commit

[GitHub] spark pull request: [SPARK-6269] Use a different implementation of...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4972#issuecomment-78184695 [Test build #28455 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28455/consoleFull) for PR 4972 at commit

[GitHub] spark pull request: [SPARK-6222][STREAMING] Make sure batches are ...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4964#issuecomment-78186076 [Test build #28457 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28457/consoleFull) for PR 4964 at commit

[GitHub] spark pull request: [SPARK-6268][MLlib] KMeans parameter getter me...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4974#issuecomment-78209124 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-6268][MLlib] KMeans parameter getter me...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4974#issuecomment-78209117 [Test build #28460 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28460/consoleFull) for PR 4974 at commit

[GitHub] spark pull request: Spark 5659 Flaky test: o.a.s.streaming.Receive...

2015-03-10 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/4957#discussion_r26146158 --- Diff: streaming/src/test/scala/org/apache/spark/streaming/ReceiverSuite.scala --- @@ -161,50 +161,27 @@ class ReceiverSuite extends TestSuiteBase with

[GitHub] spark pull request: [SPARK-5124][Core] A standard RPC interface an...

2015-03-10 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/4588#discussion_r26147851 --- Diff: core/src/main/scala/org/apache/spark/rpc/RpcEnv.scala --- @@ -0,0 +1,370 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[GitHub] spark pull request: SPARK-4921. TaskSetManager.dequeueTask returns...

2015-03-10 Thread sryza
Github user sryza closed the pull request at: https://github.com/apache/spark/pull/3816 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request: [SPARK-5124][Core] A standard RPC interface an...

2015-03-10 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/4588#discussion_r26147605 --- Diff: core/src/main/scala/org/apache/spark/rpc/RpcEnv.scala --- @@ -0,0 +1,370 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[GitHub] spark pull request: [SPARK-5124][Core] A standard RPC interface an...

2015-03-10 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/4588#discussion_r26148233 --- Diff: core/src/main/scala/org/apache/spark/rpc/RpcEnv.scala --- @@ -0,0 +1,373 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[GitHub] spark pull request: [SPARK-5124][Core] A standard RPC interface an...

2015-03-10 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/4588#discussion_r26149013 --- Diff: core/src/main/scala/org/apache/spark/rpc/RpcEnv.scala --- @@ -0,0 +1,373 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[GitHub] spark pull request: [SPARK-5124][Core] A standard RPC interface an...

2015-03-10 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/4588#discussion_r26148973 --- Diff: core/src/main/scala/org/apache/spark/rpc/RpcEnv.scala --- @@ -0,0 +1,373 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[GitHub] spark pull request: [SPARK-2033] Automatically cleanup checkpoint

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/855#issuecomment-78075491 [Test build #28430 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28430/consoleFull) for PR 855 at commit

[GitHub] spark pull request: [SPARK-5556][MLLib][WIP] Gibbs LDA, Refactor L...

2015-03-10 Thread witgo
Github user witgo commented on the pull request: https://github.com/apache/spark/pull/4807#issuecomment-78089533 @EntilZha @mengxr This branch can be merged into master? I want merge the PR to [LightLDA](https://github.com/witgo/spark/tree/LightLDA) and

[GitHub] spark pull request: [SPARK-5124][Core] A standard RPC interface an...

2015-03-10 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/4588#discussion_r26140416 --- Diff: core/src/main/scala/org/apache/spark/rpc/RpcEnv.scala --- @@ -0,0 +1,370 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[GitHub] spark pull request: [SPARK-6269] Use a different implementation of...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4972#issuecomment-78183941 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-2199] [mllib] topic modeling

2015-03-10 Thread akopich
Github user akopich commented on the pull request: https://github.com/apache/spark/pull/1269#issuecomment-78184948 @renchengchang What do you mean by topic vector? A vector of p(t|d) \forall t? If so, you can find these vectors in `RDD[DocumentParameters]` which is returned by

[GitHub] spark pull request: SPARK-6245 [SQL] jsonRDD() of empty RDD result...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4971#issuecomment-78186536 [Test build #28450 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28450/consoleFull) for PR 4971 at commit

[GitHub] spark pull request: SPARK-6245 [SQL] jsonRDD() of empty RDD result...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4971#issuecomment-78186541 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-6211][Streaming] Add Python Kafka API u...

2015-03-10 Thread jerryshao
Github user jerryshao commented on the pull request: https://github.com/apache/spark/pull/4961#issuecomment-78202972 Hi @davies @tdas , I added the Python Kafka API unit test, it works very well in my local test, but seems always failed in Jenkins test with error like: ```

[GitHub] spark pull request: [SPARK-5183][SQL] Update SQL Docs with JDBC an...

2015-03-10 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/4958#issuecomment-78180187 lgtm --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark pull request: [SPARK-5183][SQL] Update SQL Docs with JDBC an...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4958#issuecomment-78181015 [Test build #28452 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28452/consoleFull) for PR 4958 at commit

[GitHub] spark pull request: SPARK-6245 [SQL] jsonRDD() of empty RDD result...

2015-03-10 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/4971#discussion_r26182739 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/json/JsonRDD.scala --- @@ -48,7 +48,11 @@ private[sql] object JsonRDD extends Logging {

[GitHub] spark pull request: [SPARK-6243][SQL] The Operation of match did n...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4959#issuecomment-78183498 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-5183][SQL] Update SQL Docs with JDBC an...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4958#issuecomment-78187934 [Test build #28451 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28451/consoleFull) for PR 4958 at commit

[GitHub] spark pull request: [SPARK-5183][SQL] Update SQL Docs with JDBC an...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4958#issuecomment-78187940 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-6269] Use a different implementation of...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4972#issuecomment-78191541 [Test build #28455 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28455/consoleFull) for PR 4972 at commit

[GitHub] spark pull request: [SPARK-6269] Use a different implementation of...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4972#issuecomment-78191546 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-5183][SQL] Update SQL Docs with JDBC an...

2015-03-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4958#issuecomment-78191538 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-5183][SQL] Update SQL Docs with JDBC an...

2015-03-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4958#issuecomment-78191533 [Test build #28456 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28456/consoleFull) for PR 4958 at commit

[GitHub] spark pull request: [SPARK-5205][Streaming]:Inconsistent behaviour...

2015-03-10 Thread uncleGen
Github user uncleGen commented on the pull request: https://github.com/apache/spark/pull/4135#issuecomment-78201163 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request: [SPARK-5556][MLLib][WIP] Gibbs LDA, Refactor L...

2015-03-10 Thread EntilZha
Github user EntilZha commented on a diff in the pull request: https://github.com/apache/spark/pull/4807#discussion_r26187624 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/LDA.scala --- @@ -311,165 +319,319 @@ private[clustering] object LDA {

[GitHub] spark pull request: [SPARK-5556][MLLib][WIP] Gibbs LDA, Refactor L...

2015-03-10 Thread EntilZha
Github user EntilZha commented on a diff in the pull request: https://github.com/apache/spark/pull/4807#discussion_r26187620 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/LDA.scala --- @@ -311,165 +319,319 @@ private[clustering] object LDA {

[GitHub] spark pull request: [SPARK-5556][MLLib][WIP] Gibbs LDA, Refactor L...

2015-03-10 Thread witgo
Github user witgo commented on a diff in the pull request: https://github.com/apache/spark/pull/4807#discussion_r26188380 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/LDA.scala --- @@ -311,165 +319,319 @@ private[clustering] object LDA {

[GitHub] spark pull request: [SPARK-6268][MLlib] KMeans parameter getter me...

2015-03-10 Thread hhbyyh
GitHub user hhbyyh opened a pull request: https://github.com/apache/spark/pull/4974 [SPARK-6268][MLlib] KMeans parameter getter methods jira: https://issues.apache.org/jira/browse/SPARK-6268 KMeans has many setters for parameters. It should have matching getters. You can

[GitHub] spark pull request: [SPARK-5556][MLLib][WIP] Gibbs LDA, Refactor L...

2015-03-10 Thread witgo
Github user witgo commented on a diff in the pull request: https://github.com/apache/spark/pull/4807#discussion_r26188379 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/LDA.scala --- @@ -311,165 +319,319 @@ private[clustering] object LDA {

  1   2   3   >