Github user mengxr commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-56989958
@witgo Since we are converging on a GraphX-based implementation and
distributed representation of the topic model, do you mind closing this PR?
Thanks!
Github user witgo closed the pull request at:
https://github.com/apache/spark/pull/1983
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-5274
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20321/consoleFull)
for PR 1983 at commit
Github user allwefantasy commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55549348
@witgo I have seen your new performance test configuration. I will try your
new code and test it on my data today.
Github user allwefantasy commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-1073
@witgo I have tried your latest code on my corpus. It no longer gets stuck in
broadcasting. However, some exceptions are thrown.
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-3050
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20321/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55514423
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20294/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55515317
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20294/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55376707
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20222/consoleFull)
for PR 1983 at commit
Github user mengxr commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55381531
@witgo @allwefantasy
English | Machine-translated Chinese
|
Let's try to keep the comments in English as much as possible. |
Github user mengxr commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55381621
@witgo @allwefantasy We had an offline discussion about LDA's
implementation. Please check the JIRA page for the notes.
--
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55382369
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20222/consoleFull)
for PR 1983 at commit
Github user witgo commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55391092
@mengxr @allwefantasy
With the current broadcast-based implementation, the performance loss is
serious, especially when the corpus is large. Next week I will
Github user witgo commented on a diff in the pull request:
https://github.com/apache/spark/pull/1983#discussion_r17490277
--- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/LDA.scala
---
@@ -0,0 +1,397 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55478709
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20246/consoleFull)
for PR 1983 at commit
Github user witgo commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55479363
@allwefantasy
I have updated the code; you can try the latest version.
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55479772
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20246/consoleFull)
for PR 1983 at commit
Github user allwefantasy commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55343319
@witgo
OK. Please notify me when there is an update. I can also test it right away on my side.
Github user witgo commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55223673
@allwefantasy Spark can adjust the number of tasks an executor runs concurrently.
If you want each executor to be able to run 17 tasks at the same time
Github user allwefantasy commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55238978
@witgo Thanks for sharing this tip.
I have run into another problem. Yesterday you
asked how many words my 240,000 documents contain; I counted, and it is 24 million
Github user witgo commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55280269
@allwefantasy The existing code
creates too many TopicModel instances during the iterative computation.
I am now trying to solve this problem.
Thanks for your feedback.
Github user allwefantasy commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55089256
@witgo I looked at your performance test, but
it does not mention the number of iterations. How many iterations was it? It finished in just one hour.
Github user witgo commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55094784
@allwefantasy
My test corpus has `196558` documents and `7897767` words.
The number of iterations is `100`.
How many words do your 240,000 documents contain in total?
Github user srowen commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55095034
(Pardon, I think it's important to also summarize in English, as the lingua
franca of the project, for the benefit of other readers.)
Github user witgo commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55095890
@allwefantasy
I think this code, ` Document(parts(0).toInt,(0 until
wordInfo.value.size).map(k => values.getOrElse(k,0)).toArray)`,
is somewhat problematic.
Github user witgo commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55096559
@srowen I will try to translate the comments into English
Github user allwefantasy commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55129263
@witgo Then I made a mistake: I misunderstood `content` in Document.
I thought content
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55213212
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20133/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55216611
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20133/consoleFull)
for PR 1983 at commit
Github user allwefantasy commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-55221324
@witgo Can the following piece of code be multi-threaded?
for (i <- 0 until content.length) {
val term = content(i)
val
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-54952703
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20037/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-54957948
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/20037/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-54720047
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19918/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-54722195
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19918/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-54696954
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19874/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-54698906
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19874/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-54064930
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19562/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-54065121
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19562/consoleFull)
for PR 1983 at commit
Github user witgo commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-54065708
test this please
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-54066312
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19563/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-54072638
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19563/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53855239
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19467/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53864655
**[Tests timed
out](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19467/consoleFull)**
after a configured wait of `120m`.
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53698415
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19395/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53699785
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19396/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53699939
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19396/consoleFull)
for PR 1983 at commit
Github user witgo commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53700053
@mengxr This patch removes the `accumulable` operation, fixes formula
errors in the `dropOneDistSampler` method, and adds some performance
optimizations. About how I
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53704442
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19398/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53704512
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19395/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53709038
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19398/consoleFull)
for PR 1983 at commit
Github user mengxr commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53547793
@witgo Thanks for working on LDA! Could you briefly describe what you
changed in this PR? The major feedback of #476 is how we store the model, which
may be worth more
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53440660
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19213/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53449444
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19213/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53273744
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19139/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-53280797
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19139/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-52587323
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18809/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-52589954
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18809/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-52419415
[QA tests have
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18702/consoleFull)
for PR 1983 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/1983#issuecomment-52420410
[QA tests have
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18702/consoleFull)
for PR 1983 at commit