[
https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15061761#comment-15061761
]
Yanbo Liang edited comment on SPARK-12363 at 12/17/15 9:20 AM:
---------------------------------------------------------------
{quote}
Does it improve if you increase the number of iterations KMeans runs for when
called from PIC?
{quote}
No, it converged in 1 iterations in any condition.
I'm confused that we have two test case:
{code}
test("power iteration clustering")
test("power iteration clustering on graph")
{code}
They use the same input data, but call different train API.
It should get same result theoretically, but the first test case succeed and
the second one failed.
[~josephkb]
was (Author: yanboliang):
{quote}
Does it improve if you increase the number of iterations KMeans runs for when
called from PIC?
{quote}
No, it converged in 1 iterations in any condition.
I'm confused that we have two test case:
{code}
test("power iteration clustering")
test("power iteration clustering on graph")
{code}
They use the same input data, but call different train API. It should get same
result theoretically.
But the first test case succeed and the second one failed.
[~josephkb]
> PowerIterationClustering test case failed if we deprecated KMeans.setRuns
> -------------------------------------------------------------------------
>
> Key: SPARK-12363
> URL: https://issues.apache.org/jira/browse/SPARK-12363
> Project: Spark
> Issue Type: Bug
> Components: MLlib
> Reporter: Yanbo Liang
> Priority: Minor
>
> We plan to deprecated `runs` of KMeans, PowerIterationClustering will
> leverage KMeans to train model.
> I removed `setRuns` used in PowerIterationClustering, but one of the test
> cases failed.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]