[jira] [Created] (SPARK-5962) [MLLIB] Python support for Power Iteration Clustering

Stephen Boesch (JIRA) Mon, 23 Feb 2015 18:15:38 -0800

Stephen Boesch created SPARK-5962:
-------------------------------------

             Summary: [MLLIB] Python support for Power Iteration Clustering
                 Key: SPARK-5962
                 URL: https://issues.apache.org/jira/browse/SPARK-5962
             Project: Spark
          Issue Type: Bug
          Components: MLlib
            Reporter: Stephen Boesch



Add Python support for the Power Iteration Clustering feature.  Here is a 
fragment of the JVM-side stub backing the Python API, as we plan to implement it:

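  /**
   * Java stub for Python mllib PowerIterationClustering.run()
   *
   * Note: runs, initializationMode and seed are accepted but not yet
   * applied in this fragment.
   */
  def trainPowerIterationClusteringModel(
      data: JavaRDD[(java.lang.Long, java.lang.Long, java.lang.Double)],
      k: Int,
      maxIterations: Int,
      runs: Int,
      initializationMode: String,
      seed: java.lang.Long): PowerIterationClusteringModel = {
    val picAlg = new PowerIterationClustering()
      .setK(k)
      .setMaxIterations(maxIterations)

    // Unbox to the Scala primitives that run() expects: RDD[(Long, Long, Double)].
    val similarities = data.rdd
      .map { case (src, dst, sim) => (src.longValue, dst.longValue, sim.doubleValue) }
      .persist(StorageLevel.MEMORY_AND_DISK)
    try {
      picAlg.run(similarities)
    } finally {
      // Release the cached input once the model has been built.
      similarities.unpersist(blocking = false)
    }
  }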
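
The stub above takes its input as (long, long, double) triples, so a Python
caller would need to supply records in that shape. The following is only a
minimal sketch of such a preparation step, not part of the planned API: the
helper name to_similarity_triples is hypothetical, and the nonnegativity check
reflects the requirement stated in MLlib's Power Iteration Clustering
documentation rather than anything in this issue.

```python
def to_similarity_triples(records):
    """Coerce (src, dst, similarity) records to (int, int, float) triples.

    Hypothetical helper for illustration only; it is not part of PySpark.
    """
    triples = []
    for src, dst, sim in records:
        sim = float(sim)
        # Assumption: similarities must be nonnegative, as MLlib's PIC docs require.
        if sim < 0.0:
            raise ValueError("similarity must be nonnegative: %r" % (sim,))
        triples.append((int(src), int(dst), sim))
    return triples


if __name__ == "__main__":
    # Two strongly connected pairs joined by one weak edge.
    edges = [(0, 1, 1.0), (2, 3, 1.0), (1, 2, 0.1)]
    print(to_similarity_triples(edges))
```

Given a SparkContext, a list like this could be distributed with
sc.parallelize(...) before being handed to whatever Python wrapper ends up
calling the stub.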




