[jira] [Commented] (SPARK-11685) Find duplicate content under examples/

2015-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15054056#comment-15054056 ] Yanbo Liang commented on SPARK-11685: - [~josephkb] I have checked thoroughly, there a

[jira] [Commented] (SPARK-11685) Find duplicate content under examples/

2015-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15054055#comment-15054055 ] Yanbo Liang commented on SPARK-11685: - [~josephkb] I have checked thoroughly, there a

[jira] [Issue Comment Deleted] (SPARK-11685) Find duplicate content under examples/

2015-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-11685: Comment: was deleted (was: [~josephkb] I have checked thoroughly, there are no other examples we s

[jira] [Issue Comment Deleted] (SPARK-11685) Find duplicate content under examples/

2015-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-11685: Comment: was deleted (was: [~josephkb] I have checked thoroughly, there are no other examples we s

[jira] [Resolved] (SPARK-11685) Find duplicate content under examples/

2015-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-11685. - Resolution: Fixed > Find duplicate content under examples/ >

[jira] [Commented] (SPARK-11959) Document normal equation solver for ordinary least squares in user guide

2015-12-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15054080#comment-15054080 ] Yanbo Liang commented on SPARK-11959: - OK, I can take this one. > Document normal eq

[jira] [Created] (SPARK-12309) Use sqlContext from MLlibTestSparkContext for spark.ml test suites

2015-12-12 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-12309: --- Summary: Use sqlContext from MLlibTestSparkContext for spark.ml test suites Key: SPARK-12309 URL: https://issues.apache.org/jira/browse/SPARK-12309 Project: Spark

[jira] [Updated] (SPARK-12309) Use sqlContext from MLlibTestSparkContext for spark.ml test suites

2015-12-12 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12309: Description: Use sqlContext from MLlibTestSparkContext rather than creating new one for each spark.

[jira] [Updated] (SPARK-12309) Use sqlContext from MLlibTestSparkContext for spark.ml test suites

2015-12-12 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12309: Description: Use sqlContext from MLlibTestSparkContext rather than creating new one for spark.ml te

[jira] [Created] (SPARK-12310) Add write.json and write.parquet for SparkR

2015-12-13 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-12310: --- Summary: Add write.json and write.parquet for SparkR Key: SPARK-12310 URL: https://issues.apache.org/jira/browse/SPARK-12310 Project: Spark Issue Type: Sub-tas

[jira] [Created] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2015-12-16 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-12363: --- Summary: PowerIterationClustering test case failed if we deprecated KMeans.setRuns Key: SPARK-12363 URL: https://issues.apache.org/jira/browse/SPARK-12363 Project: Spar

[jira] [Commented] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2015-12-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15059753#comment-15059753 ] Yanbo Liang commented on SPARK-12363: - After I removed [this line|https://github.com

[jira] [Comment Edited] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2015-12-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15059753#comment-15059753 ] Yanbo Liang edited comment on SPARK-12363 at 12/16/15 9:38 AM:

[jira] [Updated] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2015-12-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12363: Component/s: (was: GraphX) > PowerIterationClustering test case failed if we deprecated KMeans.

[jira] [Comment Edited] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2015-12-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15059753#comment-15059753 ] Yanbo Liang edited comment on SPARK-12363 at 12/16/15 9:40 AM:

[jira] [Comment Edited] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2015-12-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15059753#comment-15059753 ] Yanbo Liang edited comment on SPARK-12363 at 12/16/15 9:41 AM:

[jira] [Commented] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2015-12-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15059763#comment-15059763 ] Yanbo Liang commented on SPARK-12363: - cc [~mengxr] [~josephkb] [~viirya] Would you m

[jira] [Commented] (SPARK-12350) VectorAssembler#transform() initially throws an exception

2015-12-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15059788#comment-15059788 ] Yanbo Liang commented on SPARK-12350: - I can reproduce this issue, but it not caused

[jira] [Comment Edited] (SPARK-12350) VectorAssembler#transform() initially throws an exception

2015-12-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15059788#comment-15059788 ] Yanbo Liang edited comment on SPARK-12350 at 12/16/15 10:15 AM: ---

[jira] [Created] (SPARK-12364) Add ML example for SparkR

2015-12-16 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-12364: --- Summary: Add ML example for SparkR Key: SPARK-12364 URL: https://issues.apache.org/jira/browse/SPARK-12364 Project: Spark Issue Type: Improvement Com

[jira] [Commented] (SPARK-11478) ML StringIndexer return inconsistent schema

2015-12-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15061371#comment-15061371 ] Yanbo Liang commented on SPARK-11478: - [~wjur] I'm not working on this. You can work

[jira] [Created] (SPARK-12393) Add read.text and write.text for SparkR

2015-12-16 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-12393: --- Summary: Add read.text and write.text for SparkR Key: SPARK-12393 URL: https://issues.apache.org/jira/browse/SPARK-12393 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-12393) Add read.text and write.text for SparkR

2015-12-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12393: Issue Type: Sub-task (was: New Feature) Parent: SPARK-12144 > Add read.text and write.text

[jira] [Commented] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2015-12-17 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15061761#comment-15061761 ] Yanbo Liang commented on SPARK-12363: - {quote} Does it improve if you increase the nu

[jira] [Comment Edited] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2015-12-17 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15061761#comment-15061761 ] Yanbo Liang edited comment on SPARK-12363 at 12/17/15 9:19 AM:

[jira] [Comment Edited] (SPARK-12363) PowerIterationClustering test case failed if we deprecated KMeans.setRuns

2015-12-17 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15061761#comment-15061761 ] Yanbo Liang edited comment on SPARK-12363 at 12/17/15 9:20 AM:

[jira] [Updated] (SPARK-11939) PySpark support model export/import for Pipeline API

2015-12-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-11939: Issue Type: Umbrella (was: Sub-task) Parent: (was: SPARK-11937) > PySpark support mode

[jira] [Commented] (SPARK-11939) PySpark support model export/import for Pipeline API

2015-12-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15070816#comment-15070816 ] Yanbo Liang commented on SPARK-11939: - [~josephkb] I have implemented MLWriter/MLWrit

[jira] [Commented] (SPARK-12494) Array out of bound Exception in KMeans Yarn Mode

2015-12-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15070840#comment-15070840 ] Yanbo Liang commented on SPARK-12494: - [~anandr...@gmail.com] Can this issue be repro

[jira] [Comment Edited] (SPARK-12494) Array out of bound Exception in KMeans Yarn Mode

2015-12-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15070840#comment-15070840 ] Yanbo Liang edited comment on SPARK-12494 at 12/24/15 10:14 AM: ---

[jira] [Comment Edited] (SPARK-12494) Array out of bound Exception in KMeans Yarn Mode

2015-12-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15070840#comment-15070840 ] Yanbo Liang edited comment on SPARK-12494 at 12/24/15 10:15 AM: ---

[jira] [Commented] (SPARK-12461) Add ExpressionDescription to math functions

2015-12-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15070844#comment-15070844 ] Yanbo Liang commented on SPARK-12461: - I can work on it. > Add ExpressionDescription

[jira] [Issue Comment Deleted] (SPARK-12461) Add ExpressionDescription to math functions

2015-12-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12461: Comment: was deleted (was: I can work on it.) > Add ExpressionDescription to math functions >

[jira] [Created] (SPARK-12597) Use udf replace callUDF for ML

2016-01-01 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-12597: --- Summary: Use udf replace callUDF for ML Key: SPARK-12597 URL: https://issues.apache.org/jira/browse/SPARK-12597 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-12597) Use udf replace callUDF for ML

2016-01-01 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12597: Description: callUDF has been deprecated and will be removed in Spark 2.0. We should replace the us

[jira] [Updated] (SPARK-12597) Use udf to replace callUDF for ML

2016-01-01 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12597: Summary: Use udf to replace callUDF for ML (was: Use udf replace callUDF for ML) > Use udf to rep

[jira] [Created] (SPARK-12603) MLlib GaussianMixtureModel should support single instance predict/predictSoft

2016-01-02 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-12603: --- Summary: MLlib GaussianMixtureModel should support single instance predict/predictSoft Key: SPARK-12603 URL: https://issues.apache.org/jira/browse/SPARK-12603 Project:

[jira] [Updated] (SPARK-12603) PySpark MLlib GaussianMixtureModel should support single instance predict/predictSoft

2016-01-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12603: Component/s: PySpark > PySpark MLlib GaussianMixtureModel should support single instance > predict

[jira] [Updated] (SPARK-12603) PySpark MLlib GaussianMixtureModel should support single instance predict/predictSoft

2016-01-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12603: Description: PySpark MLlib GaussianMixtureModel should support single instance predict/predictSoft.

[jira] [Updated] (SPARK-12603) PySpark MLlib GaussianMixtureModel should support single instance predict/predictSoft

2016-01-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12603: Summary: PySpark MLlib GaussianMixtureModel should support single instance predict/predictSoft (wa

[jira] [Updated] (SPARK-12603) PySpark MLlib GaussianMixtureModel should support single instance predict/predictSoft

2016-01-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12603: Description: PySpark MLlib GaussianMixtureModel should support single instance predict/predictSoft

[jira] [Closed] (SPARK-12597) Use udf to replace callUDF for ML

2016-01-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang closed SPARK-12597. --- Resolution: Duplicate > Use udf to replace callUDF for ML > - > >

[jira] [Updated] (SPARK-12603) PySpark MLlib GaussianMixtureModel should support single instance predict/predictSoft

2016-01-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12603: Description: PySpark MLlib GaussianMixtureModel should support single instance predict/predictSoft

[jira] [Commented] (SPARK-9835) Iteratively reweighted least squares solver for GLMs

2016-01-03 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15080618#comment-15080618 ] Yanbo Liang commented on SPARK-9835: [~mengxr] Are you working on this issue? If you a

[jira] [Created] (SPARK-12645) SparkR add function hash

2016-01-05 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-12645: --- Summary: SparkR add function hash Key: SPARK-12645 URL: https://issues.apache.org/jira/browse/SPARK-12645 Project: Spark Issue Type: Improvement Comp

[jira] [Updated] (SPARK-12645) SparkR add function hash

2016-01-05 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12645: Summary: SparkR add function hash (was: SparkR add function hash) > SparkR add function hash > -

[jira] [Updated] (SPARK-12645) SparkR support hash function

2016-01-05 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12645: Summary: SparkR support hash function (was: SparkR add function hash ) > SparkR support hash func

[jira] [Updated] (SPARK-12645) SparkR support hash function

2016-01-05 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12645: Description: Add hash function for SparkR (was: SparkR add function hash for DataFrame) > SparkR

[jira] [Updated] (SPARK-12664) Expose raw prediction scores in MultilayerPerceptronClassificationModel

2016-01-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12664: Priority: Major (was: Minor) > Expose raw prediction scores in MultilayerPerceptronClassificationM

[jira] [Commented] (SPARK-12664) Expose raw prediction scores in MultilayerPerceptronClassificationModel

2016-01-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093164#comment-15093164 ] Yanbo Liang commented on SPARK-12664: - I vote this as an important feature. Multilaye

[jira] [Updated] (SPARK-12664) Expose raw prediction scores in MultilayerPerceptronClassificationModel

2016-01-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12664: Component/s: (was: MLlib) > Expose raw prediction scores in MultilayerPerceptronClassificationM

[jira] [Commented] (SPARK-12664) Expose raw prediction scores in MultilayerPerceptronClassificationModel

2016-01-11 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093170#comment-15093170 ] Yanbo Liang commented on SPARK-12664: - cc [~avulanov] [~mengxr] > Expose raw predict

[jira] [Created] (SPARK-12903) Add covar_samp and covar_pop for SparkR

2016-01-19 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-12903: --- Summary: Add covar_samp and covar_pop for SparkR Key: SPARK-12903 URL: https://issues.apache.org/jira/browse/SPARK-12903 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-12905) PCAModel return eigenvalues for PySpark

2016-01-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12905: Priority: Trivial (was: Minor) > PCAModel return eigenvalues for PySpark > ---

[jira] [Created] (SPARK-12905) PCAModel return eigenvalues for PySpark

2016-01-19 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-12905: --- Summary: PCAModel return eigenvalues for PySpark Key: SPARK-12905 URL: https://issues.apache.org/jira/browse/SPARK-12905 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-12905) PCAModel return eigenvalues for PySpark

2016-01-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12905: Priority: Minor (was: Trivial) > PCAModel return eigenvalues for PySpark > ---

[jira] [Created] (SPARK-12962) PySpark support covar_samp and covar_pop

2016-01-21 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-12962: --- Summary: PySpark support covar_samp and covar_pop Key: SPARK-12962 URL: https://issues.apache.org/jira/browse/SPARK-12962 Project: Spark Issue Type: Improvemen

[jira] [Created] (SPARK-12974) Add Python API for spark.ml bisecting k-means

2016-01-24 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-12974: --- Summary: Add Python API for spark.ml bisecting k-means Key: SPARK-12974 URL: https://issues.apache.org/jira/browse/SPARK-12974 Project: Spark Issue Type: Impro

[jira] [Updated] (SPARK-12974) Add Python API for spark.ml bisecting k-means

2016-01-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12974: Component/s: PySpark ML > Add Python API for spark.ml bisecting k-means >

[jira] [Created] (SPARK-13032) Basic ML Pipeline export/import functions for PySpark

2016-01-27 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-13032: --- Summary: Basic ML Pipeline export/import functions for PySpark Key: SPARK-13032 URL: https://issues.apache.org/jira/browse/SPARK-13032 Project: Spark Issue Typ

[jira] [Created] (SPARK-13033) PySpark regression support export/import

2016-01-27 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-13033: --- Summary: PySpark regression support export/import Key: SPARK-13033 URL: https://issues.apache.org/jira/browse/SPARK-13033 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-13034) PySpark ml.classification support export/import

2016-01-27 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-13034: --- Summary: PySpark ml.classification support export/import Key: SPARK-13034 URL: https://issues.apache.org/jira/browse/SPARK-13034 Project: Spark Issue Type: Sub

[jira] [Updated] (SPARK-13033) PySpark ml.regression support export/import

2016-01-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-13033: Summary: PySpark ml.regression support export/import (was: PySpark regression support export/impor

[jira] [Created] (SPARK-13035) PySpark ml.clustering support export/import

2016-01-27 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-13035: --- Summary: PySpark ml.clustering support export/import Key: SPARK-13035 URL: https://issues.apache.org/jira/browse/SPARK-13035 Project: Spark Issue Type: Sub-tas

[jira] [Created] (SPARK-13036) PySpark ml.feature support export/import

2016-01-27 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-13036: --- Summary: PySpark ml.feature support export/import Key: SPARK-13036 URL: https://issues.apache.org/jira/browse/SPARK-13036 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-13037) PySpark ml.recommendation support export/import

2016-01-27 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-13037: --- Summary: PySpark ml.recommendation support export/import Key: SPARK-13037 URL: https://issues.apache.org/jira/browse/SPARK-13037 Project: Spark Issue Type: Sub

[jira] [Created] (SPARK-13038) PySpark ml.pipeline support export/import

2016-01-27 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-13038: --- Summary: PySpark ml.pipeline support export/import Key: SPARK-13038 URL: https://issues.apache.org/jira/browse/SPARK-13038 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-11560) Optimize KMeans implementation

2016-01-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15118937#comment-15118937 ] Yanbo Liang commented on SPARK-11560: - [~yuhaoyan] I think the first step is to use B

[jira] [Commented] (SPARK-13010) Survival analysis in SparkR

2016-01-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15119054#comment-15119054 ] Yanbo Liang commented on SPARK-13010: - There are two issues that we should discuss: 1

[jira] [Comment Edited] (SPARK-13010) Survival analysis in SparkR

2016-01-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15119054#comment-15119054 ] Yanbo Liang edited comment on SPARK-13010 at 1/27/16 11:09 AM:

[jira] [Comment Edited] (SPARK-13010) Survival analysis in SparkR

2016-01-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15119054#comment-15119054 ] Yanbo Liang edited comment on SPARK-13010 at 1/27/16 11:09 AM:

[jira] [Commented] (SPARK-13034) PySpark ml.classification support export/import

2016-01-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15120648#comment-15120648 ] Yanbo Liang commented on SPARK-13034: - Hi Miao, Please feel free to take this one, t

[jira] [Commented] (SPARK-12811) Estimator interface for generalized linear models (GLMs)

2016-01-28 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15121176#comment-15121176 ] Yanbo Liang commented on SPARK-12811: - Should we put it under a new folder named "ml/

[jira] [Updated] (SPARK-13153) PySpark ML persistence failed when handle no default value parameter

2016-02-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-13153: Affects Version/s: (was: 1.6.0) > PySpark ML persistence failed when handle no default value pa

[jira] [Issue Comment Deleted] (SPARK-12811) Estimator interface for generalized linear models (GLMs)

2016-02-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12811: Comment: was deleted (was: Should we put it under a new folder named "ml/glm"?) > Estimator interf

[jira] [Commented] (SPARK-8000) SQLContext.read.load() should be able to auto-detect input data

2016-02-12 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15144285#comment-15144285 ] Yanbo Liang commented on SPARK-8000: [~hyukjin.kwon] I'm not working on this issue now

[jira] [Created] (SPARK-13322) AFTSurvivalRegression should handle lossSum infinity

2016-02-15 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-13322: --- Summary: AFTSurvivalRegression should handle lossSum infinity Key: SPARK-13322 URL: https://issues.apache.org/jira/browse/SPARK-13322 Project: Spark Issue Type

[jira] [Commented] (SPARK-13322) AFTSurvivalRegression should handle lossSum infinity

2016-02-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15147036#comment-15147036 ] Yanbo Liang commented on SPARK-13322: - I will send a PR for this issue, please assign

[jira] [Created] (SPARK-13334) ML KMeansModel / BisectingKMeansModel should be set parent

2016-02-15 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-13334: --- Summary: ML KMeansModel / BisectingKMeansModel should be set parent Key: SPARK-13334 URL: https://issues.apache.org/jira/browse/SPARK-13334 Project: Spark Issu

[jira] [Updated] (SPARK-13334) ML KMeansModel / BisectingKMeansModel should be set parent

2016-02-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-13334: Description: ML KMeansModel / BisectingKMeansModel / QuantileDiscretizer should be set parent. I h

[jira] [Updated] (SPARK-13334) ML KMeansModel / BisectingKMeansModel should be set parent

2016-02-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-13334: Description: ML KMeansModel / BisectingKMeansModel / QuantileDiscretizer should be set parent. I h

[jira] [Updated] (SPARK-13334) ML KMeansModel / BisectingKMeansModel should be set parent

2016-02-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-13334: Description: ML KMeansModel / BisectingKMeansModel / QuantileDiscretizer should be set parent. I h

[jira] [Updated] (SPARK-13334) ML KMeansModel / BisectingKMeansModel should be set parent

2016-02-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-13334: Description: ML KMeansModel / BisectingKMeansModel / QuantileDiscretizer should be set parent. I h

[jira] [Updated] (SPARK-13334) ML KMeansModel/BisectingKMeansModel should be set parent

2016-02-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-13334: Summary: ML KMeansModel/BisectingKMeansModel should be set parent (was: ML KMeansModel/BisectingKM

[jira] [Updated] (SPARK-13334) ML KMeansModel/BisectingKMeansModel/QuantileDiscretizerModel should be set parent

2016-02-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-13334: Summary: ML KMeansModel/BisectingKMeansModel/QuantileDiscretizerModel should be set parent (was: M

[jira] [Updated] (SPARK-13322) AFTSurvivalRegression should support feature standardization

2016-02-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-13322: Summary: AFTSurvivalRegression should support feature standardization (was: AFTSurvivalRegression

[jira] [Updated] (SPARK-13322) AFTSurvivalRegression should support feature standardization

2016-02-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-13322: Description: This bug is reported by Stuti Awasthi. https://www.mail-archive.com/user@spark.apache.

[jira] [Updated] (SPARK-13322) AFTSurvivalRegression should support feature standardization

2016-02-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-13322: Description: This bug is reported by Stuti Awasthi. https://www.mail-archive.com/user@spark.apache.

[jira] [Updated] (SPARK-13322) AFTSurvivalRegression should support feature standardization

2016-02-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-13322: Description: This bug is reported by Stuti Awasthi. https://www.mail-archive.com/user@spark.apache.

[jira] [Commented] (SPARK-9662) ML 1.5 QA: API: Python API coverage

2015-08-12 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14693641#comment-14693641 ] Yanbo Liang commented on SPARK-9662: [~josephkb] Checking for inconsistency and breaki

[jira] [Created] (SPARK-9940) PySpark DenseVector, SparseVector implement __hash__ method

2015-08-13 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-9940: -- Summary: PySpark DenseVector, SparseVector implement __hash__ method Key: SPARK-9940 URL: https://issues.apache.org/jira/browse/SPARK-9940 Project: Spark Issue

[jira] [Commented] (SPARK-9793) PySpark DenseVector, SparseVector should override __eq__

2015-08-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14695415#comment-14695415 ] Yanbo Liang commented on SPARK-9793: [~josephkb] PySpark vector currently did not impl

[jira] [Comment Edited] (SPARK-9793) PySpark DenseVector, SparseVector should override __eq__

2015-08-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14695415#comment-14695415 ] Yanbo Liang edited comment on SPARK-9793 at 8/13/15 3:55 PM: -

[jira] [Created] (SPARK-10009) PySpark Param of Vector type can be set with Python array or numpy.array

2015-08-15 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-10009: --- Summary: PySpark Param of Vector type can be set with Python array or numpy.array Key: SPARK-10009 URL: https://issues.apache.org/jira/browse/SPARK-10009 Project: Spark

[jira] [Closed] (SPARK-9940) PySpark DenseVector, SparseVector implement __hash__ method

2015-08-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang closed SPARK-9940. -- Resolution: Duplicate > PySpark DenseVector, SparseVector implement __hash__ method > --

[jira] [Commented] (SPARK-9940) PySpark DenseVector, SparseVector implement __hash__ method

2015-08-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698189#comment-14698189 ] Yanbo Liang commented on SPARK-9940: combine this with SPARK-9793, so close this. > P

[jira] [Commented] (SPARK-9793) PySpark DenseVector, SparseVector should override __eq__

2015-08-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698190#comment-14698190 ] Yanbo Liang commented on SPARK-9793: [~josephkb] I have combined this with SPARK-9940

[jira] [Commented] (SPARK-10009) PySpark Param of Vector type can be set with Python array or numpy.array

2015-08-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698523#comment-14698523 ] Yanbo Liang commented on SPARK-10009: - [~kaisasak] I think what you means is Params w

[jira] [Comment Edited] (SPARK-10009) PySpark Param of Vector type can be set with Python array or numpy.array

2015-08-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698523#comment-14698523 ] Yanbo Liang edited comment on SPARK-10009 at 8/16/15 3:13 AM: -

[jira] [Comment Edited] (SPARK-10009) PySpark Param of Vector type can be set with Python array or numpy.array

2015-08-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698523#comment-14698523 ] Yanbo Liang edited comment on SPARK-10009 at 8/16/15 3:14 AM: -

<    3   4   5   6   7   8   9   10   11   12   >