[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-02-02 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/14725 For future reference: Merged into master --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-02-01 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/14725 Ah, I see. No worries then, I thought maybe you had forgotten.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-02-01 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/14725 I'd love to - however my JIRA account still doesn't have those permissions. My plan was to bug people in person next week to get that sorted out and go back and update the JIRAs.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-02-01 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/14725 Thanks @holdenk! Would you mind assigning the JIRA to me? https://issues.apache.org/jira/browse/SPARK-17161

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-31 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/14725 Thanks @holdenk! I updated the description. I'll follow up with `CountVectorizerModel` after this is merged and ping you.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-31 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/14725 LGTM - thanks for doing this - please ping me on the follow up PRs with the `CountVectorizerModel`. Before merge would you mind updating the PR description for how this was tested to remove the

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Merged build finished. Test PASSed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/72092/ Test PASSed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #72092 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/72092/testReport)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #72092 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/72092/testReport)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/72056/ Test PASSed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Merged build finished. Test PASSed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #72056 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/72056/testReport)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/72051/ Test PASSed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Merged build finished. Test PASSed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #72051 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/72051/testReport)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #72056 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/72056/testReport)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-26 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/14725 ya good idea, I'll add that

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-26 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/14725 This looks good. How about also adding a test for an empty array, given that it was a consideration in the earlier iteration (not anymore, but good to have as a test in case we forget or someone else

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-26 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/14725 Hey @holdenk, I removed the attempt at type inference, so the type must now be specified explicitly, and added a docstring showing common examples. Please take another look when you can, thanks!
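
The design choice described in this comment (dropping type inference in favour of an explicit element type) can be illustrated with a small sketch. This is not the PR's code and does not use Py4J or Spark: `TypedArray`, `infer_element_type`, and `new_typed_array` are hypothetical names in plain Python, assumed here only to show why an empty list defeats inference while an explicit type does not.

```python
from dataclasses import dataclass
from typing import Any, List, Optional, Tuple


@dataclass(frozen=True)
class TypedArray:
    """Hypothetical stand-in for a JVM array: one element type plus its items."""
    element_type: type
    items: Tuple[Any, ...]


def infer_element_type(pylist: List[Any]) -> Optional[type]:
    """Inference approach: inspect the first element, if there is one."""
    return type(pylist[0]) if pylist else None


def new_typed_array(pylist: List[Any], element_type: type) -> TypedArray:
    """Explicit approach: the caller names the element type up front."""
    for index, item in enumerate(pylist):
        # Exact type match, so a bool is not silently accepted as an int.
        if type(item) is not element_type:
            raise TypeError(
                f"element {index} is {type(item).__name__}, "
                f"expected {element_type.__name__}"
            )
    return TypedArray(element_type, tuple(pylist))


# A vocabulary-style list of strings converts as expected.
vocab = new_typed_array(["a", "b", "c"], str)

# An empty list still gets a well-defined element type when it is explicit,
# whereas inference has nothing to inspect and returns None.
empty = new_typed_array([], str)
no_guess = infer_element_type([])
```

The thread does not show the helper's actual signature, so nothing here should be read as its API; the sketch only mirrors the empty-array case raised in the review.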

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #72051 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/72051/testReport)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-09 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/14725 I don't think the wrappers are public APIs per se, but I agree reducing the amount of boilerplate Scala code required to expose the ML stuff is good if we can make it robust :)

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-09 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/14725 Sure, I can add a better docstring. This is just for developers and doesn't have to be used, but it can be used to avoid creating more Java-friendly functions only because they have arrays -

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-04 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/14725 Maybe it could be cleared up a bit with a good docstring? Although if the result is too confusing to be used then it's probably not worth doing.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-04 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/14725 Thanks @holdenk for taking a look! Yeah, I think you're right about the issues trying to infer a type. It would be nice if there was some easy way to specify a primitive type since that would

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-04 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/14725 cc @davies perhaps - what are your thoughts?

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-04 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Merged build finished. Test PASSed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-04 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/70887/ Test PASSed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-04 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #70887 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70887/testReport)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-04 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/14725 Jenkins, retest this please

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-04 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #70887 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70887/testReport)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-04 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #70885 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70885/testReport)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-04 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #70885 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70885/testReport)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/70846/ Test FAILed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #70846 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70846/testReport)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Merged build finished. Test FAILed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-03 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/14725 ping @jkbradley @yanboliang @MLnick . This seems to have gone stale, but I think it would be great to get in to add things like CountVectorizer constructor from vocab list to PySpark. If any

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2017-01-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #70846 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70846/testReport)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-09-28 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/14725 ping @jkbradley @yanboliang

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-09-13 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/14725 ping @yanboliang, mind taking a look? I'd like to have this to create a CountVectorizerModel from a vocabulary list from SPARK-15009, thanks!

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Merged build finished. Test PASSed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64306/ Test PASSed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-23 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #64306 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64306/consoleFull)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Merged build finished. Test PASSed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64305/ Test PASSed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-23 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #64305 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64305/consoleFull)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-23 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #64306 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64306/consoleFull)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-23 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #64305 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64305/consoleFull)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-23 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/14725 Thanks @holdenk. I have an example usage for primitive arrays in the PR description, let me know if that is not clear enough to show how this change is useful. I also added usage for

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64304/ Test FAILed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-23 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #64304 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64304/consoleFull)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Merged build finished. Test FAILed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-23 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #64304 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64304/consoleFull)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Merged build finished. Test FAILed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14725 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64111/ Test FAILed.

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #64111 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64111/consoleFull)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-19 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/14725 Maybe it would be useful to see in the PR where this is going to simplify the current PySpark ML code? (Need not be every model right away since if we decide to make design changes that could

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14725 **[Test build #64111 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64111/consoleFull)** for PR 14725 at commit

[GitHub] spark issue #14725: [SPARK-17161] [PYSPARK][ML] Add PySpark-ML JavaWrapper c...

2016-08-19 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/14725 @yanboliang, @jkbradley what do you think of this proposal?