Github user holdenk commented on the issue:
https://github.com/apache/spark/pull/14725
For future reference: Merged into master
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/14725
Ah, I see.. No worries then, I though maybe you had forgotten.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project
Github user holdenk commented on the issue:
https://github.com/apache/spark/pull/14725
I'd love to - however my JIRA account still doesn't have those permissions.
My plan was to bug people in person next week to get that sorted out and go
back and update the JIRAs.
---
If your
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/14725
Thanks @holdenk! Would you mind assigning the JIRA to me?
https://issues.apache.org/jira/browse/SPARK-17161
---
If your project is set up for it, you can reply to this email and have your
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/14725
Thanks @holdenk! I updated the description. I'll follow up with
`CountVectorizerModel` after this merged and ping you.
---
If your project is set up for it, you can reply to this email and
Github user holdenk commented on the issue:
https://github.com/apache/spark/pull/14725
LGTM - thanks for doing this - please ping me on the follow up PRs with the
`CountVectorizerModel`. Before merge would you mind updating the PR description
for how this was tested to remove the
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/72092/
Test PASSed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #72092 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/72092/testReport)**
for PR 14725 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #72092 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/72092/testReport)**
for PR 14725 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/72056/
Test PASSed.
---
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #72056 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/72056/testReport)**
for PR 14725 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/72051/
Test PASSed.
---
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #72051 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/72051/testReport)**
for PR 14725 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #72056 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/72056/testReport)**
for PR 14725 at commit
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/14725
ya good idea, I'll add that
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled
Github user holdenk commented on the issue:
https://github.com/apache/spark/pull/14725
This looks good, how about also adding a test for an empty array given that
it was a consideration in the earlier iteration (not anymore but good to have
as a test incase we forget or someone else
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/14725
Hey @holdenk , I removed the attempt at type inference so now must be
specified explicitly and added a docstring showing common examples. Please
take another look when you can, thanks!
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #72051 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/72051/testReport)**
for PR 14725 at commit
Github user holdenk commented on the issue:
https://github.com/apache/spark/pull/14725
I don't think the wrappers are public APIs per-se, but I agree reducing the
amount of boilerplate scala code required to expose the ML stuff is good if we
can make it robust :)
---
If your
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/14725
Sure, I can add a better docstring. This is just for developers and
doesn't have to be used, but it can be used to avoid creating more
Java-friendly functions only because they have arrays -
Github user holdenk commented on the issue:
https://github.com/apache/spark/pull/14725
Maybe it could be cleared up a bit with a good docstring? Although if the
result is too confusing to be used then it's probably not worth doing.
---
If your project is set up for it, you can reply
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/14725
Thanks @holdenk for taking a look! Yeah, I think you're right about the
issues trying to infer a type. It would be nice if there was some easy way to
specify a primitive type since that would
Github user holdenk commented on the issue:
https://github.com/apache/spark/pull/14725
cc @davies perhaps - what are your thoughts?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/70887/
Test PASSed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #70887 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70887/testReport)**
for PR 14725 at commit
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/14725
Jenkins, retest this please
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #70887 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70887/testReport)**
for PR 14725 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #70885 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70885/testReport)**
for PR 14725 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #70885 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70885/testReport)**
for PR 14725 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/70846/
Test FAILed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #70846 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70846/testReport)**
for PR 14725 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/14725
ping @jkbradley @yanboliang @MLnick . This seems to have gone stale, but I
think it would be great to get in to add things like CountVectorizer
constructor from vocab list to PySpark. If any
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #70846 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70846/testReport)**
for PR 14725 at commit
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/14725
ping @jkbradley @yanboliang
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/14725
ping @yanboliang, mind taking a look? I'd like to have this to create a
CountVectorizerModel from a vocabulary list from SPARK-15009, thanks!
---
If your project is set up for it, you can
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64306/
Test PASSed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #64306 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64306/consoleFull)**
for PR 14725 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64305/
Test PASSed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #64305 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64305/consoleFull)**
for PR 14725 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #64306 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64306/consoleFull)**
for PR 14725 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #64305 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64305/consoleFull)**
for PR 14725 at commit
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/14725
Thanks @holdenk. I have an example usage for primitive arrays in the PR
description, let me know if that is not clear enough to show how this change
useful. I also added usage for
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64304/
Test FAILed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #64304 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64304/consoleFull)**
for PR 14725 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #64304 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64304/consoleFull)**
for PR 14725 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/14725
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64111/
Test FAILed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #64111 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64111/consoleFull)**
for PR 14725 at commit
Github user holdenk commented on the issue:
https://github.com/apache/spark/pull/14725
Maybe it would be useful to see in the PR where this is going to simplify
the current PySpark ML code? (Need not be every model right away since if we
decide to make design changes that could
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/14725
**[Test build #64111 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64111/consoleFull)**
for PR 14725 at commit
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/14725
@yanboliang, @jkbradley what do you think of this proposal?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does
59 matches
Mail list logo