Github user ericl commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141000851
@mengxr made the requested changes. I found it simpler to keep
`numFeatures` in combination with an array of offsets instead of just the
cumulative count though.
---
If
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141001187
Merged build triggered.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141000373
Merged build started.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141000349
Merged build triggered.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141001217
Merged build started.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141004752
[Test build #42588 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/42588/consoleFull)
for PR 7987 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141019662
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141019539
[Test build #42588 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/42588/console)
for PR 7987 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141019663
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141005178
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141005175
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your
Github user ericl commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39793920
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,276 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141216887
Merged build triggered.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user ericl commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39793925
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,276 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user ericl commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39793923
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,276 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141218143
[Test build #42615 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/42615/consoleFull)
for PR 7987 at commit
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39792678
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,276 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39792732
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,276 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141213211
LGTM except minor comments.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39792679
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,276 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141216920
Merged build started.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user mengxr commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141229054
Merged into master. Thanks!
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have
Github user asfgit closed the pull request at:
https://github.com/apache/spark/pull/7987
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141228602
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141228445
[Test build #42615 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/42615/console)
for PR 7987 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-141228603
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39622002
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39621952
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39621862
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39621888
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39621896
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39621910
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39621929
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39621905
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39621956
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39621868
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39621892
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39621937
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39621944
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/7987#discussion_r39621882
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/Interaction.scala ---
@@ -0,0 +1,243 @@
+/*
+ * Licensed to the Apache Software Foundation
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140885188
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140885145
[Test build #42542 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/42542/console)
for PR 7987 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140885184
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140882828
[Test build #42542 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/42542/consoleFull)
for PR 7987 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140878801
Merged build started.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140878760
Merged build triggered.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140614860
Merged build triggered.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140614869
Merged build started.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140613869
Merged build started.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user ericl commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140614349
@mengxr I did the refactoring as suggested
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140615714
[Test build #42524 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/42524/consoleFull)
for PR 7987 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140613858
Merged build triggered.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140615919
[Test build #42524 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/42524/console)
for PR 7987 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140615923
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140615922
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140615999
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-140615996
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your
Github user mengxr commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-139654262
Had an offline discussion with @ericl
* split this PR into two
1. an `Interaction` transformer that only handles ML columns (`Double`
and `Vector` with
Github user dputler commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-132909037
I'm not clear as to how the order operation is determined. Looking at the
tests, in the case of a categorical interaction it appears that it is based on
the order in
Github user mengxr commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-133085190
@dputler Under distributed setting, we need to make at least one pass to
collect all categories. The ordering is not alphabetical but by frequency
Github user dputler commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-133112345
That actually doesn't deal with the scoring issue. What happens when new
data to be predicted from an existing model has a more frequent category in a
categorical
Github user dputler commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-133150940
Thanks Eric, I knew that was it really wasn't an issue related to this
specific pull request, but one that exposed an issue that, from past
experience, looked like
Github user ericl commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-133127935
If if I understand correctly, the concern is that the category to index
assignment when predicting data will be different from that used when fitting
the model. This
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-128549041
Merged build triggered.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-128548927
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-128549338
[Test build #40115 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40115/consoleFull)
for PR 7987 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-128549047
Merged build started.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-128547907
Merged build started.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-128554840
[Test build #40115 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40115/console)
for PR 7987 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-128547886
Merged build triggered.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-128553745
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-128553719
[Test build #40114 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40114/console)
for PR 7987 at commit
Github user ericl commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-128548608
@mengxr done, this PR now just has the RInteraction changes.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/7987#issuecomment-128554904
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your
74 matches
Mail list logo