[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162831905 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162831907 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-08 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162831899 **[Test build #47326 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/47326/consoleFull)** for PR 9057 at commit

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-08 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162830886 **[Test build #47326 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/47326/consoleFull)** for PR 9057 at commit

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-08 Thread yinxusen
Github user yinxusen commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162856962 retest this please

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-08 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162860155 **[Test build #47334 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/47334/consoleFull)** for PR 9057 at commit

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162860161 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162860159 Merged build finished. Test FAILed.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-08 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162859323 **[Test build #47334 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/47334/consoleFull)** for PR 9057 at commit

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-08 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162934620 **[Test build #47341 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/47341/consoleFull)** for PR 9057 at commit

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162948208 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-08 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162947991 **[Test build #47341 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/47341/consoleFull)** for PR 9057 at commit

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162948210 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-12-07 Thread yinxusen
Github user yinxusen commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-162797991 @mengxr @selvinsource As we discussed there, I don't think PMML has good support for multinomial naive Bayes, because we cannot fit the model of multinomial naive Bayes

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-19 Thread yinxusen
Github user yinxusen commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-158013196 @mengxr What do you think about the PMML export for multinomial naive Bayes?

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-15 Thread selvinsource
Github user selvinsource commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156852089 @yinxusen https://github.com/selvinsource/spark-pmml-exporter-validator/tree/logistic_regression_multi_class I tested both multinomial and Bernoulli.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-13 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156393966 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-13 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156393965 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-13 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156379677 Merged build triggered.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-13 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156379693 Merged build started.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-13 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156380788 **[Test build #45855 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45855/consoleFull)** for PR 9057 at commit

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-13 Thread yinxusen
Github user yinxusen commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156475190 If you want to see the exported XML for the multinomial distribution, click

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-13 Thread selvinsource
Github user selvinsource commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156492495 @yinxusen I will check out your branch and do some testing as well using the validator. From what I can see, the exported XML seems correct :+1:.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-13 Thread yinxusen
Github user yinxusen commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156602705 @selvinsource Yes, it looks correct and matches what I exported from R (with the pmml and e1071 libraries for naive Bayes). But I am a little worried about the

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-13 Thread yinxusen
Github user yinxusen commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156391326 @selvinsource @mengxr I modified your [code of pmml export

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-13 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156393810 **[Test build #45855 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45855/consoleFull)** for PR 9057 at commit

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156041797 Merged build started.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156041735 Merged build triggered.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-12 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156045652 **[Test build #45725 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/45725/consoleFull)** for PR 9057 at commit

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156057702 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-12 Thread selvinsource
Github user selvinsource commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-156044525 @yinxusen For multinomial naive Bayes you could still treat the inputs as discrete, since according to the documentation they should be term frequencies,

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-10 Thread yinxusen
Github user yinxusen commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-155661487 @selvinsource Sorry for taking so long. I checked the code and the generated XML file carefully. The null pointer is caused by a mistake where I process continuous

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-11-01 Thread yinxusen
Github user yinxusen commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-152889324 @selvinsource I'll check it ASAP. Thanks!

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-10-31 Thread selvinsource
Github user selvinsource commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-152717699 @yinxusen If you look at https://github.com/selvinsource/spark-pmml-exporter-validator/tree/logistic_regression_multi_class I added a test for your

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-10-31 Thread vruusmann
Github user vruusmann commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-152766778 You may want to check out some valid NaiveBayes models. For example, see the following NB model for the popular "Audit" dataset:

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-10-31 Thread vruusmann
Github user vruusmann commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-15273 The value of the `TargetValueCount@value` attribute must equal some **valid** value of the target `DataField` element (as defined by `DataField/Value@value`
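
A minimal sketch of the constraint described above, assuming a heavily simplified PMML fragment with no XML namespace (real PMML documents declare one). The helper `invalid_target_values`, the field name `label`, and the counts are hypothetical and only for illustration. It collects the `Value@value` entries declared on the target `DataField` and reports any `TargetValueCount@value` that is not among them.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified fragment: the target field declares "0" and "1",
# but one TargetValueCount uses "1.0", which is not a declared value.
PMML_FRAGMENT = """
<PMML>
  <DataDictionary>
    <DataField name="label" optype="categorical" dataType="string">
      <Value value="0"/>
      <Value value="1"/>
    </DataField>
  </DataDictionary>
  <NaiveBayesModel>
    <BayesOutput fieldName="label">
      <TargetValueCounts>
        <TargetValueCount value="0" count="40"/>
        <TargetValueCount value="1.0" count="60"/>
      </TargetValueCounts>
    </BayesOutput>
  </NaiveBayesModel>
</PMML>
"""


def invalid_target_values(pmml_text, target_field):
    """Return sorted TargetValueCount values not declared on the target DataField."""
    root = ET.fromstring(pmml_text)
    # Values declared as valid for the target field in the data dictionary.
    valid = {
        value.get("value")
        for field in root.iter("DataField")
        if field.get("name") == target_field
        for value in field.findall("Value")
    }
    # Any TargetValueCount@value outside that set violates the constraint.
    return sorted({
        tvc.get("value")
        for tvc in root.iter("TargetValueCount")
        if tvc.get("value") not in valid
    })


print(invalid_target_values(PMML_FRAGMENT, "label"))
```

The comparison is done on the raw attribute strings, so a class label written as `1.0` in one place and `1` in another is flagged as a mismatch in this sketch.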

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-10-29 Thread selvinsource
Github user selvinsource commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-152173191 @JasmineGeorge, it would be great if you could add a test to the validator to ensure the exported XML file can be loaded in JPMML and produces the same scores.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-10-29 Thread JasmineGeorge
Github user JasmineGeorge commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-152181045 Sorry, I can't get to it until next Wednesday. Can someone else take over?

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-10-29 Thread selvinsource
Github user selvinsource commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-152340315 I will do it, no prob.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-10-28 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-151891076 @JasmineGeorge Please sign off if the changes look good to you :)

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-151717416 Merged build triggered.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-151717425 Merged build started.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-151726120 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-10-27 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-151726070 **[Test build #44488 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44488/consoleFull)** for PR 9057 at commit

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-10-27 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-151717717 **[Test build #44488 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44488/consoleFull)** for PR 9057 at commit

[GitHub] spark pull request: [SPARK-8546] Add PMML export for Naive Bayes

2015-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9057#issuecomment-151726122 Test PASSed. Refer to this link for build results (access rights to CI server needed):