[jira] [Commented] (SPARK-20072) Clarify ALS-WR documentation
[ https://issues.apache.org/jira/browse/SPARK-20072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15942473#comment-15942473 ] chris snow commented on SPARK-20072: Will do. Thanks Sean. > Clarify ALS-WR documentation > > > Key: SPARK-20072 > URL: https://issues.apache.org/jira/browse/SPARK-20072 > Project: Spark > Issue Type: Improvement > Components: Documentation >Affects Versions: 2.1.0 >Reporter: chris snow >Priority: Trivial > > https://www.mail-archive.com/user@spark.apache.org/msg62590.html > The documentation for collaborative filtering is as follows: > === > Scaling of the regularization parameter > Since v1.1, we scale the regularization parameter lambda in solving > each least squares problem by the number of ratings the user generated > in updating user factors, or the number of ratings the product > received in updating product factors. > === > I find this description confusing, probably because I lack a detailed > understanding of ALS. The wording suggest that the number of ratings > change ("generated", "received") during solving the least squares. > This is how I think I should be interpreting the description: > === > Since v1.1, we scale the regularization parameter lambda when solving > each least squares problem. When updating the user factors, we scale > the regularization parameter by the total number of ratings from the > user. Similarly, when updating the product factors, we scale the > regularization parameter by the total number of ratings for the > product. > === -- This message was sent by Atlassian JIRA (v6.3.15#6346) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-20072) Clarify ALS-WR documentation
[ https://issues.apache.org/jira/browse/SPARK-20072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939958#comment-15939958 ] Sean Owen commented on SPARK-20072: --- Have a read through the rest of the docs and if you collect a few more comments, OK let's make your suggested updates. > Clarify ALS-WR documentation > > > Key: SPARK-20072 > URL: https://issues.apache.org/jira/browse/SPARK-20072 > Project: Spark > Issue Type: Improvement > Components: Documentation >Affects Versions: 2.1.0 >Reporter: chris snow >Priority: Trivial > > https://www.mail-archive.com/user@spark.apache.org/msg62590.html > The documentation for collaborative filtering is as follows: > === > Scaling of the regularization parameter > Since v1.1, we scale the regularization parameter lambda in solving > each least squares problem by the number of ratings the user generated > in updating user factors, or the number of ratings the product > received in updating product factors. > === > I find this description confusing, probably because I lack a detailed > understanding of ALS. The wording suggest that the number of ratings > change ("generated", "received") during solving the least squares. > This is how I think I should be interpreting the description: > === > Since v1.1, we scale the regularization parameter lambda when solving > each least squares problem. When updating the user factors, we scale > the regularization parameter by the total number of ratings from the > user. Similarly, when updating the product factors, we scale the > regularization parameter by the total number of ratings for the > product. > === -- This message was sent by Atlassian JIRA (v6.3.15#6346) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-20072) Clarify ALS-WR documentation
[ https://issues.apache.org/jira/browse/SPARK-20072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938477#comment-15938477 ] chris snow commented on SPARK-20072: Fair enough. Though this did cause me some grief - I had read and re-read that paragraph a number of times before posting an email to the user group to verify if my understanding was correct. I also appreciate that my suggested rewording could probably be improved a lot. I fully understand if this ticket should be closed because it doesn't merit the effort processing it. > Clarify ALS-WR documentation > > > Key: SPARK-20072 > URL: https://issues.apache.org/jira/browse/SPARK-20072 > Project: Spark > Issue Type: Improvement > Components: Documentation >Affects Versions: 2.1.0 >Reporter: chris snow >Priority: Trivial > > https://www.mail-archive.com/user@spark.apache.org/msg62590.html > The documentation for collaborative filtering is as follows: > === > Scaling of the regularization parameter > Since v1.1, we scale the regularization parameter lambda in solving > each least squares problem by the number of ratings the user generated > in updating user factors, or the number of ratings the product > received in updating product factors. > === > I find this description confusing, probably because I lack a detailed > understanding of ALS. The wording suggest that the number of ratings > change ("generated", "received") during solving the least squares. > This is how I think I should be interpreting the description: > === > Since v1.1, we scale the regularization parameter lambda when solving > each least squares problem. When updating the user factors, we scale > the regularization parameter by the total number of ratings from the > user. Similarly, when updating the product factors, we scale the > regularization parameter by the total number of ratings for the > product. > === -- This message was sent by Atlassian JIRA (v6.3.15#6346) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-20072) Clarify ALS-WR documentation
[ https://issues.apache.org/jira/browse/SPARK-20072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938415#comment-15938415 ] Sean Owen commented on SPARK-20072: --- I don't think those two wordings differ meaningfully? I think small suggestions are OK but weight the value vs overhead of processing these changes. This is pretty borderline. > Clarify ALS-WR documentation > > > Key: SPARK-20072 > URL: https://issues.apache.org/jira/browse/SPARK-20072 > Project: Spark > Issue Type: Improvement > Components: Documentation >Affects Versions: 2.1.0 >Reporter: chris snow >Priority: Trivial > > https://www.mail-archive.com/user@spark.apache.org/msg62590.html > The documentation for collaborative filtering is as follows: > === > Scaling of the regularization parameter > Since v1.1, we scale the regularization parameter lambda in solving > each least squares problem by the number of ratings the user generated > in updating user factors, or the number of ratings the product > received in updating product factors. > === > I find this description confusing, probably because I lack a detailed > understanding of ALS. The wording suggest that the number of ratings > change ("generated", "received") during solving the least squares. > This is how I think I should be interpreting the description: > === > Since v1.1, we scale the regularization parameter lambda when solving > each least squares problem. When updating the user factors, we scale > the regularization parameter by the total number of ratings from the > user. Similarly, when updating the product factors, we scale the > regularization parameter by the total number of ratings for the > product. > === -- This message was sent by Atlassian JIRA (v6.3.15#6346) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org