[jira] [Commented] (SPARK-20072) Clarify ALS-WR documentation

2017-03-26 Thread chris snow (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-20072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15942473#comment-15942473
 ] 

chris snow commented on SPARK-20072:


Will do.  Thanks Sean.

> Clarify ALS-WR documentation
> 
>
> Key: SPARK-20072
> URL: https://issues.apache.org/jira/browse/SPARK-20072
> Project: Spark
>  Issue Type: Improvement
>  Components: Documentation
>Affects Versions: 2.1.0
>Reporter: chris snow
>Priority: Trivial
>
> https://www.mail-archive.com/user@spark.apache.org/msg62590.html
> The documentation for collaborative filtering is as follows:
> ===
> Scaling of the regularization parameter
> Since v1.1, we scale the regularization parameter lambda in solving
> each least squares problem by the number of ratings the user generated
> in updating user factors, or the number of ratings the product
> received in updating product factors.
> ===
> I find this description confusing, probably because I lack a detailed
> understanding of ALS.   The wording suggest that the number of ratings
> change ("generated", "received") during solving the least squares.
> This is how I think I should be interpreting the description:
> ===
> Since v1.1, we scale the regularization parameter lambda when solving
> each least squares problem.  When updating the user factors, we scale
> the regularization parameter by the total number of ratings from the
> user.  Similarly, when updating the product factors, we scale the
> regularization parameter by the total number of ratings for the
> product.
> ===



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-20072) Clarify ALS-WR documentation

2017-03-24 Thread Sean Owen (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-20072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939958#comment-15939958
 ] 

Sean Owen commented on SPARK-20072:
---

Have a read through the rest of the docs and if you collect a few more 
comments, OK let's make your suggested updates.

> Clarify ALS-WR documentation
> 
>
> Key: SPARK-20072
> URL: https://issues.apache.org/jira/browse/SPARK-20072
> Project: Spark
>  Issue Type: Improvement
>  Components: Documentation
>Affects Versions: 2.1.0
>Reporter: chris snow
>Priority: Trivial
>
> https://www.mail-archive.com/user@spark.apache.org/msg62590.html
> The documentation for collaborative filtering is as follows:
> ===
> Scaling of the regularization parameter
> Since v1.1, we scale the regularization parameter lambda in solving
> each least squares problem by the number of ratings the user generated
> in updating user factors, or the number of ratings the product
> received in updating product factors.
> ===
> I find this description confusing, probably because I lack a detailed
> understanding of ALS.   The wording suggest that the number of ratings
> change ("generated", "received") during solving the least squares.
> This is how I think I should be interpreting the description:
> ===
> Since v1.1, we scale the regularization parameter lambda when solving
> each least squares problem.  When updating the user factors, we scale
> the regularization parameter by the total number of ratings from the
> user.  Similarly, when updating the product factors, we scale the
> regularization parameter by the total number of ratings for the
> product.
> ===



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-20072) Clarify ALS-WR documentation

2017-03-23 Thread chris snow (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-20072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938477#comment-15938477
 ] 

chris snow commented on SPARK-20072:


Fair enough.  Though this did cause me some grief - I had read and re-read that 
paragraph a number of times before posting an email to the user group to verify 
if my understanding was correct.

I also appreciate that my suggested rewording could probably be improved a lot.

I fully understand if this ticket should be closed because it doesn't merit the 
effort processing it.

> Clarify ALS-WR documentation
> 
>
> Key: SPARK-20072
> URL: https://issues.apache.org/jira/browse/SPARK-20072
> Project: Spark
>  Issue Type: Improvement
>  Components: Documentation
>Affects Versions: 2.1.0
>Reporter: chris snow
>Priority: Trivial
>
> https://www.mail-archive.com/user@spark.apache.org/msg62590.html
> The documentation for collaborative filtering is as follows:
> ===
> Scaling of the regularization parameter
> Since v1.1, we scale the regularization parameter lambda in solving
> each least squares problem by the number of ratings the user generated
> in updating user factors, or the number of ratings the product
> received in updating product factors.
> ===
> I find this description confusing, probably because I lack a detailed
> understanding of ALS.   The wording suggest that the number of ratings
> change ("generated", "received") during solving the least squares.
> This is how I think I should be interpreting the description:
> ===
> Since v1.1, we scale the regularization parameter lambda when solving
> each least squares problem.  When updating the user factors, we scale
> the regularization parameter by the total number of ratings from the
> user.  Similarly, when updating the product factors, we scale the
> regularization parameter by the total number of ratings for the
> product.
> ===



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-20072) Clarify ALS-WR documentation

2017-03-23 Thread Sean Owen (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-20072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938415#comment-15938415
 ] 

Sean Owen commented on SPARK-20072:
---

I don't think those two wordings differ meaningfully? I think small suggestions 
are OK but weight the value vs overhead of processing these changes. This is 
pretty borderline.

> Clarify ALS-WR documentation
> 
>
> Key: SPARK-20072
> URL: https://issues.apache.org/jira/browse/SPARK-20072
> Project: Spark
>  Issue Type: Improvement
>  Components: Documentation
>Affects Versions: 2.1.0
>Reporter: chris snow
>Priority: Trivial
>
> https://www.mail-archive.com/user@spark.apache.org/msg62590.html
> The documentation for collaborative filtering is as follows:
> ===
> Scaling of the regularization parameter
> Since v1.1, we scale the regularization parameter lambda in solving
> each least squares problem by the number of ratings the user generated
> in updating user factors, or the number of ratings the product
> received in updating product factors.
> ===
> I find this description confusing, probably because I lack a detailed
> understanding of ALS.   The wording suggest that the number of ratings
> change ("generated", "received") during solving the least squares.
> This is how I think I should be interpreting the description:
> ===
> Since v1.1, we scale the regularization parameter lambda when solving
> each least squares problem.  When updating the user factors, we scale
> the regularization parameter by the total number of ratings from the
> user.  Similarly, when updating the product factors, we scale the
> regularization parameter by the total number of ratings for the
> product.
> ===



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org