[ 
https://issues.apache.org/jira/browse/SPARK-14409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15826186#comment-15826186
 ] 

Nick Pentreath commented on SPARK-14409:
----------------------------------------

Yes to be more clear, I would expect that the {{k}} param would be specified as 
in Danilo's version, for example. I do like the use of windowing to achieve the 
sort within each user.

This approach would also not work well with purely implicit data (unweighted). 
If everything is relevant in the ground truth then the model would score 
perfectly each time. It sort of works for the explicit rating case or the 
implicit case with "preference weights" since the ground truth then has an 
inherent ordering. 

Still I think the evaluator must be able to deal with the case of generating 
recommendations from the full item set. This means that the "label" and 
"prediction" columns could contains nulls.
e.g. where an item exists in the ground truth but is not recommended (hence no 
score), the "prediction" column would be null. While if an item is recommended 
but is not in ground truth, the "label" column would be null. See my comments 
in SPARK-13857 for details.

> Investigate adding a RankingEvaluator to ML
> -------------------------------------------
>
>                 Key: SPARK-14409
>                 URL: https://issues.apache.org/jira/browse/SPARK-14409
>             Project: Spark
>          Issue Type: New Feature
>          Components: ML
>            Reporter: Nick Pentreath
>            Priority: Minor
>
> {{mllib.evaluation}} contains a {{RankingMetrics}} class, while there is no 
> {{RankingEvaluator}} in {{ml.evaluation}}. Such an evaluator can be useful 
> for recommendation evaluation (and can be useful in other settings 
> potentially).
> Should be thought about in conjunction with adding the "recommendAll" methods 
> in SPARK-13857, so that top-k ranking metrics can be used in cross-validators.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to