[ https://issues.apache.org/jira/browse/MADLIB-1181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Frank McQuillan updated MADLIB-1181:
------------------------------------
    Description: 
Follow on from 
https://issues.apache.org/jira/browse/MADLIB-1059
(please see this JIRA for additional comments)

MADlib takes a simple average of the k-nearest neighbors to come up with the
final value for classification and regression. A weighted average instead
might be desirable functionality. The weights can be based on the distance
of each of the k-nearest neighbors.

We can probably provide an optional parameter that lets users choose how the
final score is computed (avg or weighted avg).

This JIRA applies to classification and regression.
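
To make the proposal concrete, here is a minimal Python sketch of how an
optional flag could switch between a simple and a distance-weighted result.
It is not MADlib code: the function name knn_predict, the weighted flag, and
the choice of inverse-distance weights (1 / distance) are all assumptions made
for illustration, since this JIRA only says the weighting can be based on
distance. For classification the weighted case is sketched as weighted voting.

```python
from collections import defaultdict


def knn_predict(neighbors, task="regression", weighted=False, eps=1e-12):
    """Combine the k nearest neighbors into a single prediction.

    neighbors: list of (distance, value) pairs for neighbors already found.
    weighted:  hypothetical option; False gives every neighbor equal weight,
               True uses inverse-distance weights (an assumed scheme).
    eps:       small constant that avoids division by zero on exact matches.
    """
    if not neighbors:
        raise ValueError("at least one neighbor is required")

    if weighted:
        weights = [1.0 / (dist + eps) for dist, _ in neighbors]
    else:
        weights = [1.0] * len(neighbors)

    if task == "regression":
        # Weighted mean; reduces to the simple average for equal weights.
        numerator = sum(w * value for w, (_, value) in zip(weights, neighbors))
        return numerator / sum(weights)

    if task == "classification":
        # Weighted vote; reduces to majority voting for equal weights.
        votes = defaultdict(float)
        for w, (_, label) in zip(weights, neighbors):
            votes[label] += w
        return max(votes, key=votes.get)

    raise ValueError("task must be 'regression' or 'classification'")


# Two far neighbors labeled "b" outvote one close neighbor labeled "a"
# unless the votes are weighted by distance.
labeled = [(0.5, "a"), (2.0, "b"), (2.0, "b")]
print(knn_predict(labeled, task="classification"))                 # b
print(knn_predict(labeled, task="classification", weighted=True))  # a
```

With weighted=False the sketch matches the current behavior described above,
so such a parameter could default to the simple average.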



  was:
Follow on from 
https://issues.apache.org/jira/browse/MADLIB-1059
(please see this JIRA for additional comments)

MADlib takes a simple average of the k-nearest neighbors to come up with the
final value for classification and regression. A weighted average instead
might be desirable functionality. The weights can be based on the distance
of each of the k-nearest neighbors.

We can probably provide an optional parameter that lets users choose how the
final score is computed (avg or weighted avg).

This JIRA applies to regression only, not classification, because
classification uses majority voting.




> Add an option for weighted average in k-NN
> ------------------------------------------
>
>                 Key: MADLIB-1181
>                 URL: https://issues.apache.org/jira/browse/MADLIB-1181
>             Project: Apache MADlib
>          Issue Type: Improvement
>          Components: k-NN
>            Reporter: Frank McQuillan
>            Assignee: Himanshu Pandey
>            Priority: Minor
>             Fix For: v1.14
>
>
> Follow on from 
> https://issues.apache.org/jira/browse/MADLIB-1059
> (please see this JIRA for additional comments)
> MADlib takes a simple average of the k-nearest neighbors to come up with the
> final value for classification and regression. A weighted average instead
> might be desirable functionality. The weights can be based on the distance
> of each of the k-nearest neighbors.
> We can probably provide an optional parameter that lets users choose how the
> final score is computed (avg or weighted avg).
> This JIRA applies to classification and regression.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
