[ 
https://issues.apache.org/jira/browse/MADLIB-1129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16167279#comment-16167279
 ] 

Nandish Jayaram commented on MADLIB-1129:
-----------------------------------------

Himanshu, 
Since you are working on including more distance functions for kNN, I thought
extending that to the output layer might also be useful. Right now, it looks 
like
MADlib does a simple average of the k-nearest neighbors to come up with the
final value for both classification and regression. Doing a weighted average 
instead
might be a desirable functionality. The weighting for the average can be based 
on the
distance of the k-nearest neighbors.
We can probably provide an optional parameter to let users choose how the final
classification label or regression score has to be computed (avg or weighted 
avg).
[~fmcquillan] any thoughts?

> Additional output information for k-NN
> --------------------------------------
>
>                 Key: MADLIB-1129
>                 URL: https://issues.apache.org/jira/browse/MADLIB-1129
>             Project: Apache MADlib
>          Issue Type: Improvement
>          Components: k-NN
>            Reporter: Frank McQuillan
>            Assignee: Himanshu Pandey
>            Priority: Minor
>              Labels: starter
>             Fix For: v2.0
>
>
> Follow on to
> https://issues.apache.org/jira/browse/MADLIB-927
> List the k-nearest neighbors that were used in the voting/averaging, sorted 
> in ASC order according to the distance function used.  This could be added to 
> the current output table.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to