[ https://issues.apache.org/jira/browse/MAHOUT-1539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14387993#comment-14387993 ]

Andrew Musselman commented on MAHOUT-1539:
------------------------------------------

I'd say start with something commonly used, like vectors.

Please make a pull request as soon as you can so we can look at actual code 
rather than just concepts, then develop from there.

> Implement affinity matrix computation in Mahout DSL
> ---------------------------------------------------
>
>                 Key: MAHOUT-1539
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1539
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Clustering
>    Affects Versions: 0.9
>            Reporter: Shannon Quinn
>            Assignee: Shannon Quinn
>              Labels: DSL, scala, spark
>             Fix For: 0.10.1
>
>         Attachments: ComputeAffinities.scala
>
>
> This has the same goal as MAHOUT-1506, but rather than code the pairwise 
> computations in MapReduce, this will be done in the Mahout DSL.
> An orthogonal issue is the format of the raw input (vectors, text, images, 
> SequenceFiles), and how the user specifies the distance equation and any 
> associated parameters.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
