[ 
https://issues.apache.org/jira/browse/MAHOUT-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13977367#comment-13977367
 ] 

Pat Ferrel commented on MAHOUT-1518:
------------------------------------

The same can be said of creating a new type with most if not all of the same 
operations. That will be a pain to maintain too, just not your pain. The 
semantics may be different enough to warrant it though.

I really can't judge one pain from the other. A new type, something like a 
'table' is fine I suppose. It will mean wrapping many of the matrix ops in a 
way that deals with the new type and I'd expect most users would deal with 
these wrappers. Users just want to transpose the user-item table and cluster 
its rows getting IDs they recognize so we should give them that.

BTW [~ssc]'s pdf is interesting. It rates Mahout as being very low on the 
usability scale. While we are making if fast, let's make it easy too and do 
everyone a favor.


> Preprocessing for collaborative filtering with the Scala DSL
> ------------------------------------------------------------
>
>                 Key: MAHOUT-1518
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1518
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Collaborative Filtering
>            Reporter: Sebastian Schelter
>            Assignee: Sebastian Schelter
>             Fix For: 1.0
>
>         Attachments: MAHOUT-1518.patch
>
>
> The aim here is to provide some easy-to-use machinery to enable the usage of 
> the new Cooccurrence Analysis code from MAHOUT-1464 with datasets represented 
> as follows in a CSV file with the schema _timestamp, userId, itemId, action_, 
> e.g.
> {code}
> timestamp1, userIdString1, itemIdString1, “view"
> timestamp2, userIdString2, itemIdString1, “like"
> {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to