[
https://issues.apache.org/jira/browse/MAHOUT-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13977367#comment-13977367
]
Pat Ferrel commented on MAHOUT-1518:
------------------------------------
The same can be said of creating a new type with most if not all of the same
operations. That will be a pain to maintain too, just not your pain. The
semantics may be different enough to warrant it though.
I really can't judge one pain against the other. A new type, something like a
'table', is fine I suppose. It will mean wrapping many of the matrix ops so
that they deal with the new type, and I'd expect most users would work with
these wrappers. Users just want to transpose the user-item table and cluster
its rows, getting back IDs they recognize, so we should give them that.
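To make the ID point concrete, here is a rough sketch in plain Scala of what such a wrapper would have to carry around. It does not use the Mahout DSL or the attached patch; `Interaction`, `IdDictionary`, `parseLine`, `userItemRows` and `transpose` are hypothetical names made up for illustration, and the third sample line is invented test data. It assumes the _timestamp, userId, itemId, action_ schema from the issue description.

```scala
import scala.collection.mutable

// Hypothetical sketch only: none of these names come from Mahout.
object IdMappingSketch {
  final case class Interaction(timestamp: String, userId: String, itemId: String, action: String)

  // Maps external string IDs to dense indices and back again.
  final class IdDictionary {
    private val toIndex = mutable.HashMap.empty[String, Int]
    private val toId = mutable.ArrayBuffer.empty[String]

    def indexOf(id: String): Int =
      toIndex.getOrElseUpdate(id, { toId += id; toId.size - 1 })
    def idOf(index: Int): String = toId(index)
    def size: Int = toId.size
  }

  // Splits one CSV line into its four fields; malformed lines yield None.
  def parseLine(line: String): Option[Interaction] =
    line.split(",").map(_.trim.stripPrefix("\"").stripSuffix("\"")) match {
      case Array(ts, user, item, action) => Some(Interaction(ts, user, item, action))
      case _ => None
    }

  // Sparse user-item rows for a single action: user index -> item indices.
  def userItemRows(interactions: Seq[Interaction], action: String,
                   users: IdDictionary, items: IdDictionary): Map[Int, Set[Int]] =
    interactions.filter(_.action == action)
      .groupBy(i => users.indexOf(i.userId))
      .map { case (u, is) => u -> is.map(i => items.indexOf(i.itemId)).toSet }

  // Swaps rows and columns of the sparse representation.
  def transpose(rows: Map[Int, Set[Int]]): Map[Int, Set[Int]] =
    rows.toSeq
      .flatMap { case (r, cols) => cols.toSeq.map(c => c -> r) }
      .groupBy(_._1)
      .map { case (c, pairs) => c -> pairs.map(_._2).toSet }

  val lines = Seq(
    "timestamp1, userIdString1, itemIdString1, \"view\"",
    "timestamp2, userIdString2, itemIdString1, \"like\"",
    "timestamp3, userIdString2, itemIdString2, \"view\"") // invented sample row

  val users = new IdDictionary
  val items = new IdDictionary
  val rows = userItemRows(lines.flatMap(parseLine), "view", users, items)

  // Item-by-user view, translated back to the IDs the user supplied.
  val byItem: Map[String, Set[String]] =
    transpose(rows).map { case (i, us) => items.idOf(i) -> us.map(users.idOf) }
}
```

The dictionaries have to travel with the matrix through every op (transpose swaps which dictionary labels the rows), which is the maintenance cost being weighed above.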
BTW [~ssc]'s pdf is interesting. It rates Mahout as being very low on the
usability scale. While we are making it fast, let's make it easy too and do
everyone a favor.
> Preprocessing for collaborative filtering with the Scala DSL
> ------------------------------------------------------------
>
> Key: MAHOUT-1518
> URL: https://issues.apache.org/jira/browse/MAHOUT-1518
> Project: Mahout
> Issue Type: New Feature
> Components: Collaborative Filtering
> Reporter: Sebastian Schelter
> Assignee: Sebastian Schelter
> Fix For: 1.0
>
> Attachments: MAHOUT-1518.patch
>
>
> The aim here is to provide some easy-to-use machinery to enable the usage of
> the new Cooccurrence Analysis code from MAHOUT-1464 with datasets represented
> as follows in a CSV file with the schema _timestamp, userId, itemId, action_,
> e.g.
> {code}
> timestamp1, userIdString1, itemIdString1, "view"
> timestamp2, userIdString2, itemIdString1, "like"
> {code}