[
https://issues.apache.org/jira/browse/MAHOUT-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13977285#comment-13977285
]
Pat Ferrel commented on MAHOUT-1518:
------------------------------------
[~dlyubimov] Don't be too literal. Since there is drm inside the object it's a
matrix in every sense that a drm is, right? The matrix operations could be
defined to work on this new data type, indexed row and column lookup can be
added when needed, without loading all into core. It's an example and not
complete.
I'd favor allowing the matrix ops to work on this but even if it's a new
wrapper "table" api instead of the the actual drm DSL that's OK too. To me the
first goal is exactly what Sebastian did, make import/export easy. If it
becomes a 1st class universally (in Mahout anyway) supported rdd-style object
even better.
> Preprocessing for collaborative filtering with the Scala DSL
> ------------------------------------------------------------
>
> Key: MAHOUT-1518
> URL: https://issues.apache.org/jira/browse/MAHOUT-1518
> Project: Mahout
> Issue Type: New Feature
> Components: Collaborative Filtering
> Reporter: Sebastian Schelter
> Assignee: Sebastian Schelter
> Fix For: 1.0
>
> Attachments: MAHOUT-1518.patch
>
>
> The aim here is to provide some easy-to-use machinery to enable the usage of
> the new Cooccurrence Analysis code from MAHOUT-1464 with datasets represented
> as follows in a CSV file with the schema _timestamp, userId, itemId, action_,
> e.g.
> {code}
> timestamp1, userIdString1, itemIdString1, “view"
> timestamp2, userIdString2, itemIdString1, “like"
> {code}
--
This message was sent by Atlassian JIRA
(v6.2#6252)