Assuming I've done my own translation (I followed Ted's piece), how do I get
this into the rest of the RecJob? Right now, I have a NamedVector (the name is
the id of the from email address) and the cells are {0,1} for each message id
(1 if that user has interacted with that message id). In looking at the
RecommenderJob, it seems like I could skip the first couple of phases, but it
also seems like I need a DistributedRowMatrix as input for the next phase
(maybePruneAndTranspose). Is my understanding correct? If so, I guess I need to
convert my sequence file of NamedVectors into a DistributedRowMatrix?
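
For illustration, here is a minimal sketch of the translation step discussed in the quoted thread below: mapping string identifiers (from addresses, message ids) to numeric ids and back, and expanding a "from: msgId1, msgId3" line into the "user,item" lines that RecommenderJob expects. It is plain Java with no Mahout dependencies. The class and method names (IdTranslator, idFor, nameFor, expand) are hypothetical and are not part of Mahout, and the sequential-id scheme is an assumption, not what RecommenderJob does internally (per the thread, it hashes longs to ints).

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical helper: assigns sequential numeric ids to string identifiers. */
public class IdTranslator {
    private final Map<String, Long> toId = new HashMap<>();
    private final List<String> toName = new ArrayList<>();

    /** Returns the numeric id for a name, assigning the next free id on first use. */
    public long idFor(String name) {
        Long id = toId.get(name);
        if (id == null) {
            id = (long) toName.size();
            toId.put(name, id);
            toName.add(name);
        }
        return id;
    }

    /** Translates a numeric id back to the original identifier. */
    public String nameFor(long id) {
        return toName.get((int) id);
    }

    /**
     * Expands one "from: msgId1, msgId3" line into "userId,itemId" lines.
     * Everything before the first ':' is treated as the user identifier.
     * Lines without a ':' yield an empty list.
     */
    public static List<String> expand(String line, IdTranslator users, IdTranslator items) {
        List<String> out = new ArrayList<>();
        int colon = line.indexOf(':');
        if (colon < 0) {
            return out;
        }
        long userId = users.idFor(line.substring(0, colon).trim());
        for (String item : line.substring(colon + 1).split(",")) {
            String trimmed = item.trim();
            if (!trimmed.isEmpty()) {
                out.add(userId + "," + items.idFor(trimmed));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        IdTranslator users = new IdTranslator();
        IdTranslator items = new IdTranslator();
        for (String row : expand("alice@example.com: msgA, msgC", users, items)) {
            System.out.println(row);
        }
    }
}
```

Keeping separate translators for users and items means the two id spaces do not collide, and nameFor can be used afterwards to map recommended item ids back to message ids.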
On Aug 31, 2011, at 11:55 AM, Sean Owen wrote:
> Yes, I'm suggesting that could at least be 80% of what you need. If you can
> generalize that bit further and refactor it, all the better.
>
> I wouldn't bother necessarily extending to support the "user: item item
> item" syntax or else we'd get into supporting a lot of stuff. That
> conversion IMHO can be left to the caller.
>
> On Wed, Aug 31, 2011 at 4:52 PM, Grant Ingersoll wrote:
>
>>
>> On Aug 31, 2011, at 11:47 AM, Sean Owen wrote:
>>
>>> No it still wants "user,item[,rating]" input. But otherwise yes, it's
>>> translated and un-translated internally as needed.
>>>
>>> You could change the mapper to read that input easily though.
>>>
>>> It still wants numeric input. It's hashing longs to ints. But this could
>>> easily be changed to record a more general mapping.
>>
>> Ah, so I would still have to do the conversion, or hash on the string.
>>
>>>
>>> On Wed, Aug 31, 2011 at 4:44 PM, Grant Ingersoll wrote:
>>>
>>>>
>>>> On Aug 31, 2011, at 11:26 AM, Sean Owen wrote:
>>>>
>>>>> Is the problem not just a matter of "translating" from the original
>>>>> identifiers to ints, so they can be used as offsets into a vector, and
>>>>> then back again?
>>>>
>>>> Yeah, I was wondering about that when looking at the RecommenderJob.
>>>>
>>>> If I understand you right, I could just output lines of text as:
>>>> from: msgId1, msgId3, ... msgIdn
>>>> ...
>>>>
>>>> And the RecommenderJob would automatically do the translation?
>>>>
>>>>
>>
>>
--------------------------------------------
Grant Ingersoll
http://www.lucidimagination.com
Lucene Eurocon 2011: http://www.lucene-eurocon.com