On Aug 31, 2011, at 11:47 AM, Sean Owen wrote: > No it still wants "user,item[,rating]" input. But otherwise yes, it's > translated and un-translated internally as needed. > > You could change the mapper to read that input easily though. > > it still wants numeric input. It's hashing longs to ints. But this could > easily be changed to record a more general mapping.
Ah, so I would still have to do the conversion, or hash on the string. > > On Wed, Aug 31, 2011 at 4:44 PM, Grant Ingersoll <[email protected]>wrote: > >> >> On Aug 31, 2011, at 11:26 AM, Sean Owen wrote: >> >>> Is the problem not just a matter of "translating" from the original >>> identifiers to ints, so they can be used as offsets into a vector, and >> then >>> back again? >> >> Yeah, I was wondering about that when looking at the RecommenderJob. >> >> If I understand you right, I could just output lines of text as: >> from: msgId1, msgId3, ... msgIdn >> ... >> >> And the RecommenderJob would automatically do the translation? >> >>
