For this purpose, yes, just do your own translation: hash each text key down into a 32-bit int.
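
A minimal sketch of that translation, in case it helps. The class name `StringIdMapper` and its methods are hypothetical (nothing here is part of Mahout), and CRC32 is just one convenient 32-bit hash from the JDK; any stable hash would do. It also keeps a reverse map so the numeric IDs in the recommender output can be turned back into the original strings.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;
import java.util.zip.CRC32;

// Hypothetical helper, not a Mahout class: maps text keys to 32-bit ints.
public class StringIdMapper {
    // Reverse lookup so results can be translated back to the text keys.
    private final Map<Integer, String> reverse = new HashMap<>();

    // Hash a text key (e.g. "x97wfm") down into a 32-bit int.
    public int toIntId(String key) {
        CRC32 crc = new CRC32();
        crc.update(key.getBytes(StandardCharsets.UTF_8));
        // getValue() returns the checksum in the low 32 bits of a long;
        // the cast keeps those bits, so the result may be negative.
        int id = (int) crc.getValue();
        String previous = reverse.putIfAbsent(id, key);
        if (previous != null && !previous.equals(key)) {
            // Two different keys hashed to the same int.
            throw new IllegalStateException(
                "Hash collision: '" + previous + "' and '" + key + "' -> " + id);
        }
        return id;
    }

    // Translate a numeric ID back to the original key, or null if unseen.
    public String toStringId(int id) {
        return reverse.get(id);
    }
}
```

Collisions are possible with any 32-bit hash and become likely once the number of distinct keys reaches the tens of thousands, so the sketch fails loudly rather than silently merging two users or items. For a large dataset the in-memory reverse map would have to be replaced by something written out alongside the converted input.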

On Mon, Aug 16, 2010 at 5:51 PM, Jeff Heuer <[email protected]> wrote:
> Hello,
>
> I have a dataset of users and items where those objects are identified by
> text strings (e.g. "x97wfm"), rather than a numeric ID. What would be the
> best approach to processing this data using RecommenderJob
> (org.apache.mahout.cf.taste.hadoop.item.RecommenderJob)? Is there any
> support within Mahout (or Hadoop) to do that sort of transformation, or do I
> have to write my own job as an input to the process which would convert the
> text keys to numeric IDs manually? Any advice is appreciated.
>
> Thanks,
>
> Jeff
