For any interested, I made a blog posting about this issue; perhaps it
will help elucidate the problem.
http://spectrallyclustered.wordpress.com/2010/06/16/sprint-1-getting-the-hang-of-mapreduce/
Thanks again!
Shannon
-------- Original Message --------
Subject: Re: M/R capturing line numbers in text files
Date: Wed, 16 Jun 2010 09:53:51 -0400
From: Shannon Quinn <[email protected]>
To: [email protected]
Perfect. Thank you.
Unfortunately, now I receive this exception:
java.io.IOException: wrong value class:
org.apache.mahout.math.hadoop.DistributedRowMatrix$MatrixEntryWritable
is not class org.apache.mahout.math.VectorWritable
My Mapper's value output and Reducer's input is a
DRM.MatrixEntryWritable, and is specified as such in the Conf object.
The Reducer's output is a VectorWritable. The stack trace doesn't
mention any code of mine, so I'm not sure how to approach this.
The basic problem is that something has produced data that uses a long as an
ID and your mapper is expecting an int. Have you posted your code as a
patch on the jira or a git link?
I attached a patch to my project's ticket on jira (363).
Thanks again!
Regards,
Shannon