On Tue, Jan 5, 2010 at 10:27 AM, Drew Farris <drew.far...@gmail.com> wrote:
> On Tue, Jan 5, 2010 at 1:08 PM, Jake Mannix <jake.man...@gmail.com> wrote:
>
> I assumed it could be done similarly to the way in which it is
> currently done for Vector implementations, but I'm surprised this even
> works in light of Hadoop's exact class matching. I'll have to crack
> open the Mahout code later to take a closer look.
>

It's hard to find where the ugly little pieces bite you - because you can
certainly have Mapper<IntWritable, Vector, Foo, Bar> (and we do - CanopyMapper
for instance), and if you unit test with DummyOutputCollector, you'll never see
the ugliness - it only rears its head once you whip out the SequenceFile.

See: HADOOP-5452 <http://issues.apache.org/jira/browse/HADOOP-5452> for more
details.

  -jake
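
To make the exact-class-matching point concrete, here is a minimal sketch that needs no Hadoop on the classpath. It assumes the writer compares the runtime class of each appended value against the declared class with `!=`, which is my understanding of what the SequenceFile writer does. StrictWriter, Vector and DenseVector below are hypothetical stand-ins, not the real Hadoop or Mahout types.

```java
import java.io.IOException;

public class ExactClassMatchDemo {

    /** Hypothetical stand-ins for an interface type and one concrete implementation. */
    interface Vector {}
    static final class DenseVector implements Vector {}

    /** Hypothetical writer that mimics an exact-class check on append (an assumption, not Hadoop code). */
    static final class StrictWriter {
        private final Class<?> valueClass;

        StrictWriter(Class<?> valueClass) {
            this.valueClass = valueClass;
        }

        void append(Object value) throws IOException {
            // Exact match only: an implementation or subclass of valueClass is rejected.
            if (value.getClass() != valueClass) {
                throw new IOException("wrong value class: " + value.getClass().getName()
                        + " is not " + valueClass.getName());
            }
        }
    }

    /** Returns true if the writer accepts the value, false if it rejects it. */
    static boolean accepts(StrictWriter writer, Object value) {
        try {
            writer.append(value);
            return true;
        } catch (IOException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        Vector v = new DenseVector();
        // Declared with the interface: rejected, since DenseVector.class != Vector.class.
        System.out.println(accepts(new StrictWriter(Vector.class), v));
        // Declared with the concrete class: accepted.
        System.out.println(accepts(new StrictWriter(DenseVector.class), v));
    }
}
```

A collector that only gathers objects in memory performs no such check, which would explain why a unit test against DummyOutputCollector passes while the same job fails once a real SequenceFile is involved.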