We have historically tracked the latest versions of Hadoop pretty soon after they have been available. If the tests run on 0.21 and it has the CompositeInputFormat then I'd be +1 to move forward. Hopefully there will be a Cloudera version that tracks it pretty soon too, else users will have to build their own AMIs again.
-----Original Message----- From: Shannon Quinn (JIRA) [mailto:[email protected]] Sent: Thursday, November 04, 2010 12:27 PM To: [email protected] Subject: [jira] Commented: (MAHOUT-537) Bring DistributedRowMatrix into compliance with Hadoop 0.20.2 [ https://issues.apache.org/jira/browse/MAHOUT-537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12928314#action_12928314 ] Shannon Quinn commented on MAHOUT-537: -------------------------------------- Something worth discussing: Hadoop just released version 0.21.0, which re-includes the updated CompositeInputFormat that was missing in 0.20.2 and deprecated in 0.18. I'm going to install v0.21 and see if tests pass on the trunk, but provided they do then I'm wondering if I should go ahead and implement this patch using Hadoop 0.21. Any thoughts? > Bring DistributedRowMatrix into compliance with Hadoop 0.20.2 > ------------------------------------------------------------- > > Key: MAHOUT-537 > URL: https://issues.apache.org/jira/browse/MAHOUT-537 > Project: Mahout > Issue Type: Improvement > Affects Versions: 0.4 > Reporter: Shannon Quinn > Assignee: Shannon Quinn > Attachments: MAHOUT-537.patch > > > Convert the current DistributedRowMatrix to use the newer Hadoop 0.20.2 API, > in particular eliminate dependence on the deprecated JobConf, using instead > the separate Job and Configuration objects. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
