I presume that Robin's rework addresses this, right? ---------- Forwarded message ---------- From: Chris K Wensel <[email protected]> Date: Tue, May 25, 2010 at 12:43 PM Subject: Re: Moving to new Hadoop APIs To: Ted Dunning <[email protected]>
you guys need to update the mail list page. still sends to lucene.apache.org On May 25, 2010, at 12:40 PM, Ted Dunning wrote: Thanks. On Tue, May 25, 2010 at 12:20 PM, Chris K Wensel <[email protected]> wrote: > I'm not on the list. > > here is my opinion on the new apis > > http://groups.google.com/group/cascading-user/browse_thread/thread/4dc26b68401bbc0f# > > and here > > http://stackoverflow.com/questions/2855167/which-hadoop-api-version-should-i-use/2859863#2859863 > > I can reply to the list, but am rushing out of the office. let me know. > > ckw > > On May 25, 2010, at 11:45 AM, Ted Dunning wrote: > > > Chris, > > Can you comment on this? > > ---------- Forwarded message ---------- > From: Sean Owen <[email protected]> > Date: Tue, May 25, 2010 at 11:37 AM > Subject: Re: Moving to new Hadoop APIs > To: Mahout Dev List <[email protected]>, Ankur Goel < > [email protected]> > > > First snag I've hit -- the new APIs don't include an equivalent of > MultipleInputs, which is the primary support for mapping several input > paths. Anyone happen to know the equivalent (esp. Ankur)? This could > be a bear to reproduce on our own. > > Sean > > On Tue, May 25, 2010 at 3:17 PM, Sean Owen <[email protected]> wrote: > > Just to state what seems to be in progress -- looks like we are agreed > > we should move to the new Hadoop APIs. Some code is already using it; > > most of the part that isn't is the recommender which was due to some > > strange bugs deep in Hadoop in prior versions. It's time to try it > > again. I'm going to work on porting everything forward now. > > > > The other argument against this was that Amazon EMR runs 0.18.3. I > > think Jeff already established that what we're doing has already > > broken compatibility with 0.18.x. We can point those users to release > > 0.3 and say they can try to back-port that code to 0.18.x > > compatibility. But 0.4 onwards is for 0.20.x+ and you can run your own > > cluster using AWS and hopefully EMR updates soon. > > > > Sean > > > > > -- > Chris K Wensel > [email protected] > http://www.concurrentinc.com > > -- Chris K Wensel [email protected] http://www.concurrentinc.com
