Re: Make Hadoop NetworkTopology and data locality more pluggable for other deploying topology like: virtualization.

2012-06-04 Thread Milind.Bhandarkar
That's great Junping. Hoping to see this in trunk / hadoop 2.0 and hadoop 1.1 soon. - milind On Jun 4, 2012, at 8:48 AM, Jun Ping Du wrote: Hello Folks, I just filed a Umbrella jira today to address current NetworkTopology issue that binding strictly to three tier network. The

Re: [RESULT] - [VOTE] Rename hadoop branches post hadoop-1.x

2012-04-03 Thread Milind.Bhandarkar
Arun, I am even more confused now than I was before: Here you say: Essentially 'trunk' is where incompatible changes *may* be committed (in future). We should allow for that. On another thread, responding to Avner (re: MAPREDUCE-4049?) you say, We do expect 'new features' to make it to

Re: [RESULT] - [VOTE] Rename hadoop branches post hadoop-1.x

2012-04-03 Thread Milind.Bhandarkar
Thanks ATM. I guess the *may* emphasis confused me. Just to get some more clarity: What would be guideline for a new feature, such as https://issues.apache.org/jira/browse/MAPREDUCE-4049, which maintains compatibility for 1.x, but is not relevant to trunk, because the codebases have completely

Re: [RESULT] - [VOTE] Rename hadoop branches post hadoop-1.x

2012-04-03 Thread Milind.Bhandarkar
To my knowledge, shuffle is already pluggable in 0.23 onwards, as long as it is used only by mapreduce framework. That's why Avner says : In parallel, I'll try to *learn what exists* in 0.23. (Emphasize my own.) That's why I was wondering about the insistence of committing to trunk first. -

Re: [RESULT] - [VOTE] Rename hadoop branches post hadoop-1.x

2012-04-03 Thread Milind.Bhandarkar
Great ! Thanks @atm, - milind On 4/3/12 3:21 PM, Aaron T. Myers a...@cloudera.com wrote: If that's the case then there doesn't seem to be any question here. The feature is in trunk, and an implementation could be done for an older release branch that would be compatible with that branch. Sure,

Yarn build error

2012-01-12 Thread Milind.Bhandarkar
Folks, I am getting the Error reading assemblies : No assembly descriptor found error while following the build instructions, and building with mvn clean install assembly:assembly -Pnative. Can you let me know how to fix this ? - Milind --- Milind Bhandarkar Greenplum Labs, EMC (Disclaimer:

Re: Trouble building 0.23 branch

2011-12-07 Thread Milind.Bhandarkar
Ralph, Looks like you are using a Mac to build. The native components of the build are not buildable on Mac. Disable native build by specifying -P-native. - Milind --- Milind Bhandarkar Greenplum Labs, EMC (Disclaimer: Opinions expressed in this email are those of the author, and do not

Re: Trouble building 0.23 branch

2011-12-07 Thread Milind.Bhandarkar
Ralph, I usually do mvn install -Dresolvers=internal in every directory. This publishes artifacts in ~/.m2 directory, and looks up ~/.m2 while resolving dependencies. - Milind On 12/7/11 11:42 AM, Ralph Castain r...@open-mpi.org wrote: Thanks! Getting closer now - hitting this block:

Re: MapReduce and MPI

2011-12-01 Thread Milind.Bhandarkar
Ralph, At the MPI Forum meeting at SC11, Jeff mentioned that C++ bindings are going to be dropped from the standard, and that no other language bindings were proposed. Do you think there is enough time for Java bindings to make it into the 3.0 standard ? - Milind On 12/1/11 3:31 AM, Ralph

Re: Map-Reduce without sorting

2011-10-19 Thread Milind.Bhandarkar
How is that different from specifying a comparator that always returns that k1 and k2 are equal regardless of k1 and k2 ? So, you will get only partitioning, and not sorting. - Milind On 10/19/11 2:42 PM, Zheng Shao zsh...@gmail.com wrote: Google's Tenzing paper mentioned that they modified

Re: Map-Reduce without sorting

2011-10-19 Thread Milind.Bhandarkar
Arun, From the Tenzing paper: quote Hash table based aggregation is common in RDBMS sys- tems. However, it is impossible to implement eciently on the basic MapReduce framework, since the reducer al- ways unnecessarily sorts the data by key. We enhanced the MapReduce framework to relax this

Yarn NodeManager services

2011-10-11 Thread Milind.Bhandarkar
Hi, As part of MPI implementation in Yarn (aka Hamster), I was looking at refactoring some of the functionality into node manager services, so that it can be reused by other frameworks too. (Based on the discussion with some folks led me to believe that suffle etc is also being planned as NM

Re: Yarn NodeManager services

2011-10-11 Thread Milind.Bhandarkar
This is cool. Exactly what I was looking for ! Thanks Chris Mahadev. - Milind On 10/11/11 2:33 PM, Chris Douglas cdoug...@apache.org wrote: On Tue, Oct 11, 2011 at 2:05 PM, milind.bhandar...@emc.com wrote: As part of MPI implementation in Yarn (aka Hamster), I was looking at refactoring

Re: hadoop-yarn-applications maven module

2011-10-07 Thread Milind.Bhandarkar
I think the intent of hadoop-tools was quite different. Hadoop-yarn-applications is the code for alternative (to MR) frameworks supported by Yarn. These are not hadoop-tools, any more than mapreduce is a hadoop-tool. - milind On 10/7/11 9:59 AM, Alejandro Abdelnur t...@cloudera.com wrote:

0.23 compilation failure on a Mac

2011-09-29 Thread Milind.Bhandarkar
Folks, While I am waiting for a Linux dev box (see my earlier emails about TestLargeBlock etc), I noticed that the linux-container-executor is being compiled on Mac, and is throwing errors, causing the build to fail every time. I do not need the linux-coniatiner-executor, since I am on on linux,

Re: 0.23 compilation failure on a Mac

2011-09-29 Thread Milind.Bhandarkar
Thanks. Trying it out now. - milind On 9/29/11 5:04 PM, Arun C Murthy a...@hortonworks.com wrote: Pass -P-cbuild to maven. Arun On Sep 29, 2011, at 5:02 PM, milind.bhandar...@emc.com wrote: Folks, While I am waiting for a Linux dev box (see my earlier emails about TestLargeBlock etc),

Re: 0.23 compilation failure on a Mac

2011-09-29 Thread Milind.Bhandarkar
Yeah, that worked. Thanks Arun. milind On 9/29/11 5:07 PM, Bhandarkar, Milind milind.bhandar...@emc.com wrote: Thanks. Trying it out now. - milind On 9/29/11 5:04 PM, Arun C Murthy a...@hortonworks.com wrote: Pass -P-cbuild to maven. Arun On Sep 29, 2011, at 5:02 PM,

Re: MRv1 in 0.23+

2011-09-08 Thread Milind.Bhandarkar
Thanks for the prompt response Eli. In effect, MRv1 is not supprted in 0.23 onwards, and so LTC should not be patched in those releases. (I searched the mailing lists, but could not see it said explicitly.) - Milind --- Milind Bhandarkar Greenplum Labs, EMC (Disclaimer: Opinions expressed in

MRv1 in 0.23+

2011-09-07 Thread Milind.Bhandarkar
Folks, Has the community decided how long MRv1 will remain part of the codebase, after 0.23 ? The reason I am asking is, for those who are working on forward porting LinuxTaskController fixes (from 0.20.2xx) to 0.22, will they have to patch 0.23 and trunk as well ? Or should these branches be

Re: Hadoop Tools Layout (was Re: DistCpV2 in 0.23)

2011-09-07 Thread Milind.Bhandarkar
+1 for separate hadoop-tools module. However, if a tool is broken at release time, and no one comes forward to fix it, it should be removed. (i.e. Unlike contrib modules, where build and test failures were tolerated.) - milind On 9/7/11 11:27 AM, Mahadev Konar maha...@hortonworks.com wrote: I

Re: Merge MR-279 (was Re: Status on MAPREDUCE-279 and plans forward)

2011-08-05 Thread Milind.Bhandarkar
Is it building for you after hadoop-common mavenization in trunk ? Hadoop common tests testHardLink and testFilterFileSystem are failing for me on mac, and Mr279 mapreduce depends on these test jars: [ERROR] Failed to execute goal on project yarn-api: Could not resolve dependencies for project