> On 2012-02-02 07:59:13, Mohammad Islam wrote:
> > So it is WIP. right?
> >
> > So we are now bundling the multiple hadoop jars. As the new version of
> > hadoop comes, we will need to add a new pom file and associated changes in
> > other pom files.
> > Moreover (in some cases) to support a newly released version we might need
> > to release a new oozie version with only new hadoop packages.
> > In addition, oozie tarball size will increase substantially.
> >
> > I think we might need to be a little cautious on this.
> >
>
> Alejandro Abdelnur wrote:
> Mohammed,
>
> The patch is not WIP, it is ready for commit.
>
> I assume you mean that it will be evolving, adding more Hadoop versions as
> they come; if that is what you meant, you are correct.
>
> When a new version of Hadoop comes and we want to support it in Oozie, we
> would add 1 new set of hadoop-client/hadoop-test for that Hadoop version in
> Oozie's hadooplibs, correct. No other changes in the Oozie POMs to use it
> other than setting the desired version from the command line when running
> Maven '-Dhadoop.version=#####'.
>
> As we stop supporting builds/tests for a particular version of Hadoop we
> would remove the corresponding hadoop-client/hadoop-test modules from
> hadooplibs.
>
> Apache projects are Open Source projects and the official distribution of
> a release should be a source tarball. Note that in the source form this does
> not increase the size of the TAR in a considerable way (just a few KBs for
> the 2 new POMs).
>
> For convenience, many projects also include binary distributions. You are
> correct that the more Hadoop versions we have in hadooplibs, the bigger will
> be the size of the binary distribution. We have different options for this:
>
> 1* The binary distribution does not include/bundle/produce hadooplibs.
> The size of the Oozie TAR will remain constant. This is what we have
> today, the user is responsible for harvesting the required Hadoop JARs and
> can use the hadooplib poms as a guidance.
> 2* The binary distribution would be multiple TARs, one for Oozie proper
> and one for each hadooplib. The size of the Oozie TAR will remain constant,
> the user will have to download the corresponding hadooplib tar.
> 3* Bundle all hadooplibs as part of the binary distribution. (As you've
> mentioned) The size of the TAR will increase.
>
> Doing #1 the binary TAR is not different from today's.
> Doing #2 the binary TAR is not different from today's, you have
> additional Oozie hadooplib tars.
> Doing #3 the binary TAR will be bigger than today's.
>
> We can do #1, thus our concerns are addressed and our build easily
> handles multiple versions of Hadoop.
>
> Thoughts?
>
> Thanks
>
> Alejandro
>
>
> Mohammad Islam wrote:
> I prefer both option #1 and option #.
> But Option #2 will be better for easy/quick installation.
> So pls go ahead with option #2.
>
> One question: do we need to add a pom for every new hadoop (major/minor)
> release (although there are no changes in the jar files)?
>
Hi Alejandro,

From the 22 perspective there would not be any client artifact, so we need to
do something which will work in both scenarios, with and without hadoop
client artifacts.

Thanks,
Mayank


- Mayank


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/3726/#review4764
-----------------------------------------------------------


On 2012-02-02 22:37:20, Alejandro Abdelnur wrote:
>
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/3726/
> -----------------------------------------------------------
>
> (Updated 2012-02-02 22:37:20)
>
>
> Review request for oozie.
>
>
> Summary
> -------
>
> The current mechanism allows us to easily package/use new versions of hadoop
> without complicating Oozie's POMs.
>
> mr1 & mr2 profiles are gone from oozie, now it refers to one of the versions
> in hadooplibs.
>
> New maven modules under hadooplib define the hadoop-client/hadoop-test POMs
> for different hadoop versions.
>
> Note that because of HADOOP-8009, hadoop will start providing a hadoop-client
> artifact (even for already released versions), still we'll need the
> corresponding hadooplibs module to be able to use the assembly (as it is done
> as part of this patch) to pull into the oozie distro the JARs for the
> supported/tested versions of Hadoop required for the client side.
>
> Note that this can be used to simplify the logic of addtowar.sh that won't
> have to be aware of the JARs deps of different versions of Hadoop or of
> hadoop JARs at all.
>
>
> This addresses bug OOZIE-667.
>     https://issues.apache.org/jira/browse/OOZIE-667
>
>
> Diffs
> -----
>
>   trunk/core/pom.xml 1239887
>   trunk/core/src/test/java/org/apache/oozie/test/XTestCase.java 1239887
>   trunk/docs/src/site/twiki/AG_Install.twiki 1239887
>   trunk/docs/src/site/twiki/DG_QuickStart.twiki 1239887
>   trunk/examples/pom.xml 1239887
>   trunk/hadooplibs/hadoop-0_23_1/pom.xml PRE-CREATION
>   trunk/hadooplibs/hadoop-0_24_0/pom.xml PRE-CREATION
>   trunk/hadooplibs/hadoop-1_0_0/pom.xml PRE-CREATION
>   trunk/hadooplibs/hadoop-test-0_23_1/pom.xml PRE-CREATION
>   trunk/hadooplibs/hadoop-test-0_24_0/pom.xml PRE-CREATION
>   trunk/hadooplibs/hadoop-test-1_0_0/pom.xml PRE-CREATION
>   trunk/hadooplibs/pom.xml PRE-CREATION
>   trunk/pom.xml 1239887
>   trunk/sharelib/pom.xml 1239887
>   trunk/sharelib/streaming/pom.xml 1239887
>   trunk/src/main/assemblies/hadooplib.xml PRE-CREATION
>   trunk/src/main/assemblies/hadooplibs.xml PRE-CREATION
>   trunk/src/main/assemblies/partial-sharelib.xml 1239887
>   trunk/webapp/pom.xml 1239887
>
> Diff: https://reviews.apache.org/r/3726/diff
>
>
> Testing
> -------
>
> Tested with 1.0.0 and 0.24.0-SNAPSHOTs (several testcase failures here but
> that is still work in progress for the mr2 integration)
>
>
> Thanks,
>
> Alejandro
>
>
