Makes sense. I've cherry-picked the commits in branch-2 that were missed in branch-2.10.
Jonathan Hung On Wed, Apr 15, 2020 at 2:25 AM Akira Ajisaka <aajis...@apache.org> wrote: > Hi folks, > > I am still seeing some changes are being committed to branch-2. > I'd like to delete the source code from branch-2 to avoid mistakes. > https://issues.apache.org/jira/browse/HADOOP-16988 > > -Akira > > On Wed, Jan 1, 2020 at 2:38 AM Ayush Saxena <ayush...@gmail.com> wrote: > >> Hi Jim, >> Thanx for catching, I have configured the build to run on branch-2.10. >> >> -Ayush >> >> On Tue, 31 Dec 2019 at 22:50, Jim Brennan <james.bren...@verizonmedia.com> >> wrote: >> >>> It looks like QBT tests are still being run on branch-2 ( >>> https://builds.apache.org/view/H-L/view/Hadoop/job/hadoop-qbt-branch2-java7-linux-x86/), >>> and they are not very helpful at this point. >>> Can we change the QBT tests to run against branch-2.10 instead? >>> >>> Jim >>> >>> On Mon, Dec 23, 2019 at 7:44 PM Akira Ajisaka <aajis...@apache.org> >>> wrote: >>> >>>> Thank you, Ayush. >>>> >>>> I understand we should keep branch-2 as is, as well as master. >>>> >>>> -Akira >>>> >>>> >>>> On Mon, Dec 23, 2019 at 9:14 PM Ayush Saxena <ayush...@gmail.com> >>>> wrote: >>>> >>>> > Hi Akira >>>> > Seems there was an INFRA ticket for that. INFRA-19581, >>>> > But the INFRA people closed as wont do and yes, the branch is >>>> protected, >>>> > we can’t delete it directly. >>>> > >>>> > Ref: https://issues.apache.org/jira/browse/INFRA-19581 >>>> > >>>> > -Ayush >>>> > >>>> > On 23-Dec-2019, at 5:03 PM, Akira Ajisaka <aajis...@apache.org> >>>> wrote: >>>> > >>>> > Thank you for your work, Jonathan. >>>> > >>>> > I found branch-2 has been unintentionally pushed again. Would you >>>> remove >>>> > it? >>>> > I think the branch should be protected if possible. >>>> > >>>> > -Akira >>>> > >>>> > On Tue, Dec 10, 2019 at 5:17 AM Jonathan Hung <jyhung2...@gmail.com> >>>> > wrote: >>>> > >>>> > It's done. The new commit chain is: trunk -> branch-3.2 -> branch-3.1 >>>> -> >>>> > >>>> > branch-2.10 -> branch-2.9 -> branch-2.8 (branch-2 no longer exists, >>>> please >>>> > >>>> > don't try to commit to it) >>>> > >>>> > >>>> > Completed procedure: >>>> > >>>> > >>>> > - Verified everything in old branch-2.10 was in old branch-2 >>>> > >>>> > - Delete old branch-2.10 >>>> > >>>> > - Rename branch-2 to (new) branch-2.10 >>>> > >>>> > - Set version in new branch-2.10 to 2.10.1-SNAPSHOT >>>> > >>>> > - Renamed fix versions from 2.11.0 to 2.10.1 >>>> > >>>> > - Removed 2.11.0 as a version in HADOOP/YARN/HDFS/MAPREDUCE >>>> > >>>> > >>>> > >>>> > Jonathan Hung >>>> > >>>> > >>>> > >>>> > On Wed, Dec 4, 2019 at 10:55 AM Jonathan Hung <jyhung2...@gmail.com> >>>> > >>>> > wrote: >>>> > >>>> > >>>> > FYI, starting the rename process, beginning with INFRA-19521. >>>> > >>>> > >>>> > Jonathan Hung >>>> > >>>> > >>>> > >>>> > On Wed, Nov 27, 2019 at 12:15 PM Konstantin Shvachko < >>>> > >>>> > shv.had...@gmail.com> >>>> > >>>> > wrote: >>>> > >>>> > >>>> > Hey guys, >>>> > >>>> > >>>> > I think we diverged a bit from the initial topic of this discussion, >>>> > >>>> > which is removing branch-2.10, and changing the version of branch-2 >>>> from >>>> > >>>> > 2.11.0-SNAPSHOT to 2.10.1-SNAPSHOT. >>>> > >>>> > Sounds like the subject line for this thread "Making 2.10 the last >>>> minor >>>> > >>>> > 2.x release" confused people. >>>> > >>>> > It is in fact a wider matter that can be discussed when somebody >>>> > >>>> > actually >>>> > >>>> > proposes to release 2.11, which I understand nobody does at the >>>> moment. >>>> > >>>> > >>>> > So if anybody objects removing branch-2.10 please make an argument. >>>> > >>>> > Otherwise we should go ahead and just do it next week. >>>> > >>>> > I see people still struggling to keep branch-2 and branch-2.10 in >>>> sync. >>>> > >>>> > >>>> > Thanks, >>>> > >>>> > --Konstantin >>>> > >>>> > >>>> > On Thu, Nov 21, 2019 at 3:49 PM Jonathan Hung <jyhung2...@gmail.com> >>>> > >>>> > wrote: >>>> > >>>> > >>>> > Thanks for the detailed thoughts, everyone. >>>> > >>>> > >>>> > Eric (Badger), my understanding is the same as yours re. minor vs >>>> patch >>>> > >>>> > releases. As for putting features into minor/patch releases, if we >>>> > >>>> > keep the >>>> > >>>> > convention of putting new features only into minor releases, my >>>> > >>>> > assumption >>>> > >>>> > is still that it's unlikely people will want to get them into branch-2 >>>> > >>>> > (based on the 2.10.0 release process). For the java 11 issue, we >>>> > >>>> > haven't >>>> > >>>> > even really removed support for java 7 in branch-2 (much less java 8), >>>> > >>>> > so I >>>> > >>>> > feel moving to java 11 would go along with a move to branch 3. And as >>>> > >>>> > you >>>> > >>>> > mentioned, if people really want to use java 11 on branch-2, we can >>>> > >>>> > always >>>> > >>>> > revive branch-2. But for now I think the convenience of not needing to >>>> > >>>> > port >>>> > >>>> > to both branch-2 and branch-2.10 (and below) outweighs the cost of >>>> > >>>> > potentially needing to revive branch-2. >>>> > >>>> > >>>> > Jonathan Hung >>>> > >>>> > >>>> > >>>> > On Wed, Nov 20, 2019 at 10:50 AM Eric Yang <ey...@cloudera.com> >>>> wrote: >>>> > >>>> > >>>> > +1 for 2.10.x as last release for 2.x version. >>>> > >>>> > >>>> > Software would become more compatible when more companies stress test >>>> > >>>> > the same software and making improvements in trunk. Some may be extra >>>> > >>>> > caution on moving up the version because obligation internally to keep >>>> > >>>> > things running. Company obligation should not be the driving force to >>>> > >>>> > maintain Hadoop branches. There is no proper collaboration in the >>>> > >>>> > community when every name brand company maintains its own Hadoop 2.x >>>> > >>>> > version. I think it would be more healthy for the community to >>>> > >>>> > reduce the >>>> > >>>> > branch forking and spend energy on trunk to harden the software. >>>> > >>>> > This will >>>> > >>>> > give more confidence to move up the version than trying to fix n >>>> > >>>> > permutations breakage like Flash fixing the timeline. >>>> > >>>> > >>>> > Apache license stated, there is no warranty of any kind for code >>>> > >>>> > contributions. Fewer community release process should improve >>>> > >>>> > software >>>> > >>>> > quality when eyes are on trunk, and help steering toward the same end >>>> > >>>> > goals. >>>> > >>>> > >>>> > regards, >>>> > >>>> > Eric >>>> > >>>> > >>>> > >>>> > >>>> > On Tue, Nov 19, 2019 at 3:03 PM Eric Badger >>>> > >>>> > <ebad...@verizonmedia.com.invalid> wrote: >>>> > >>>> > >>>> > Hello all, >>>> > >>>> > >>>> > Is it written anywhere what the difference is between a minor release >>>> > >>>> > and a >>>> > >>>> > point/dot/maintenance (I'll use "point" from here on out) release? I >>>> > >>>> > have >>>> > >>>> > looked around and I can't find anything other than some compatibility >>>> > >>>> > documentation in 2.x that has since been removed in 3.x [1] [2]. I >>>> > >>>> > think >>>> > >>>> > this would help shape my opinion on whether or not to keep branch-2 >>>> > >>>> > alive. >>>> > >>>> > My current understanding is that we can't really break compatibility >>>> > >>>> > in >>>> > >>>> > either a minor or point release. But the only mention of the >>>> > >>>> > difference >>>> > >>>> > between minor and point releases is how to deal with Stable, >>>> > >>>> > Evolving, >>>> > >>>> > and >>>> > >>>> > Unstable tags, and how to deal with changing default configuration >>>> > >>>> > values. >>>> > >>>> > So it seems like there really isn't a big official difference between >>>> > >>>> > the >>>> > >>>> > two. In my mind, the functional difference between the two is that >>>> > >>>> > the >>>> > >>>> > minor releases may have added features and rewrites, while the point >>>> > >>>> > releases only have bug fixes. This might be an incorrect >>>> > >>>> > understanding, but >>>> > >>>> > that's what I have gathered from watching the releases over the last >>>> > >>>> > few >>>> > >>>> > years. Whether or not this is a correct understanding, I think that >>>> > >>>> > this >>>> > >>>> > needs to be documented somewhere, even if it is just a convention. >>>> > >>>> > >>>> > Given my assumed understanding of minor vs point releases, here are >>>> > >>>> > the >>>> > >>>> > pros/cons that I can think of for having a branch-2. Please add on or >>>> > >>>> > correct me for anything you feel is missing or inadequate. >>>> > >>>> > Pros: >>>> > >>>> > - Features/rewrites/higher-risk patches are less likely to be put >>>> > >>>> > into >>>> > >>>> > 2.10.x >>>> > >>>> > - It is less necessary to move to 3.x >>>> > >>>> > >>>> > Cons: >>>> > >>>> > - Bug fixes are less likely to be put into 2.10.x >>>> > >>>> > - An extra branch to maintain >>>> > >>>> > - Committers have an extra branch (5 vs 4 total branches) to commit >>>> > >>>> > patches to if they should go all the way back to 2.10.x >>>> > >>>> > - It is less necessary to move to 3.x >>>> > >>>> > >>>> > So on the one hand you get added stability in fewer features being >>>> > >>>> > committed to 2.10.x, but then on the other you get fewer bug fixes >>>> > >>>> > being >>>> > >>>> > committed. In a perfect world, we wouldn't have to make this >>>> > >>>> > tradeoff. >>>> > >>>> > But >>>> > >>>> > we don't live in a perfect world and committers will make mistakes >>>> > >>>> > either >>>> > >>>> > because of lack of knowledge or simply because they made a mistake. >>>> > >>>> > If >>>> > >>>> > we >>>> > >>>> > have a branch-2, committers will forget, not know to, or choose not >>>> > >>>> > to >>>> > >>>> > (for >>>> > >>>> > whatever reason) commit valid bug fixes back all the way to >>>> > >>>> > branch-2.10. If >>>> > >>>> > we don't have a branch-2, committers who want their borderline risky >>>> > >>>> > feature in the 2.x line will err on the side of putting it into >>>> > >>>> > branch-2.10 >>>> > >>>> > instead of proposing the creation of a branch-2. Clearly I have made >>>> > >>>> > quite >>>> > >>>> > a few assumptions here based on my own experiences, so I would like >>>> > >>>> > to >>>> > >>>> > hear >>>> > >>>> > if others have similar or opposing views. >>>> > >>>> > >>>> > As far as 3.x goes, to me it seems like some of the reasoning for >>>> > >>>> > killing >>>> > >>>> > branch-2 is due to an effort to push the community towards 3.x. This >>>> > >>>> > is why >>>> > >>>> > I have added movement to 3.x as both a pro and a con. As a community >>>> > >>>> > trying >>>> > >>>> > to move forward, keeping as many companies on similar branches as >>>> > >>>> > possible >>>> > >>>> > is a good way to make sure the code is well-tested. However, from a >>>> > >>>> > stability point of view, moving to 3.x is still scary and being able >>>> > >>>> > to >>>> > >>>> > stay on 2.x until you are comfortable to move is very nice. The >>>> > >>>> > 2.10.0 >>>> > >>>> > bridge release effort has been very good at making it possible for >>>> > >>>> > people >>>> > >>>> > to move from 2.x in 3.x, but the diff between 2.x and 3.x is so large >>>> > >>>> > that >>>> > >>>> > it is reasonable for companies to want to be extra cautious with 3.x >>>> > >>>> > due to >>>> > >>>> > potential performance degradation at large scale. >>>> > >>>> > >>>> > A question I'm pondering is what happens when we move to Java 11 and >>>> > >>>> > someone is still on 2.x? If they want to backport HADOOP-15338 >>>> > >>>> > <https://issues.apache.org/jira/browse/HADOOP-15338> for Java 11 >>>> > >>>> > support to >>>> > >>>> > 2.x, surely not everyone is going to want that (at least not >>>> > >>>> > immediately). >>>> > >>>> > The 2.10 documentation states, "The JVM requirements will not change >>>> > >>>> > across >>>> > >>>> > point releases within the same minor release except if the JVM >>>> > >>>> > version >>>> > >>>> > under question becomes unsupported" [1], so this would warrant a 2.11 >>>> > >>>> > release until Java 8 becomes unsupported (though one could argue that >>>> > >>>> > it is >>>> > >>>> > already unsupported since Oracle is no longer giving public Java 8 >>>> > >>>> > update). >>>> > >>>> > If we don't keep branch-2 around now, would a Java 11 backport be the >>>> > >>>> > catalyst for a branch-2 revival? >>>> > >>>> > >>>> > Not sure if this really leads to any sort of answer from me on >>>> > >>>> > whether >>>> > >>>> > or >>>> > >>>> > not we should keep branch-2 alive, but these are the things that I am >>>> > >>>> > weighing in my mind. For me, the bigger problem beyond having >>>> > >>>> > branch-2 >>>> > >>>> > or >>>> > >>>> > not is committers not being on the same page with where they should >>>> > >>>> > commit >>>> > >>>> > their patches. >>>> > >>>> > >>>> > Eric >>>> > >>>> > >>>> > [1] >>>> > >>>> > >>>> > >>>> > >>>> > >>>> https://hadoop.apache.org/docs/r2.10.0/hadoop-project-dist/hadoop-common/Compatibility.html >>>> > >>>> > [2] >>>> > >>>> > >>>> > >>>> > >>>> > >>>> https://hadoop.apache.org/docs/r3.0.0/hadoop-project-dist/hadoop-common/Compatibility.html >>>> > >>>> > >>>> > On Tue, Nov 19, 2019 at 2:49 PM epa...@apache.org <epa...@apache.org >>>> > >>>> > >>>> > wrote: >>>> > >>>> > >>>> > Hi Konstantin, >>>> > >>>> > >>>> > Sure, I understand those concerns. On the other hand, I worry about >>>> > >>>> > the >>>> > >>>> > stability of 2.10, since we will be on it for a couple of years at >>>> > >>>> > least. >>>> > >>>> > I worry >>>> > >>>> > that some committers may want to put new features into a branch 2 >>>> > >>>> > release, >>>> > >>>> > and without a branch-2, they will go directly into 2.10. Since we >>>> > >>>> > don't >>>> > >>>> > always >>>> > >>>> > catch corner cases or performance problems for some time (usually >>>> > >>>> > not >>>> > >>>> > until >>>> > >>>> > the release is deployed to a busy, 4-thousand node cluster), it >>>> > >>>> > may >>>> > >>>> > be >>>> > >>>> > very >>>> > >>>> > difficult to back out those changes. >>>> > >>>> > >>>> > It sounds like I'm in the minority here, so I'm not nixing the >>>> > >>>> > idea, >>>> > >>>> > but I >>>> > >>>> > do >>>> > >>>> > have these reservations. >>>> > >>>> > >>>> > Thanks, >>>> > >>>> > -Eric >>>> > >>>> > >>>> > >>>> > >>>> > On Tuesday, November 19, 2019, 1:04:15 AM CST, Konstantin Shvachko >>>> > >>>> > < >>>> > >>>> > shv.had...@gmail.com> wrote: >>>> > >>>> > Hi Eric, >>>> > >>>> > >>>> > We had a long discussion on this list regarding making the 2.10 >>>> > >>>> > release the >>>> > >>>> > last of branch-2 releases. We intended 2.10 as a bridge release >>>> > >>>> > between >>>> > >>>> > Hadoop 2 and 3. We may have bug-fix releases or 2.10, but 2.11 is >>>> > >>>> > not in >>>> > >>>> > the picture right now, and many people may object this idea. >>>> > >>>> > >>>> > I understand Jonathan's proposal as an attempt to >>>> > >>>> > 1. eliminate confusion which branches people should commit their >>>> > >>>> > back-ports >>>> > >>>> > to >>>> > >>>> > 2. save engineering effort committing to more branches than >>>> > >>>> > necessary >>>> > >>>> > >>>> > "Branches are cheap" as our founder used to say. If we ever decide >>>> > >>>> > to >>>> > >>>> > release 2.11 we can resurrect the branch. >>>> > >>>> > Until then I am in favor of Jonathan's proposal +1. >>>> > >>>> > >>>> > Thanks, >>>> > >>>> > --Konstantin >>>> > >>>> > >>>> > >>>> > On Mon, Nov 18, 2019 at 10:41 AM Jonathan Hung < >>>> > >>>> > jyhung2...@gmail.com >>>> > >>>> > >>>> > wrote: >>>> > >>>> > >>>> > Thanks Eric for the comments - regarding your concerns, I feel >>>> > >>>> > the >>>> > >>>> > pros >>>> > >>>> > outweigh the cons. To me, the chances of patch releases on 2.10.x >>>> > >>>> > are >>>> > >>>> > much >>>> > >>>> > higher than a new 2.11 minor release. (There didn't seem to be >>>> > >>>> > many >>>> > >>>> > people >>>> > >>>> > outside of our company who expressed interest in getting new >>>> > >>>> > features to >>>> > >>>> > branch-2 prior to the 2.10.0 release.) Even now, a few weeks >>>> > >>>> > after >>>> > >>>> > 2.10.0 >>>> > >>>> > release, there's 29 patches that have gone into branch-2 and 9 in >>>> > >>>> > branch-2.10, so it's already diverged quite a bit. >>>> > >>>> > >>>> > In any case, we can always reverse this decision if we really >>>> > >>>> > need >>>> > >>>> > to, by >>>> > >>>> > recreating branch-2. But this proposal would reduce a lot of >>>> > >>>> > confusion >>>> > >>>> > IMO. >>>> > >>>> > >>>> > Jonathan Hung >>>> > >>>> > >>>> > >>>> > On Fri, Nov 15, 2019 at 11:41 AM epa...@apache.org < >>>> > >>>> > epa...@apache.org> >>>> > >>>> > wrote: >>>> > >>>> > >>>> > Thanks Jonathan for opening the discussion. >>>> > >>>> > >>>> > I am not in favor of this proposal. 2.10 was very recently >>>> > >>>> > released, >>>> > >>>> > and >>>> > >>>> > moving to 2.10 will take some time for the community. It seems >>>> > >>>> > premature >>>> > >>>> > to >>>> > >>>> > make a decision at this point that there will never be a need >>>> > >>>> > for a >>>> > >>>> > 2.11 >>>> > >>>> > release. >>>> > >>>> > >>>> > -Eric >>>> > >>>> > >>>> > >>>> > On Thursday, November 14, 2019, 8:51:59 PM CST, Jonathan Hung >>>> > >>>> > < >>>> > >>>> > jyhung2...@gmail.com> wrote: >>>> > >>>> > >>>> > Hi folks, >>>> > >>>> > >>>> > Given the release of 2.10.0, and the fact that it's intended to >>>> > >>>> > be a >>>> > >>>> > bridge >>>> > >>>> > release to Hadoop 3.x [1], I'm proposing we make 2.10.x the >>>> > >>>> > last >>>> > >>>> > minor >>>> > >>>> > release line in branch-2. Currently, the main issue is that >>>> > >>>> > there's >>>> > >>>> > many >>>> > >>>> > fixes going into branch-2 (the theoretical 2.11.0) that's not >>>> > >>>> > going >>>> > >>>> > into >>>> > >>>> > branch-2.10 (which will become 2.10.1), so the fixes in >>>> > >>>> > branch-2 >>>> > >>>> > will >>>> > >>>> > likely never see the light of day unless they are backported to >>>> > >>>> > branch-2.10. >>>> > >>>> > >>>> > To do this, I propose we: >>>> > >>>> > >>>> > - Delete branch-2.10 >>>> > >>>> > - Rename branch-2 to branch-2.10 >>>> > >>>> > - Set version in the new branch-2.10 to 2.10.1-SNAPSHOT >>>> > >>>> > >>>> > This way we get all the current branch-2 fixes into the 2.10.x >>>> > >>>> > release >>>> > >>>> > line. Then the commit chain will look like: trunk -> branch-3.2 >>>> > >>>> > -> >>>> > >>>> > branch-3.1 -> branch-2.10 -> branch-2.9 -> branch-2.8 >>>> > >>>> > >>>> > Thoughts? >>>> > >>>> > >>>> > Jonathan Hung >>>> > >>>> > >>>> > [1] >>>> > >>>> > >>>> > >>>> > https://www.mail-archive.com/yarn-dev@hadoop.apache.org/msg29479.html >>>> > >>>> > >>>> > >>>> > >>>> > >>>> > --------------------------------------------------------------------- >>>> > >>>> > To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org >>>> > >>>> > For additional commands, e-mail: common-dev-h...@hadoop.apache.org >>>> > >>>> > >>>> > >>>> > >>>> > >>>> > >>>> > >>>> >>>