+1 I agree with Xiaoqiao He's idea.
Best Regards, Shilun Fan. On Fri, Mar 1, 2024 at 12:55 PM Xiaoqiao He <hexiaoq...@apache.org> wrote: > Thanks Shilun for your great work! It is acceptable for me to release > 3.4.0 first which is dependent on hadoop-thirdparty-1.2.0. > Then push forwards to fix the following issues mentioned above at the next > release version. > I don't think we can solve all historical issues in one release. If it is > possible, we could mark this release (release-3.4.0) as an > unstable version. > Any thoughts? Thanks again. > > Best Regards, > - He Xiaoqiao > > On Fri, Mar 1, 2024 at 12:19 PM slfan1989 <slfan1...@apache.org> wrote: > >> I expect to initiate a vote for hadoop-3.4.0-RC3 in preparation for the >> hadoop-3.4.0 release. We have been working on this for 2 months and have >> already released hadoop-thirdparty-1.2.0. >> >> Regarding the issue described in HADOOP-19090, I believe we can address >> it in the hadoop-3.4.1 release because not all improvements can be expected >> to be completed in hadoop-3.4.0. >> >> I commented on HADOOP-19090: >> >> I am not opposed to releasing hadoop-thirdparty-1.2.1, but I don't think >> now is a good time to do so. If we were to release hadoop-thirdparty-1.2.1, >> our process is too lengthy: >> >> 1. We need to announce this in a public mailing list. >> 2. Then initiate a vote, and after the vote passes, release >> hadoop-thirdparty-1.2.1. >> 3. Introduce version 1.2.1 in the Hadoop trunk branch. >> 4. backport hadoop-3.4.0 >> >> Even if we upgrade to protobuf-3.23.4, there might still be other issues. >> If there really are other issues, would we need to release >> hadoop-thirdparty-1.2.2? >> >> I think a better approach would be: >> >> To notify about this in the release email for hadoop-3.4.0, and then >> release hadoop-thirdparty-1.2.1 before the release of hadoop-3.4.1, >> followed by thorough validation. >> >> I would like to hear the thoughts of other members. >> >> Best Regards, >> Shilun Fan. >> >> On Fri, Mar 1, 2024 at 6:05 AM slfan1989 <slfan1...@apache.org> wrote: >> >>> Thank you for the feedback on this issue! >>> >>> We have already released hadoop-thirdparty-1.2.0. I think we should not >>> release hadoop-thirdparty-1.2.1 before the launch of hadoop-3.4.0, as we >>> are already short on time. >>> >>> Can we consider addressing this matter with the release of hadoop-3.4.1 >>> instead? >>> >>> From my personal point of view, I hope to solve this problem in >>> hadoop-3.4.1. >>> >>> Best Regards, >>> Shilun Fan. >>> >>> On Fri, Mar 1, 2024 at 5:37 AM PJ Fanning <fannin...@apache.org> wrote: >>> >>>> There is an issue with the protobuf lib - described here [1] >>>> >>>> The idea would be to do a new hadoop-thirdparty release and uptake that. >>>> >>>> Related the hadoop-thirdparty uptake, I would like to get the Avro >>>> uptake merged [2]. I think if we don't merge this for Hadoop 3.4.0, we >>>> will have to wait until v3.5.0 instead because changing the Avro >>>> compilation is probably something that you would want in a patch >>>> release. >>>> >>>> >>>> [1] https://issues.apache.org/jira/browse/HADOOP-19090 >>>> [2] https://github.com/apache/hadoop/pull/4854#issuecomment-1967549235 >>>> >>>> >>>> On Thu, 29 Feb 2024 at 22:24, slfan1989 <slfan1...@apache.org> wrote: >>>> > >>>> > I am preparing hadoop-3.4.0-RC3 as we have already released 3 RC >>>> versions >>>> > before, and I hope hadoop-3.4.0-RC3 will receive the approval of the >>>> > members. >>>> > >>>> > Compared to hadoop-3.4.0-RC2, my plan is to backport 2 PRs from >>>> branch-3.4 >>>> > to branch-3.4.0: >>>> > >>>> > HADOOP-18088: Replacing log4j 1.x with reload4j. >>>> > HADOOP-19084: Pruning hadoop-common transitive dependencies. >>>> > >>>> > I will use hadoop-release-support to package the arm version. >>>> > >>>> > I plan to release hadoop-3.4.0-RC3 next Monday. >>>> > >>>> > Best Regards, >>>> > Shilun Fan. >>>> > >>>> > On Sat, Feb 24, 2024 at 11:28 AM slfan1989 <slfan1...@apache.org> >>>> wrote: >>>> > >>>> > > Thank you very much for Steve's detailed test report and issue >>>> description! >>>> > > >>>> > > I appreciate your time spent helping with validation. I am >>>> currently >>>> > > trying to use hadoop-release-support to prepare hadoop-3.4.0-RC3. >>>> > > >>>> > > After completing the hadoop-3.4.0 version, I will document some of >>>> the >>>> > > issues encountered in the "how to release" document, so that future >>>> members >>>> > > can refer to it during the release process. >>>> > > >>>> > > Once again, thank you to all members involved in the hadoop-3.4.0 >>>> release. >>>> > > >>>> > > Let's hope for a smooth release process. >>>> > > >>>> > > Best Regards, >>>> > > Shilun Fan. >>>> > > >>>> > > On Sat, Feb 24, 2024 at 2:29 AM Steve Loughran >>>> <ste...@cloudera.com.invalid> >>>> > > wrote: >>>> > > >>>> > >> I have been testing this all week, and a -1 until some very minor >>>> changes >>>> > >> go in. >>>> > >> >>>> > >> >>>> > >> 1. build the arm64 binaries with the same jar artifacts as the >>>> x86 one >>>> > >> 2. include ad8b6541117b HADOOP-18088. Replace log4j 1.x with >>>> reload4j. >>>> > >> 3. include 80b4bb68159c HADOOP-19084. Prune hadoop-common >>>> transitive >>>> > >> dependencies >>>> > >> >>>> > >> >>>> > >> For #1 we have automation there in my client-validator module, >>>> which I >>>> > >> have >>>> > >> moved to be a hadoop-managed project and tried to make more >>>> > >> manageable >>>> > >> https://github.com/apache/hadoop-release-support >>>> > >> >>>> > >> This contains an ant project to perform a lot of the documented >>>> build >>>> > >> stages, including using SCP to copy down an x86 release tarball >>>> and make a >>>> > >> signed copy of this containing (locally built) arm artifacts. >>>> > >> >>>> > >> Although that only works with my development environment (macbook >>>> m1 >>>> > >> laptop >>>> > >> and remote ec2 server), it should be straightforward to make it >>>> more >>>> > >> flexible. >>>> > >> >>>> > >> It also includes and tests a maven project which imports many of >>>> the >>>> > >> hadoop-* pom files and run some test with it; this caught some >>>> problems >>>> > >> with exported slf4j and log4j2 artifacts getting into the >>>> classpath. That >>>> > >> is: hadoop-common pulling in log4j 1.2 and 2.x bindings. >>>> > >> >>>> > >> HADOOP-19084 fixes this; the build file now includes a target to >>>> scan the >>>> > >> dependencies and fail if "forbidden" artifacts are found. I have >>>> not been >>>> > >> able to stop logback ending on the transitive dependency list, but >>>> at >>>> > >> least >>>> > >> there is only one slf4j there. >>>> > >> >>>> > >> HADOOP-18088. Replace log4j 1.x with reload4j switches over to >>>> reload4j >>>> > >> while the move to v2 is still something we have to consider a WiP. >>>> > >> >>>> > >> I have tried doing some other changes to the packaging this week >>>> > >> - creating a lean distro without the AWS SDK >>>> > >> - trying to get protobuf-2.5 out of yarn-api >>>> > >> However, I think it is too late to try applying patches this risky. >>>> > >> >>>> > >> I Believe we should get the 3.4.0 release out for people to start >>>> playing >>>> > >> with while we rapidly iterate 3.4.1 release out with >>>> > >> - updated dependencies (where possible) >>>> > >> - separate "lean" and "full" installations, where "full" includes >>>> all the >>>> > >> cloud connectors and their dependencies; the default is lean and >>>> doesn't. >>>> > >> That will cut the default download size in half. >>>> > >> - critical issues which people who use the 3.4.0 release raise >>>> with us. >>>> > >> >>>> > >> That is: a packaging and bugs release, with a minimal number of new >>>> > >> features. >>>> > >> >>>> > >> I've created HADOOP-19087 >>>> > >> <https://issues.apache.org/jira/browse/HADOOP-19087> to cover >>>> this, >>>> > >> I'm willing to get my hands dirty here -Shilun Fan and Xiaoqiao He >>>> have >>>> > >> put >>>> > >> a lot of work on 3.4.0 and probably need other people to take up >>>> the work >>>> > >> for next release. Who else is willing to participate? (Yes Mukund, >>>> I have >>>> > >> you in mind too) >>>> > >> >>>> > >> One thing I would like to visit is: what hadoop-tools modules can >>>> we cut? >>>> > >> Are rumen and hadoop-streaming being actively used? Or can we >>>> consider >>>> > >> them >>>> > >> implicitly EOL and strip. Just think of the maintenance effort we >>>> would >>>> > >> save. >>>> > >> >>>> > >> --- >>>> > >> >>>> > >> Incidentally, I have tested the arm stuff on my raspberry pi5 >>>> which is now >>>> > >> running 64 bit linux. I believe it is the first time we have >>>> qualified a >>>> > >> Hadoop release with the media player under someone's television. >>>> > >> >>>> > >> On Thu, 15 Feb 2024 at 20:41, Mukund Madhav Thakur < >>>> mtha...@cloudera.com> >>>> > >> wrote: >>>> > >> >>>> > >> > Thanks, Shilun for putting this together. >>>> > >> > >>>> > >> > Tried the below things and everything worked for me. >>>> > >> > >>>> > >> > validated checksum and gpg signature. >>>> > >> > compiled from source. >>>> > >> > Ran AWS integration tests. >>>> > >> > untar the binaries and able to access objects in S3 via hadoop fs >>>> > >> commands. >>>> > >> > compiled gcs-connector successfully using the 3.4.0 version. >>>> > >> > >>>> > >> > qq: what is the difference between RC1 and RC2? apart from some >>>> extra >>>> > >> > patches. >>>> > >> > >>>> > >> > >>>> > >> > >>>> > >> > On Thu, Feb 15, 2024 at 10:58 AM slfan1989 <slfan1...@apache.org >>>> > >>>> > >> wrote: >>>> > >> > >>>> > >> >> Thank you for explaining this part! >>>> > >> >> >>>> > >> >> hadoop-3.4.0-RC2 used the validate-hadoop-client-artifacts tool >>>> to >>>> > >> >> generate >>>> > >> >> the ARM tar package, which should meet expectations. >>>> > >> >> >>>> > >> >> We also look forward to other members helping to verify. >>>> > >> >> >>>> > >> >> Best Regards, >>>> > >> >> Shilun Fan. >>>> > >> >> >>>> > >> >> On Fri, Feb 16, 2024 at 12:22 AM Steve Loughran < >>>> ste...@cloudera.com> >>>> > >> >> wrote: >>>> > >> >> >>>> > >> >> > >>>> > >> >> > >>>> > >> >> > On Mon, 12 Feb 2024 at 15:32, slfan1989 <slfan1...@apache.org >>>> > >>>> > >> wrote: >>>> > >> >> > >>>> > >> >> >> >>>> > >> >> >> >>>> > >> >> >> Note, because the arm64 binaries are built separately on a >>>> different >>>> > >> >> >> platform and JVM, their jar files may not match those of the >>>> x86 >>>> > >> >> >> release -and therefore the maven artifacts. I don't think >>>> this is >>>> > >> >> >> an issue (the ASF actually releases source tarballs, the >>>> binaries >>>> > >> are >>>> > >> >> >> there for help only, though with the maven repo that's a bit >>>> > >> blurred). >>>> > >> >> >> >>>> > >> >> >> The only way to be consistent would actually untar the >>>> x86.tar.gz, >>>> > >> >> >> overwrite its binaries with the arm stuff, retar, sign and >>>> push out >>>> > >> >> >> for the vote. >>>> > >> >> > >>>> > >> >> > >>>> > >> >> > >>>> > >> >> > that's exactly what the "arm.release" target in my >>>> client-validator >>>> > >> >> does. >>>> > >> >> > builds an arm tar with the x86 binaries but the arm native >>>> libs, >>>> > >> signs >>>> > >> >> it. >>>> > >> >> > >>>> > >> >> > >>>> > >> >> > >>>> > >> >> >> Even automating that would be risky. >>>> > >> >> >> >>>> > >> >> >> >>>> > >> >> > automating is the *only* way to do it; apache ant has >>>> everything >>>> > >> needed >>>> > >> >> > for this including the ability to run gpg. >>>> > >> >> > >>>> > >> >> > we did this on the relevant 3.3.x releases and nobody has yet >>>> > >> >> complained... >>>> > >> >> > >>>> > >> >> >>>> > >> > >>>> > >> >>>> > > >>>> >>>