Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-28 Thread Suhail, Ahmar
Thank you everyone for testing and voting, and apologies for not creating a 
separate thread for RC3, will keep that in mind next time. 

The voting summary is:

+5 binding (Ayush, Steve, He Xiaoqiao, Mukund, Masatake)
+3 non-binding (Erdogan, Dongjoon, Cheng Pan)

With 5 binding votes, the Hadoop release 3.4.2 has passed. 
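
For anyone checking the tally: as I understand the ASF release policy, a release vote needs at least three binding +1 votes and more binding +1 than -1 votes. A minimal sketch of that check, using the voter names from the summary above; the `Vote` type and the `release_vote_passes` helper are illustrative names only, not part of any Hadoop or ASF tooling:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Vote:
    voter: str
    value: int      # +1, 0 or -1
    binding: bool   # True for PMC (binding) votes


def release_vote_passes(votes, min_binding_plus_ones=3):
    """Majority approval: enough binding +1s, and more binding +1s than -1s."""
    binding_values = [v.value for v in votes if v.binding]
    plus = sum(1 for value in binding_values if value > 0)
    minus = sum(1 for value in binding_values if value < 0)
    return plus >= min_binding_plus_ones and plus > minus


# Tally taken from the voting summary in this message.
votes = (
    [Vote(name, 1, True)
     for name in ("Ayush", "Steve", "He Xiaoqiao", "Mukund", "Masatake")]
    + [Vote(name, 1, False)
       for name in ("Erdogan", "Dongjoon", "Cheng Pan")]
)

print(release_vote_passes(votes))
```

Non-binding votes are kept in the list for the record but do not affect the result.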

I will now proceed with the next steps to finalise it. 

Thanks again to everyone for the help!
Ahmar

On 20/08/2025, 16:14, "Suhail, Ahmar" wrote:


Hey all,


I've fixed the following issues:
1) Issues with the changelog, release notes and landing page. Created a new build 
so they now include all the JIRAs that were missing before.
2) Created the lean tar on Ubuntu so you won't get the "tar: Ignoring unknown 
extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'" message when 
untarring.
3) Previous issues with the sha512 and the tar not being gzipped.


RC3 for Hadoop 3.4.2 is now available for voting.


The RC is available at:


https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC3/ 



The git tag is release-3.4.2-RC3, commit
84e8b89ee2ebe6923691205b9e171badde7a495c


The maven artifacts are staged at:


https://repository.apache.org/content/repositories/orgapachehadoop-1443 



You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:


https://dist.apache.org/repos/dist/release/hadoop/common/KEYS 





This release has been created off of branch-3.4.


Thanks,
Ahmar


On 20/08/2025, 09:25, "Masatake Iwasaki" wrote:




> Since I have to regenerate the site and changelog, do I need to do the whole 
> build again or is there an easier way to do that? (just by regenerating the 
> site manually and updating the artifacts in SVN maybe?)




I think just running the create-release[1] script could be the easy way,
although it does not (yet) provide options for building only the site or source 
tarballs.
Since the binary tarballs contain the affected CHANGELOG and RELEASENOTES,
it would be nice to update the binary tarballs too.
# Swapping the CHANGELOG and RELEASENOTES in the existing binary tarballs
# could be OK if only documentation is updated.
# Signatures and checksums must be taken care of in that case.




In addition, I got the following warnings when I extracted the binary tarballs
(hadoop-3.4.2-lean.tar.gz and hadoop-3.4.2-aarch64-lean.tar.gz) on Ubuntu:




tar: Ignoring unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.provenance'
tar: Ignoring unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'
tar: Ignoring unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.provenance'
tar: Ignoring unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'




I guess the cause is that the tar files were modified in a macOS environment.
It would be nice to fix this too.




[1] https://github.com/apache/hadoop/blob/branch-3.4.2/dev-support/bin/create-release




Masatake Iwasaki




On 8/19/2025 10:28 PM, Suhail, Ahmar wrote:
> Thanks Masatake and Steve,
>
> Here's the PR for updating the top page of the site: 
> https://github.com/apache/hadoop/pull/7887 
> If there's anything else we think needs to go in there, let me know, 
> otherwise I will merge by tomorrow.
>
> Since I have to regenerate the site and changelog, do I need to do the whole 
> build again or is there an easier way to do that? (just by regenerating the 
> site manually and updating the artifacts in SVN maybe?)
>
> I will update the artifacts for the hadoop-3.4.2-lean.tar.gz and 
> hadoop-3.4.2-aarch64-lean.tar.sha512 now.
>
> On 19/08/2025, 08:26, "Masatake Iwasaki" wrote:
>

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-26 Thread Masatake Iwasaki

+1 (binding)

+ verified checksums and signatures of src and site tarballs.
+ built from source tarball with native profile enabled on
  Rocky Linux 9 (x86_64 and aarch64).
+ deployed pseudo cluster with and without kerberos enabled.
  + ran some example jobs.
  + ran some simple query against httpfs.
+ skimmed the contents of pre-built site documentation.
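
For anyone repeating the checksum part of these checks, a minimal sketch of computing a SHA-512 digest and comparing it with a published value. It assumes the expected hex digest has already been copied out of the `.sha512` file by hand (the exact layout of that file is not parsed here), and `sha512_of` and `checksum_matches` are illustrative names. Signature verification needs gpg and the KEYS file and is not covered:

```python
import hashlib


def sha512_of(path, chunk_size=1 << 20):
    """Stream the file in chunks so large tarballs need little memory."""
    digest = hashlib.sha512()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checksum_matches(path, expected_hex):
    """Compare against an expected digest, ignoring case and whitespace."""
    normalised = "".join(expected_hex.split()).lower()
    return sha512_of(path) == normalised
```

Usage would be along the lines of `checksum_matches("hadoop-3.4.2-src.tar.gz", expected)`, where the file name is only an example.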

Thanks for the great work, Ahmar.

Masatake Iwasaki

On 8/21/2025 12:13 AM, Suhail, Ahmar wrote:

Hey all,

I've fixed the following issues:
1) Issues with the changelog, release notes and landing page. Created a new build 
so they now include all the JIRAs that were missing before.
2) Created the lean tar on Ubuntu so you won't get the "tar: Ignoring unknown 
extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'" message when 
untarring.
3) Previous issues with the sha512 and the tar not being gzipped.

RC3 for Hadoop 3.4.2 is now available for voting.

The RC is available at:

https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC3/

The git tag is release-3.4.2-RC3, commit
84e8b89ee2ebe6923691205b9e171badde7a495c

The maven artifacts are staged at:

https://repository.apache.org/content/repositories/orgapachehadoop-1443

You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:

https://dist.apache.org/repos/dist/release/hadoop/common/KEYS


This release has been created off of branch-3.4.

Thanks,
Ahmar

On 20/08/2025, 09:25, "Masatake Iwasaki" wrote:


Since I have to regenerate the site and changelog, do I need to do the whole 
build again or is there an easier way to do that? (just by regenerating the 
site manually and updating the artifacts in SVN maybe?)



I think just running the create-release[1] script could be the easy way,
although it does not (yet) provide options for building only the site or source 
tarballs.
Since the binary tarballs contain the affected CHANGELOG and RELEASENOTES,
it would be nice to update the binary tarballs too.
# Swapping the CHANGELOG and RELEASENOTES in the existing binary tarballs
# could be OK if only documentation is updated.
# Signatures and checksums must be taken care of in that case.


In addition, I got the following warnings when I extracted the binary tarballs
(hadoop-3.4.2-lean.tar.gz and hadoop-3.4.2-aarch64-lean.tar.gz) on Ubuntu:


tar: Ignoring unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.provenance'
tar: Ignoring unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'
tar: Ignoring unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.provenance'
tar: Ignoring unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'


I guess the cause is that the tar files were modified in a macOS environment.
It would be nice to fix this too.


[1] https://github.com/apache/hadoop/blob/branch-3.4.2/dev-support/bin/create-release 



Masatake Iwasaki


On 8/19/2025 10:28 PM, Suhail, Ahmar wrote:

Thanks Masatake and Steve,

Here's the PR for updating the top page of the site: 
https://github.com/apache/hadoop/pull/7887 
If there's anything else we think needs to go in there, let me know, 
otherwise I will merge by tomorrow.

Since I have to regenerate the site and changelog, do I need to do the whole 
build again or is there an easier way to do that? (just by regenerating the 
site manually and updating the artifacts in SVN maybe?)

I will update the artifacts for the hadoop-3.4.2-lean.tar.gz and 
hadoop-3.4.2-aarch64-lean.tar.sha512 now.

On 19/08/2025, 08:26, "Masatake Iwasaki" wrote:


Thanks for the update, Ahmar


The binary artifacts basically look good.
I have some comments about inconsistencies.


- Some JIRAs of additional commits did not have 3.4.2 as "Fix Version/s". I 
updated them.


Jira/Git commit message diff starting: 
##
Jira not present with version: 3.4.2. Commit: e1c0dee881820a4d834ec4a4d2c70d0d953bb933 HADOOP-19627. S3A: testIfMatchOverwriteWithOutdatedEtag() fails when not using SSE-KMS #7816
...
Jira not present with version: 3.4.2. Commit: 6a4ea9b1155aefb7732b700d8d7154f977141e39 HADOOP-19626. S3A: Updates AAL version to 1.2.1 (#7807) (#7815)
Jira not present with version: 3.4.2. Commit: 9677363829c21adeb2ce41f93059ab65fc115862 HADOOP-19554. LocalDirAllocator still doesn't always recover from directory deletion (#7683) (#7685)
...
Jira not present with version: 3.4.2. Commit: 57d821e6aad23b6048332720

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-26 Thread Suhail, Ahmar
Thank you everyone for voting, 

Voting summary so far:

+4 binding (Ayush, Steve, He Xiaoqiao, Mukund)
+1 non-binding (Erdogan)

Looking for one more binding vote, and I will also add mine soon. Hoping to 
close voting by the end of the week and get the release out, so please do test 
and vote soon.

Thanks,
Ahmar

On 26/08/2025, 07:20, "Ayush Saxena" wrote:


+1 (Binding)


* Built from source (X86 & ARM)
* Validated checksums.
* Verified Signatures.
* Verified all source files have ASF headers
* Validated NOTICE & LICENSE files
* Verified no diff b/w git tag & src tar
* Verified the output of `hadoop version`
* Ran some basic HDFS commands
* Ran example jobs: teragen, terasort, teravalidate, pi
* Browsed through the UI: NN, DN, RM, NM & JHS
* Skimmed over the contents of site tar


Thanx Ahmar for driving the release, Good Luck!!!


-Ayush


On Tue, 26 Aug 2025 at 08:01, Xiaoqiao He wrote:
>
> +1(binding).
>
> [Y] Verified sha512 checksum and signature.
> [Y] LICENSE files exist and NOTICE is included in source tarball.
> [Y] Build the source code on Ubuntu and OpenJDK 11 by `mvn clean package
> -DskipTests -Pnative -Pdist -Dtar`.
> [Y] Rat check is ok. `mvn clean apache-rat:check`
> [Y] Setup pseudo cluster with HDFS and YARN.
> [Y] Run simple FsShell - mkdir/put/get/mv/rm and check the result.
> [Y] Run example mr jobs and check the result - Pi & wordcount.
> [Y] Spot-check and run some unit tests.
> [Y] Skimmed the Web UI of NameNode/DataNode/Resourcemanager/NodeManager.
> [Y] Skimmed over the contents of site documentation.
> [Y] Skimmed over the contents of maven repo.
>
> Thanks Ahmar for your work.
>
> Best Regards,
> - He Xiaoqiao
>
> On Tue, Aug 26, 2025 at 8:09 AM Mukund Madhav Thakur wrote:
>
> > +1 binding
> > Verified signature and checksums.
> > Compiled the full source tree successfully.
> > Downloaded and untar the arm binary on Mac OS M3 and ran hadoop fs command
> > on us-west2 bucket which verified dropping in of aws sdk library.
> > Also verified access to abfs containers.
> > Compiled gcs connector using this release.
> >
> > Thanks Ahmar for driving the release.
> >
> > PS: I was expecting a new email for RC3 as RC2 had some problems. I think
> > the norm is to create a new email for every RC vote as per my
> > understanding.
> >
> > On Mon, Aug 25, 2025 at 4:31 PM Steve Loughran wrote:
> >
> > > +1 binding
> > >
> > > * downloaded, checked signing and hashes of binary, arm binary and source
> > > artifacts
> > > * full clean build of source
> > > > * aws test runs against s3 london and dell ecs. One oddity in building from
> > > > the untarred source tree, Terasort failing with history server not
> > > starting. This did work in my git copy of the source so I'm assuming I've
> > > got the core-site/auth-keys wrong (the ones which get copied in). Still,
> > > the cluster tests run, which is an improvement over trunk right now
> > > * full s3a command line testing as per my qualifying and sdk doc.
> > >
> > >
> > > Steve
> > >
> > >
> > >
> > >
> > >
> > > On Thu, 21 Aug 2025 at 15:20, Erdoğan Özkoca wrote:
> > >
> > > > +1 (non-binding)
> > > >
> > > > * Verified sha512 checksum was correct for source tarball
> > > > > * Verified signature was correct for source tarball (not verified trust)
> > > > * Built source code on Amazon-Linux 2 and OpenJDK 8 in Amazon EC2
> > > > * Verified S3A (hadoop-tools/hadoop-aws) unit tests passing except
> > > > ITestS3AContractOpen.testInputStreamReadNegativePosition test which is
> > > > non-blocker for release as
> > > > https://issues.apache.org/jira/browse/HADOOP-19556 
> > > > 
> > > > * Verified S3A (hadoop-tools/hadoop-aws) integ tests with scale profile
> > > > passing against Amazon S3 in eu-west-1
> > > >
> > > > On Thu, 21 Aug 2025 at 10:52, Suhail, Ahmar wrote:
> > > >
> > > > > > Thanks Anuj, I've merged. Since I cut RC-3 yesterday, this won't be
> > > > > > included in the release yet, if the RC fails the vote, will include it in
> > > > > > the next one.
> > > > >
> > > > > > On 21/08/2025, 05:08, "Anuj Modi" wrote:
> > > > >
> > > > >
> > > > > Hi, Ahmar
> > > > >
> > > > >
> > > > 

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-25 Thread Ayush Saxena
+1 (Binding)

* Built from source (X86 & ARM)
* Validated checksums.
* Verified Signatures.
* Verified all source files have ASF headers
* Validated NOTICE & LICENSE files
* Verified no diff b/w git tag & src tar
* Verified the output of `hadoop version`
* Ran some basic HDFS commands
* Ran example jobs: teragen, terasort, teravalidate, pi
* Browsed through the UI: NN, DN, RM, NM & JHS
* Skimmed over the contents of site tar

Thanx Ahmar for driving the release, Good Luck!!!

-Ayush

On Tue, 26 Aug 2025 at 08:01, Xiaoqiao He  wrote:
>
> +1(binding).
>
> [Y] Verified sha512 checksum and signature.
> [Y] LICENSE files exist and NOTICE is included in source tarball.
> [Y] Build the source code on Ubuntu and OpenJDK 11 by `mvn clean package
> -DskipTests -Pnative -Pdist -Dtar`.
> [Y] Rat check is ok. `mvn clean apache-rat:check`
> [Y] Setup pseudo cluster with HDFS and YARN.
> [Y] Run simple FsShell - mkdir/put/get/mv/rm and check the result.
> [Y] Run example mr jobs and check the result - Pi & wordcount.
> [Y] Spot-check and run some unit tests.
> [Y] Skimmed the Web UI of NameNode/DataNode/Resourcemanager/NodeManager.
> [Y] Skimmed over the contents of site documentation.
> [Y] Skimmed over the contents of maven repo.
>
> Thanks Ahmar for your work.
>
> Best Regards,
> - He Xiaoqiao
>
> On Tue, Aug 26, 2025 at 8:09 AM Mukund Madhav Thakur wrote:
>
> > +1 binding
> > Verified signature and checksums.
> > Compiled the full source tree successfully.
> > Downloaded and untar the arm binary on Mac OS M3 and ran hadoop fs command
> > on us-west2 bucket which verified dropping in of aws sdk library.
> > Also verified access to abfs containers.
> > Compiled gcs connector using this release.
> >
> > Thanks Ahmar for driving the release.
> >
> > PS: I was expecting a new email for RC3 as RC2 had some problems. I think
> > the norm is to create a new email for every RC vote as per my
> > understanding.
> >
> > On Mon, Aug 25, 2025 at 4:31 PM Steve Loughran wrote:
> >
> > > +1 binding
> > >
> > > * downloaded, checked signing and hashes of binary, arm binary and source
> > > artifacts
> > > * full clean build of source
> > > * aws test runs against s3 london and dell ecs. One oddity in building from
> > > the untarred source tree, Terasort failing with history server not
> > > starting. This did work in my git copy of the source so I'm assuming I've
> > > got the core-site/auth-keys wrong (the ones which get copied in). Still,
> > > the cluster tests run, which is an improvement over trunk right now
> > > * full s3a command line testing as per my qualifying and sdk doc.
> > >
> > >
> > > Steve
> > >
> > >
> > >
> > >
> > >
> > > On Thu, 21 Aug 2025 at 15:20, Erdoğan Özkoca 
> > > wrote:
> > >
> > > > +1 (non-binding)
> > > >
> > > > * Verified sha512 checksum was correct for source tarball
> > > > > * Verified signature was correct for source tarball (not verified trust)
> > > > * Built source code on Amazon-Linux 2 and OpenJDK 8 in Amazon EC2
> > > > * Verified S3A (hadoop-tools/hadoop-aws) unit tests passing except
> > > > ITestS3AContractOpen.testInputStreamReadNegativePosition test which is
> > > > non-blocker for release as
> > > > https://issues.apache.org/jira/browse/HADOOP-19556
> > > > * Verified S3A (hadoop-tools/hadoop-aws) integ tests with scale profile
> > > > passing against Amazon S3 in eu-west-1
> > > >
> > > > On Thu, 21 Aug 2025 at 10:52, Suhail, Ahmar wrote:
> > > >
> > > > > > Thanks Anuj, I've merged. Since I cut RC-3 yesterday, this won't be
> > > > > > included in the release yet, if the RC fails the vote, will include it in
> > > > > > the next one.
> > > > >
> > > > > > On 21/08/2025, 05:08, "Anuj Modi" wrote:
> > > > >
> > > > >
> > > > > Hi, Ahmar
> > > > >
> > > > >
> > > > > > Here are the PRs for the fix raised for ABFS close() causing NPE raised on
> > > > > > RC.
> > > > > > PR on branch-3.4.2: https://github.com/apache/hadoop/pull/7888
> > > > > > PR on branch-3.4: https://github.com/apache/hadoop/pull/7889
> > > > >
> > > > >
> > > > > Feel free to merge them as per the release timelines.
> > > > >
> > > > >
> > > > > Thanks
> > > > >
> > > > >
> > > > > > On Wed, Aug 20, 2025 at 11:24 PM Steve Loughran wrote:
> > > > >
> > > > >
> > > > > > > just discovering as I do the third party compatibility testing of the
> > > > > > > latest sdk that the fs.s3a.create.checksum.algorithm stuff is a bit broken
> > > > > > > -nothing to worry about though as its a new feature so 

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-25 Thread Xiaoqiao He
+1(binding).

[Y] Verified sha512 checksum and signature.
[Y] LICENSE files exist and NOTICE is included in source tarball.
[Y] Build the source code on Ubuntu and OpenJDK 11 by `mvn clean package
-DskipTests -Pnative -Pdist -Dtar`.
[Y] Rat check is ok. `mvn clean apache-rat:check`
[Y] Setup pseudo cluster with HDFS and YARN.
[Y] Run simple FsShell - mkdir/put/get/mv/rm and check the result.
[Y] Run example mr jobs and check the result - Pi & wordcount.
[Y] Spot-check and run some unit tests.
[Y] Skimmed the Web UI of NameNode/DataNode/Resourcemanager/NodeManager.
[Y] Skimmed over the contents of site documentation.
[Y] Skimmed over the contents of maven repo.

Thanks Ahmar for your work.

Best Regards,
- He Xiaoqiao

On Tue, Aug 26, 2025 at 8:09 AM Mukund Madhav Thakur wrote:

> +1 binding
> Verified signature and checksums.
> Compiled the full source tree successfully.
> Downloaded and untar the arm binary on Mac OS M3 and ran hadoop fs command
> on us-west2 bucket which verified dropping in of aws sdk library.
> Also verified access to abfs containers.
> Compiled gcs connector using this release.
>
> Thanks Ahmar for driving the release.
>
> PS: I was expecting a new email for RC3 as RC2 had some problems. I think
> the norm is to create a new email for every RC vote as per my
> understanding.
>
> On Mon, Aug 25, 2025 at 4:31 PM Steve Loughran wrote:
>
> > +1 binding
> >
> > * downloaded, checked signing and hashes of binary, arm binary and source
> > artifacts
> > * full clean build of source
> > * aws test runs against s3 london and dell ecs. One oddity in building from
> > the untarred source tree, Terasort failing with history server not
> > starting. This did work in my git copy of the source so I'm assuming I've
> > got the core-site/auth-keys wrong (the ones which get copied in). Still,
> > the cluster tests run, which is an improvement over trunk right now
> > * full s3a command line testing as per my qualifying and sdk doc.
> >
> >
> > Steve
> >
> >
> >
> >
> >
> > On Thu, 21 Aug 2025 at 15:20, Erdoğan Özkoca 
> > wrote:
> >
> > > +1 (non-binding)
> > >
> > > * Verified sha512 checksum was correct for source tarball
> > > * Verified signature was correct for source tarball (not verified trust)
> > > * Built source code on Amazon-Linux 2 and OpenJDK 8 in Amazon EC2
> > > * Verified S3A (hadoop-tools/hadoop-aws) unit tests passing except
> > > ITestS3AContractOpen.testInputStreamReadNegativePosition test which is
> > > non-blocker for release as
> > > https://issues.apache.org/jira/browse/HADOOP-19556
> > > * Verified S3A (hadoop-tools/hadoop-aws) integ tests with scale profile
> > > passing against Amazon S3 in eu-west-1
> > >
> > > On Thu, 21 Aug 2025 at 10:52, Suhail, Ahmar wrote:
> > >
> > > > Thanks Anuj, I've merged. Since I cut RC-3 yesterday, this won't be
> > > > included in the release yet, if the RC fails the vote, will include it in
> > > > the next one.
> > > >
> > > > On 21/08/2025, 05:08, "Anuj Modi" wrote:
> > > >
> > > >
> > > > Hi, Ahmar
> > > >
> > > >
> > > > Here are the PRs for the fix raised for ABFS close() causing NPE raised on
> > > > RC.
> > > > PR on branch-3.4.2: https://github.com/apache/hadoop/pull/7888
> > > > PR on branch-3.4: https://github.com/apache/hadoop/pull/7889
> > > >
> > > >
> > > > Feel free to merge them as per the release timelines.
> > > >
> > > >
> > > > Thanks
> > > >
> > > >
> > > > On Wed, Aug 20, 2025 at 11:24 PM Steve Loughran wrote:
> > > >
> > > >
> > > > > just discovering as I do the third party compatibility testing of the
> > > > > latest sdk that the fs.s3a.create.checksum.algorithm stuff is a bit broken
> > > > > -nothing to worry about though as its a new feature so not a regression
> > > > >
> > > > > On Wed, 20 Aug 2025 at 16:13, Suhail, Ahmar wrote:
> > > > >
> > > > > > Hey all,
> > > > > >
> > > > > > I've fixed the following issues:
> > > > > > 1) Issues with the changelog, release notes and landing page. Created a new
> > > > > > build so they now include all the JIRAs that were missing before.
> > > > > > 2) Created the lean tar on Ubuntu so you won't get the "tar: Ignoring
> > > > > > unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'"
> > > > > > message when untarring.
> > > > > > 3) Previous issues with the sha512 and the tar not being gzipped.
> > > > > >
> > > > > > RC3 for Hadoop 3.4.2 is now available for voting.
> > > > > >
> > > > > > The RC is 

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-25 Thread Mukund Madhav Thakur
+1 binding
Verified signature and checksums.
Compiled the full source tree successfully.
Downloaded and untarred the arm binary on Mac OS M3 and ran hadoop fs command
on us-west2 bucket which verified dropping in of aws sdk library.
Also verified access to abfs containers.
Compiled gcs connector using this release.

Thanks Ahmar for driving the release.

PS: I was expecting a new email for RC3 as RC2 had some problems. I think
the norm is to create a new email for every RC vote as per my
understanding.

On Mon, Aug 25, 2025 at 4:31 PM Steve Loughran 
wrote:

> +1 binding
>
> * downloaded, checked signing and hashes of binary, arm binary and source
> artifacts
> * full clean build of source
> * aws test runs against s3 london and dell ecs. One oddity in building from
> the untarred source tree, Terasort failing with history server not
> starting. This did work in my git copy of the source so I'm assuming I've
> got the core-site/auth-keys wrong (the ones which get copied in). Still,
> the cluster tests run, which is an improvement over trunk right now
> * full s3a command line testing as per my qualifying and sdk doc.
>
>
> Steve
>
>
>
>
>
> On Thu, 21 Aug 2025 at 15:20, Erdoğan Özkoca 
> wrote:
>
> > +1 (non-binding)
> >
> > * Verified sha512 checksum was correct for source tarball
> > * Verified signature was correct for source tarball (not verified trust)
> > * Built source code on Amazon-Linux 2 and OpenJDK 8 in Amazon EC2
> > * Verified S3A (hadoop-tools/hadoop-aws) unit tests passing except
> > ITestS3AContractOpen.testInputStreamReadNegativePosition test which is
> > non-blocker for release as
> > https://issues.apache.org/jira/browse/HADOOP-19556
> > * Verified S3A (hadoop-tools/hadoop-aws) integ tests with scale profile
> > passing against Amazon S3 in eu-west-1
> >
> > On Thu, 21 Aug 2025 at 10:52, Suhail, Ahmar wrote:
> >
> > > Thanks Anuj, I've merged. Since I cut RC-3 yesterday, this won't be
> > > included in the release yet, if the RC fails the vote, will include it in
> > > the next one.
> > >
> > > On 21/08/2025, 05:08, "Anuj Modi" wrote:
> > >
> > >
> > > Hi, Ahmar
> > >
> > >
> > > Here are the PRs for the fix raised for ABFS close() causing NPE raised on
> > > RC.
> > > PR on branch-3.4.2: https://github.com/apache/hadoop/pull/7888
> > > PR on branch-3.4: https://github.com/apache/hadoop/pull/7889
> > >
> > >
> > > Feel free to merge them as per the release timelines.
> > >
> > >
> > > Thanks
> > >
> > >
> > > On Wed, Aug 20, 2025 at 11:24 PM Steve Loughran wrote:
> > >
> > >
> > > > just discovering as I do the third party compatibility testing of the
> > > > latest sdk that the fs.s3a.create.checksum.algorithm stuff is a bit broken
> > > > -nothing to worry about though as its a new feature so not a regression
> > > >
> > > > On Wed, 20 Aug 2025 at 16:13, Suhail, Ahmar wrote:
> > > >
> > > > > Hey all,
> > > > >
> > > > > I've fixed the following issues:
> > > > > 1) Issues with the changelog, release notes and landing page. Created a new
> > > > > build so they now include all the JIRAs that were missing before.
> > > > > 2) Created the lean tar on Ubuntu so you won't get the "tar: Ignoring
> > > > > unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'"
> > > > > message when untarring.
> > > > > 3) Previous issues with the sha512 and the tar not being gzipped.
> > > > >
> > > > > RC3 for Hadoop 3.4.2 is now available for voting.
> > > > >
> > > > > The RC is available at:
> > > > >
> > > > > > https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC3/
> > > > >
> > > > > The git tag is release-3.4.2-RC3, commit
> > > > > 84e8b89ee2ebe6923691205b9e171badde7a495c
> > > > >
> > > > > The maven artifacts are staged at:
> > > > >
> > > > >
> > >
> https://repository.apache.org/content/repositories/orgapachehadoop-1443
> > <
> > >
> https://repository.apache.org/content/repositories/orgapachehadoop-1443>
> > > > >
> > > > > > You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
> > > > >
> > > > > > https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
> > > > >
> > > > >
> > > > > This release has been created off of branch-3.4.
> > > > >
> > > > > Thanks,
> > > > > Ahmar
> > > > >
> > > > > > On 20/08/2025, 09:25, "Masatake Iwasaki" wrote:

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-25 Thread Steve Loughran
+1 binding

* downloaded, checked signing and hashes of binary, arm binary and source
artifacts
* full clean build of source
* aws test runs against s3 london and dell ecs. One oddity in building from
the untarred source tree, Terasort failing with history server not
starting. This did work in my git copy of the source so I'm assuming I've
got the core-site/auth-keys wrong (the ones which get copied in). Still,
the cluster tests run, which is an improvement over trunk right now
* full s3a command line testing as per my qualifying and sdk doc.


Steve





On Thu, 21 Aug 2025 at 15:20, Erdoğan Özkoca 
wrote:

> +1 (non-binding)
>
> * Verified sha512 checksum was correct for source tarball
> * Verified signature was correct for source tarball (not verified trust)
> * Built source code on Amazon-Linux 2 and OpenJDK 8 in Amazon EC2
> * Verified S3A (hadoop-tools/hadoop-aws) unit tests passing except
> ITestS3AContractOpen.testInputStreamReadNegativePosition test which is
> non-blocker for release as
> https://issues.apache.org/jira/browse/HADOOP-19556
> * Verified S3A (hadoop-tools/hadoop-aws) integ tests with scale profile
> passing against Amazon S3 in eu-west-1
>
> On Thu, 21 Aug 2025 at 10:52, Suhail, Ahmar wrote:
>
> > Thanks Anuj, I've merged. Since I cut RC-3 yesterday, this won't be
> > included in the release yet, if the RC fails the vote, will include it in
> > the next one.
> >
> > On 21/08/2025, 05:08, "Anuj Modi" wrote:
> >
> >
> > Hi, Ahmar
> >
> >
> > Here are the PRs for the fix raised for ABFS close() causing NPE raised on
> > RC.
> > PR on branch-3.4.2: https://github.com/apache/hadoop/pull/7888
> > PR on branch-3.4: https://github.com/apache/hadoop/pull/7889
> >
> >
> > Feel free to merge them as per the release timelines.
> >
> >
> > Thanks
> >
> >
> > On Wed, Aug 20, 2025 at 11:24 PM Steve Loughran wrote:
> >
> >
> > > just discovering as I do the third party compatibility testing of the
> > > latest sdk that the fs.s3a.create.checksum.algorithm stuff is a bit broken
> > > -nothing to worry about though as its a new feature so not a regression
> > >
> > > On Wed, 20 Aug 2025 at 16:13, Suhail, Ahmar wrote:
> > >
> > > > Hey all,
> > > >
> > > > I've fixed the following issues:
> > > > 1) Issues with the changelog, release notes and landing page. Created a new
> > > > build so they now include all the JIRAs that were missing before.
> > > > 2) Created the lean tar on Ubuntu so you won't get the "tar: Ignoring
> > > > unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'"
> > > > message when untarring.
> > > > 3) Previous issues with the sha512 and the tar not being gzipped.
> > > >
> > > > RC3 for Hadoop 3.4.2 is now available for voting.
> > > >
> > > > The RC is available at:
> > > >
> > > > https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC3/
> > > >
> > > > The git tag is release-3.4.2-RC3, commit
> > > > 84e8b89ee2ebe6923691205b9e171badde7a495c
> > > >
> > > > The maven artifacts are staged at:
> > > >
> > > >
> > https://repository.apache.org/content/repositories/orgapachehadoop-1443
> <
> > https://repository.apache.org/content/repositories/orgapachehadoop-1443>
> > > >
> > > > You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98)
> > at:
> > > >
> > > > https://dist.apache.org/repos/dist/release/hadoop/common/KEYS <
> > https://dist.apache.org/repos/dist/release/hadoop/common/KEYS>
> > > >
> > > >
> > > > This release has been created off of branch-3.4.
> > > >
> > > > Thanks,
> > > > Ahmar
> > > >
> > > > On 20/08/2025, 09:25, "Masatake Iwasaki" wrote:
> > > >
> > > >
> > > > > Since I have to regenerate the site and changelog, do I need to do
> > > > > the whole build again or is there an easier way to do that? (just by
> > > > > regenerating the site manually and updating the artifacts in SVN
> > > > > maybe?)
> > > >
> > > >
> > > > I think just running the create-release[1] script could be an easy
> > > > way while it does not provide options for building only site or source
> > > > tarballs (now).
> > > > Since the binary tarballs are containing affected CHANGELOG and
> > > > RELEASENOTES,

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-21 Thread Suhail, Ahmar
Thanks Anuj, I've merged. Since I cut RC-3 yesterday, this won't be included in 
the release yet, if the RC fails the vote, will include it in the next one.

On 21/08/2025, 05:08, "Anuj Modi" wrote:


Hi, Ahmar


Here are the PRs for the fix raised for ABFS close() causing NPE raised on
RC.
PR on branch-3.4.2: https://github.com/apache/hadoop/pull/7888
PR on branch-3.4: https://github.com/apache/hadoop/pull/7889



Feel free to merge them as per the release timelines.


Thanks


On Wed, Aug 20, 2025 at 11:24 PM Steve Loughran wrote:


> just discovering as I do the third party compatibility testing of the
> latest sdk that the fs.s3a.create.checksum.algorithm stuff is a bit broken
> -nothing to worry about though as its a new feature so not a regression
>
> On Wed, 20 Aug 2025 at 16:13, Suhail, Ahmar wrote:
>
> > Hey all,
> >
> > I've fixed the following issues:
> > 1) Issues with changelog, release notes and landing page. Created a new
> > build so they now include all the JIRA's that were missing before.
> > 2) Create the lean tar on ubuntu so you won't get the tar: Ignoring
> > unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'
> > message when untarring
> > 3) Previous issues with sha512 and tar not being gzipped.
> >
> > RC3 for Hadoop 3.4.2 is now available for voting.
> >
> > The RC is available at:
> >
> > https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC3/
> >
> > The git tag is release-3.4.2-RC3, commit
> > 84e8b89ee2ebe6923691205b9e171badde7a495c
> >
> > The maven artifacts are staged at:
> >
> > https://repository.apache.org/content/repositories/orgapachehadoop-1443
> >
> > You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
> >
> > https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
> >
> >
> > This release has been created off of branch-3.4.
> >
> > Thanks,
> > Ahmar
> >
> > On 20/08/2025, 09:25, "Masatake Iwasaki" wrote:
> >
> >
> > > Since I have to regenerate the site and changelog, do I need to do the
> > > whole build again or is there an easier way to do that? (just by
> > > regenerating the site manually and updating the artifacts in SVN maybe?)
> >
> >
> > I think just running the create-release[1] script could be an easy way
> > while it does not provide options for building only site or source
> > tarballs (now).
> > Since the binary tarballs are containing affected CHANGELOG and
> > RELEASENOTES,
> > it would be nice to update the binary tarballs.
> > # Swapping the CHANGELOG and RELEASENOTES of existing binary tarballs
> > # could be ok if only documentation is updated.
> > # Signing and checksums must be cared in that case.
> >
> >
> > In addition, I got following warnings when I extracted the binary
> > tarballs (hadoop-3.4.2-lean.tar.gz and hadoop-3.4.2-aarch64-lean.tar.gz)
> > on Ubuntu.::
> >
> >
> > tar: Ignoring unknown extended header keyword
> > 'LIBARCHIVE.xattr.com.apple.provenance'
> > tar: Ignoring unknown extended header keyword
> > 'LIBARCHIVE.xattr.com.apple.quarantine'
> > tar: Ignoring unknown extended header keyword
> > 'LIBARCHIVE.xattr.com.apple.provenance'
> > tar: Ignoring unknown extended header keyword
> > 'LIBARCHIVE.xattr.com.apple.quarantine'
> >
> >
> > I guess the cause is modifying the tar files on OSX environment.
> > It would be nice to fix too.
> >
> >
> > [1]
> > https://github.com/apache/hadoop/blob/branch-3.4.2/dev-support/bin/create-release
> >
> >
> > Masatake Iwasaki
> >
> >
> > On 8/19/2025 10:28 PM, Suhail, Ahmar wrote:
> > > Thanks Masatake and Steve,
> > >
> > > Here's the PR for updating the top page of the site
> > > https://github.com/apache/hadoop/pull/7887

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-20 Thread Anuj Modi
Hi, Ahmar

Here are the PRs fixing the NPE in ABFS close() that was reported against the
RC.
PR on branch-3.4.2: https://github.com/apache/hadoop/pull/7888
PR on branch-3.4: https://github.com/apache/hadoop/pull/7889

Feel free to merge them as per the release timelines.

Thanks

On Wed, Aug 20, 2025 at 11:24 PM Steve Loughran 
wrote:

> just discovering as I do the third party compatibility testing of the
> latest sdk that the fs.s3a.create.checksum.algorithm stuff is a bit broken
> -nothing to worry about though as its a new feature so not a regression
>
> On Wed, 20 Aug 2025 at 16:13, Suhail, Ahmar 
> wrote:
>
> > Hey all,
> >
> > I've fixed the following issues:
> > 1) Issues with changelog, release notes and landing page. Created a new
> > build so they now include all the JIRA's that were missing before.
> > 2) Create the lean tar on ubuntu so you won't get the tar: Ignoring
> > unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'
> > message when untarring
> > 3) Previous issues with sha512 and tar not being gzipped.
> >
> > RC3 for Hadoop 3.4.2 is now available for voting.
> >
> > The RC is available at:
> >
> > https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC3/
> >
> > The git tag is release-3.4.2-RC3, commit
> > 84e8b89ee2ebe6923691205b9e171badde7a495c
> >
> > The maven artifacts are staged at:
> >
> > https://repository.apache.org/content/repositories/orgapachehadoop-1443
> >
> > You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
> >
> > https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
> >
> >
> > This release has been created off of branch-3.4.
> >
> > Thanks,
> > Ahmar
> >
> > On 20/08/2025, 09:25, "Masatake Iwasaki" wrote:
> >
> >
> > > Since I have to regenerate the site and changelog, do I need to do the
> > > whole build again or is there an easier way to do that? (just by
> > > regenerating the site manually and updating the artifacts in SVN maybe?)
> >
> >
> > I think just running the create-release[1] script could be an easy way
> > while it does not provide options for building only site or source
> > tarballs (now).
> > Since the binary tarballs are containing affected CHANGELOG and
> > RELEASENOTES,
> > it would be nice to update the binary tarballs.
> > # Swapping the CHANGELOG and RELEASENOTES of existing binary tarballs
> > # could be ok if only documentation is updated.
> > # Signing and checksums must be cared in that case.
> >
> >
> > In addition, I got following warnings when I extracted the binary
> > tarballs (hadoop-3.4.2-lean.tar.gz and hadoop-3.4.2-aarch64-lean.tar.gz)
> > on Ubuntu.::
> >
> >
> > tar: Ignoring unknown extended header keyword
> > 'LIBARCHIVE.xattr.com.apple.provenance'
> > tar: Ignoring unknown extended header keyword
> > 'LIBARCHIVE.xattr.com.apple.quarantine'
> > tar: Ignoring unknown extended header keyword
> > 'LIBARCHIVE.xattr.com.apple.provenance'
> > tar: Ignoring unknown extended header keyword
> > 'LIBARCHIVE.xattr.com.apple.quarantine'
> >
> >
> > I guess the cause is modifying the tar files on OSX environment.
> > It would be nice to fix too.
> >
> >
> > [1]
> > https://github.com/apache/hadoop/blob/branch-3.4.2/dev-support/bin/create-release
> >
> >
> > Masatake Iwasaki
> >
> >
> > On 8/19/2025 10:28 PM, Suhail, Ahmar wrote:
> > > Thanks Masatake and Steve,
> > >
> > > Here's the PR for updating the top page of the site
> > > https://github.com/apache/hadoop/pull/7887 , if there's anything else
> > > we think needs to go in there, let me know, otherwise I will merge by
> > > tomorrow.
> > >
> > > Since I have to regenerate the site and changelog, do I need to do the
> > > whole build again or is there an easier way to do that? (just by
> > > regenerating the site manually and updating the artifacts in SVN maybe?)
> > >
> > > I will update the artifacts for the hadoop-3.4.2-lean.tar.gz and
> > > hadoop-3.4.2-aarch64-lean.tar.sha512 now.
> > >
> > > On 19/08/2025, 08:26, "Masatake Iwasaki" wrote:
> > >
> > >
> > > Thanks for the update, Ahmar
> > >
> > >
> > > The binary artifacts basically looks good.
> > > I have some comments about inconsistencies.
> > >
> > >
> > > - Some JIRAs of additional commits did not have 3.4.2 as "Fix
> > > Version/s". I updated them.
> > >
> > >
> > > Jira/G

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-20 Thread Steve Loughran
Just discovering, as I do the third-party compatibility testing of the
latest SDK, that the fs.s3a.create.checksum.algorithm stuff is a bit broken.
Nothing to worry about though: it's a new feature, so not a regression.

On Wed, 20 Aug 2025 at 16:13, Suhail, Ahmar 
wrote:

> Hey all,
>
> I've fixed the following issues:
> 1) Issues with changelog, release notes and landing page. Created a new
> build so they now include all the JIRA's that were missing before.
> 2) Create the lean tar on ubuntu so you won't get the tar: Ignoring
> unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'
> message when untarring
> 3) Previous issues with sha512 and tar not being gzipped.
>
> RC3 for Hadoop 3.4.2 is now available for voting.
>
> The RC is available at:
>
> https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC3/
>
> The git tag is release-3.4.2-RC3, commit
> 84e8b89ee2ebe6923691205b9e171badde7a495c
>
> The maven artifacts are staged at:
>
> https://repository.apache.org/content/repositories/orgapachehadoop-1443
>
> You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
>
> https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
>
>
> This release has been created off of branch-3.4.
>
> Thanks,
> Ahmar
>
> On 20/08/2025, 09:25, "Masatake Iwasaki" wrote:
>
>
> > Since I have to regenerate the site and changelog, do I need to do the
> > whole build again or is there an easier way to do that? (just by
> > regenerating the site manually and updating the artifacts in SVN maybe?)
>
>
> I think just running the create-release[1] script could be an easy way
> while it does not provide options for building only site or source
> tarballs (now).
> Since the binary tarballs are containing affected CHANGELOG and
> RELEASENOTES,
> it would be nice to update the binary tarballs.
> # Swapping the CHANGELOG and RELEASENOTES of existing binary tarballs
> # could be ok if only documentation is updated.
> # Signing and checksums must be cared in that case.
>
>
> In addition, I got following warnings when I extracted the binary tarballs
> (hadoop-3.4.2-lean.tar.gz and hadoop-3.4.2-aarch64-lean.tar.gz) on
> Ubuntu.::
>
>
> tar: Ignoring unknown extended header keyword
> 'LIBARCHIVE.xattr.com.apple.provenance'
> tar: Ignoring unknown extended header keyword
> 'LIBARCHIVE.xattr.com.apple.quarantine'
> tar: Ignoring unknown extended header keyword
> 'LIBARCHIVE.xattr.com.apple.provenance'
> tar: Ignoring unknown extended header keyword
> 'LIBARCHIVE.xattr.com.apple.quarantine'
>
>
> I guess the cause is modifying the tar files on OSX environment.
> It would be nice to fix too.
>
>
> [1]
> https://github.com/apache/hadoop/blob/branch-3.4.2/dev-support/bin/create-release
>
>
> Masatake Iwasaki
>
>
> On 8/19/2025 10:28 PM, Suhail, Ahmar wrote:
> > Thanks Masatake and Steve,
> >
> > Here's the PR for updating the top page of the site
> > https://github.com/apache/hadoop/pull/7887 , if there's anything else we
> > think needs to go in there, let me know, otherwise I will merge by
> > tomorrow.
> >
> > Since I have to regenerate the site and changelog, do I need to do the
> > whole build again or is there an easier way to do that? (just by
> > regenerating the site manually and updating the artifacts in SVN maybe?)
> >
> > I will update the artifacts for the hadoop-3.4.2-lean.tar.gz and
> > hadoop-3.4.2-aarch64-lean.tar.sha512 now.
> >
> > On 19/08/2025, 08:26, "Masatake Iwasaki" wrote:
> >
> >
> > Thanks for the update, Ahmar
> >
> >
> > The binary artifacts basically looks good.
> > I have some comments about inconsistencies.
> >
> >
> > - Some JIRAs of additional commits did not have 3.4.2 as "Fix
> > Version/s". I updated them.
> >
> >
> > Jira/Git commit message diff starting:
> > ##
> > Jira not present with version: 3.4.2. Commit:
> > e1c0dee881820a4d834ec4a4d2c70d0d953bb933 HADOOP-19627. S3A:
> > testIfMatchOverwriteWithOutdatedEtag() fails when not using SSE-KMS #7816
> > ...
> > Jira not present with version: 3.4.2. Commit:
> > 6a4ea9b1155aefb7732b700d8d7154f977141e39 HADOOP-19626. S3A: Updates AAL
> > version to 1.2.1 (#7807) (#7815)
> > Jira not present with version: 3.4.2. Commit:
> > 9677363829c21adeb2ce41f93059ab65fc115862 HADOOP-19554. LocalDirAllocator
> > still doesn't always recover from directory deletion (#7683)

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-20 Thread Suhail, Ahmar
Hey all,

I've fixed the following issues:
1) Issues with changelog, release notes and landing page. Created a new build
so they now include all the JIRAs that were missing before.
2) Created the lean tar on Ubuntu, so you won't get the "tar: Ignoring unknown
extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'" message when
untarring.
3) Previous issues with sha512 and tar not being gzipped.

RC3 for Hadoop 3.4.2 is now available for voting.

The RC is available at:

https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC3/

The git tag is release-3.4.2-RC3, commit
84e8b89ee2ebe6923691205b9e171badde7a495c

The maven artifacts are staged at:

https://repository.apache.org/content/repositories/orgapachehadoop-1443

You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:

https://dist.apache.org/repos/dist/release/hadoop/common/KEYS


This release has been created off of branch-3.4.

Thanks,
Ahmar

On 20/08/2025, 09:25, "Masatake Iwasaki" wrote:


> Since I have to regenerate the site and changelog, do I need to do the whole 
> build again or is there an easier way to do that? (just by regenerating the 
> site manually and updating the artifacts in SVN maybe?)


I think just running the create-release[1] script could be an easy way
while it does not provide options for building only site or source tarballs 
(now).
Since the binary tarballs are containing affected CHANGELOG and RELEASENOTES,
it would be nice to update the binary tarballs.
# Swapping the CHANGELOG and RELEASENOTES of existing binary tarballs
# could be ok if only documentation is updated.
# Signing and checksums must be cared in that case.


In addition, I got following warnings when I extracted the binary tarballs
(hadoop-3.4.2-lean.tar.gz and hadoop-3.4.2-aarch64-lean.tar.gz) on Ubuntu.::


tar: Ignoring unknown extended header keyword 
'LIBARCHIVE.xattr.com.apple.provenance'
tar: Ignoring unknown extended header keyword 
'LIBARCHIVE.xattr.com.apple.quarantine'
tar: Ignoring unknown extended header keyword 
'LIBARCHIVE.xattr.com.apple.provenance'
tar: Ignoring unknown extended header keyword 
'LIBARCHIVE.xattr.com.apple.quarantine'


I guess the cause is modifying the tar files on OSX environment.
It would be nice to fix too.


[1]
https://github.com/apache/hadoop/blob/branch-3.4.2/dev-support/bin/create-release



Masatake Iwasaki


On 8/19/2025 10:28 PM, Suhail, Ahmar wrote:
> Thanks Masatake and Steve,
>
> Here's the PR for updating the top page of the site
> https://github.com/apache/hadoop/pull/7887 , if there's anything else we
> think needs to go in there, let me know, otherwise I will merge by tomorrow.
>
> Since I have to regenerate the site and changelog, do I need to do the whole 
> build again or is there an easier way to do that? (just by regenerating the 
> site manually and updating the artifacts in SVN maybe?)
>
> I will update the artifacts for the hadoop-3.4.2-lean.tar.gz and 
> hadoop-3.4.2-aarch64-lean.tar.sha512 now.
>
> On 19/08/2025, 08:26, "Masatake Iwasaki" wrote:
>
>
> Thanks for the update, Ahmar
>
>
> The binary artifacts basically looks good.
> I have some comments about inconsistencies.
>
>
> - Some JIRAs of additional commits did not have 3.4.2 as "Fix Version/s". I 
> updated them.
>
>
> Jira/Git commit message diff starting: 
> ##
> Jira not present with version: 3.4.2. Commit: 
> e1c0dee881820a4d834ec4a4d2c70d0d953bb933 HADOOP-19627. S3A: 
> testIfMatchOverwriteWithOutdatedEtag() fails when not using SSE-KMS #7816
> ...
> Jira not present with version: 3.4.2. Commit: 
> 6a4ea9b1155aefb7732b700d8d7154f977141e39 HADOOP-19626. S3A: Updates AAL 
> version to 1.2.1 (#7807) (#7815)
> Jira not present with version: 3.4.2. Commit: 
> 9677363829c21adeb2ce41f93059ab65fc115862 HADOOP-19554. LocalDirAllocator 
> still doesn't always recover from directory deletion (#7683) (#7685)
> ...
> Jira not present with version: 3.4.2. Commit: 
> 57d821e6aad23b604833272071c8481567363e63 HADOOP-19143. Upgrade commons-cli to 
> 1.9.0. (#7126) (#7146) Contributed by Min Yan.
>
>
> - The top page of site documentation was not updated for 3.4.2.
> https://github.com/apache/hadoop/blob/release-3.4.2-RC2/hadoop-project/src/site/markdown/index.md.vm

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-20 Thread Masatake Iwasaki

> Since I have to regenerate the site and changelog, do I need to do the whole
> build again or is there an easier way to do that? (just by regenerating the
> site manually and updating the artifacts in SVN maybe?)


I think just running the create-release[1] script could be the easiest way,
although it does not (currently) provide options for building only the site
or source tarballs.
Since the binary tarballs contain the affected CHANGELOG and RELEASENOTES,
it would be nice to update the binary tarballs.
# Swapping the CHANGELOG and RELEASENOTES of existing binary tarballs
# could be ok if only documentation is updated.
# Signing and checksums must be taken care of in that case.

In addition, I got the following warnings when I extracted the binary tarballs
(hadoop-3.4.2-lean.tar.gz and hadoop-3.4.2-aarch64-lean.tar.gz) on Ubuntu::

  tar: Ignoring unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.provenance'
  tar: Ignoring unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'
  tar: Ignoring unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.provenance'
  tar: Ignoring unknown extended header keyword 'LIBARCHIVE.xattr.com.apple.quarantine'

I guess the cause is that the tar files were modified in a macOS environment.
It would be nice to fix that too.
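
For anyone wanting to check an archive before uploading it, a rough sketch of
a scan for those header keywords follows. It relies on Python's tarfile
module exposing pax extended header records through TarInfo.pax_headers; the
function name, the prefix constant and the demo file names are made up for
illustration and are not part of any Hadoop tooling.

```python
import io
import os
import tarfile
import tempfile

# Keyword prefix taken from the warnings above; the constant name is hypothetical.
APPLE_XATTR_PREFIX = "LIBARCHIVE.xattr.com.apple."


def apple_xattr_members(tar_path, prefix=APPLE_XATTR_PREFIX):
    """Map each member name to the pax header keywords that start with `prefix`."""
    found = {}
    with tarfile.open(tar_path, "r:*") as tar:  # "r:*" auto-detects compression
        for member in tar:
            keys = sorted(k for k in member.pax_headers if k.startswith(prefix))
            if keys:
                found[member.name] = keys
    return found


if __name__ == "__main__":
    # Build a tiny pax-format archive that mimics the problem, then scan it.
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "demo.tar")
        with tarfile.open(path, "w", format=tarfile.PAX_FORMAT) as tar:
            info = tarfile.TarInfo("README.txt")
            info.size = 5
            info.pax_headers = {APPLE_XATTR_PREFIX + "quarantine": "demo"}
            tar.addfile(info, io.BytesIO(b"hello"))
        print(apple_xattr_members(path))
```

An empty result would suggest the archive carries no such records; it says
nothing about other platform-specific metadata.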

[1] 
https://github.com/apache/hadoop/blob/branch-3.4.2/dev-support/bin/create-release

Masatake Iwasaki

On 8/19/2025 10:28 PM, Suhail, Ahmar wrote:

Thanks Masatake and Steve,

Here's the PR for updating the top page of the site 
https://github.com/apache/hadoop/pull/7887 , if there's anything else we think 
needs to go in there, let me know, otherwise I will merge by tomorrow.

Since I have to regenerate the site and changelog, do I need to do the whole 
build again or is there an easier way to do that? (just by regenerating the 
site manually and updating the artifacts in SVN maybe?)

I will update the artifacts for the hadoop-3.4.2-lean.tar.gz and 
hadoop-3.4.2-aarch64-lean.tar.sha512 now.

On 19/08/2025, 08:26, "Masatake Iwasaki" wrote:


Thanks for the update, Ahmar


The binary artifacts basically looks good.
I have some comments about inconsistencies.


- Some JIRAs of additional commits did not have 3.4.2 as "Fix Version/s". I 
updated them.


Jira/Git commit message diff starting: 
##
Jira not present with version: 3.4.2. Commit: 
e1c0dee881820a4d834ec4a4d2c70d0d953bb933 HADOOP-19627. S3A: 
testIfMatchOverwriteWithOutdatedEtag() fails when not using SSE-KMS #7816
...
Jira not present with version: 3.4.2. Commit: 
6a4ea9b1155aefb7732b700d8d7154f977141e39 HADOOP-19626. S3A: Updates AAL version 
to 1.2.1 (#7807) (#7815)
Jira not present with version: 3.4.2. Commit: 
9677363829c21adeb2ce41f93059ab65fc115862 HADOOP-19554. LocalDirAllocator still 
doesn't always recover from directory deletion (#7683) (#7685)
...
Jira not present with version: 3.4.2. Commit: 
57d821e6aad23b604833272071c8481567363e63 HADOOP-19143. Upgrade commons-cli to 
1.9.0. (#7126) (#7146) Contributed by Min Yan.


- The top page of site documentation was not updated for 3.4.2.
https://github.com/apache/hadoop/blob/release-3.4.2-RC2/hadoop-project/src/site/markdown/index.md.vm
 



- Some release artifact are inconsistent as Steve pointed out.


- `hadoop-3.4.2-lean.tar.gz` is not gzipped.


- `hadoop-3.4.2-aarch64-lean.tar.sha512` should be 
`hadoop-3.4.2-aarch64-lean.tar.gz.sha512`


Masatake Iwasaki


On 8/14/2025 6:12 PM, Ahmar Suhail wrote:

Hey all,

Just a reminder about this. Would be great to get some votes in asap and
hopefully finalise 3.4.2.

Thanks,
Ahmar

On Thu, Aug 7, 2025 at 9:35 PM Ahmar Suhail wrote:


Hey all,

RC2 for Hadoop 3.4.2 is now available for voting.

There are a couple of things to note:

1/ We have ARM artifacts again! Thanks to Masatake and Wei-Chiu Chuang for
working on the fix.

2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
For S3A, download SDK bundle v2.29.52 from:
https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
and drop it into /share/hadoop/common/lib. Release notes will be updated
with these instructions.


The RC is available at:

https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC2/ 


The git tag is release-3.4.2-RC2, commit
e1c0dee881820a4d834ec4a4d2c70d0d953bb933

The maven artifacts are staged at:

https://repository.apache.org/content/

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-19 Thread Suhail, Ahmar
Thanks Masatake and Steve, 

Here's the PR for updating the top page of the site 
https://github.com/apache/hadoop/pull/7887 , if there's anything else we think 
needs to go in there, let me know, otherwise I will merge by tomorrow. 

Since I have to regenerate the site and changelog, do I need to do the whole 
build again or is there an easier way to do that? (just by regenerating the 
site manually and updating the artifacts in SVN maybe?) 

I will update the artifacts for the hadoop-3.4.2-lean.tar.gz and 
hadoop-3.4.2-aarch64-lean.tar.sha512 now.

On 19/08/2025, 08:26, "Masatake Iwasaki" wrote:


Thanks for the update, Ahmar


The binary artifacts basically looks good.
I have some comments about inconsistencies.


- Some JIRAs of additional commits did not have 3.4.2 as "Fix Version/s". I 
updated them.


Jira/Git commit message diff starting: 
##
Jira not present with version: 3.4.2. Commit: 
e1c0dee881820a4d834ec4a4d2c70d0d953bb933 HADOOP-19627. S3A: 
testIfMatchOverwriteWithOutdatedEtag() fails when not using SSE-KMS #7816
...
Jira not present with version: 3.4.2. Commit: 
6a4ea9b1155aefb7732b700d8d7154f977141e39 HADOOP-19626. S3A: Updates AAL version 
to 1.2.1 (#7807) (#7815)
Jira not present with version: 3.4.2. Commit: 
9677363829c21adeb2ce41f93059ab65fc115862 HADOOP-19554. LocalDirAllocator still 
doesn't always recover from directory deletion (#7683) (#7685)
...
Jira not present with version: 3.4.2. Commit: 
57d821e6aad23b604833272071c8481567363e63 HADOOP-19143. Upgrade commons-cli to 
1.9.0. (#7126) (#7146) Contributed by Min Yan.


- The top page of site documentation was not updated for 3.4.2.
https://github.com/apache/hadoop/blob/release-3.4.2-RC2/hadoop-project/src/site/markdown/index.md.vm
 



- Some release artifact are inconsistent as Steve pointed out.


- `hadoop-3.4.2-lean.tar.gz` is not gzipped.


- `hadoop-3.4.2-aarch64-lean.tar.sha512` should be 
`hadoop-3.4.2-aarch64-lean.tar.gz.sha512`


Masatake Iwasaki


On 8/14/2025 6:12 PM, Ahmar Suhail wrote:
> Hey all,
>
> Just a reminder about this. Would be great to get some votes in asap and
> hopefully finalise 3.4.2.
>
> Thanks,
> Ahmar
>
> On Thu, Aug 7, 2025 at 9:35 PM Ahmar Suhail wrote:
>
>> Hey all,
>>
>> RC2 for Hadoop 3.4.2 is now available for voting.
>>
>> There are a couple of things to note:
>>
>> 1/ We have ARM artifacts again! Thanks to Masatake and Wei-Chiu Chuang for
>> working on the fix.
>>
>> 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
>> from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
>> which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
>> For S3A, download SDK bundle v2.29.52 from:
>> https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
>> and drop it into /share/hadoop/common/lib. Release notes will be updated
>> with these instructions.
>>
>>
>> The RC is available at:
>>
>> https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC2/
>>
>> The git tag is release-3.4.2-RC2, commit
>> e1c0dee881820a4d834ec4a4d2c70d0d953bb933
>>
>> The maven artifacts are staged at:
>>
>> https://repository.apache.org/content/repositories/orgapachehadoop-1441
>>
>>
>> You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
>>
>> https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
>>
>>
>> This release has been created off of branch-3.4.
>>
>> Thanks,
>> Ahmar
>>
>







Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-19 Thread Masatake Iwasaki

Thanks for the update, Ahmar

The binary artifacts basically look good.
I have some comments about inconsistencies.

- Some JIRAs of additional commits did not have 3.4.2 as "Fix Version/s". I 
updated them.

Jira/Git commit message diff starting:
##
Jira not present with version: 3.4.2. Commit:
e1c0dee881820a4d834ec4a4d2c70d0d953bb933 HADOOP-19627. S3A:
testIfMatchOverwriteWithOutdatedEtag() fails when not using SSE-KMS #7816
...
Jira not present with version: 3.4.2. Commit:
6a4ea9b1155aefb7732b700d8d7154f977141e39 HADOOP-19626. S3A: Updates AAL version
to 1.2.1 (#7807) (#7815)
Jira not present with version: 3.4.2. Commit:
9677363829c21adeb2ce41f93059ab65fc115862 HADOOP-19554. LocalDirAllocator still
doesn't always recover from directory deletion (#7683) (#7685)
...
Jira not present with version: 3.4.2. Commit:
57d821e6aad23b604833272071c8481567363e63 HADOOP-19143. Upgrade commons-cli to
1.9.0. (#7126) (#7146) Contributed by Min Yan.
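
If it helps whoever updates "Fix Version/s" next time, here is a small sketch
that pulls the (Jira key, commit) pairs out of a report like the one above.
The tool that produced the report is not named in this thread, so the line
shape the regex expects is inferred purely from the excerpt, and the function
name is hypothetical.

```python
import re

# Inferred from the excerpt: "Commit: <40-hex sha> <PROJECT-NNNN>" (possibly
# split across lines by the mail client).
ENTRY_RE = re.compile(r"Commit:\s*(?P<sha>[0-9a-f]{40})\s+(?P<jira>[A-Z]+-\d+)")


def missing_fix_versions(report_text):
    """Return (jira_key, commit_sha) pairs found in the report text."""
    flat = " ".join(report_text.split())  # undo mail-client line wrapping
    return [(m.group("jira"), m.group("sha")) for m in ENTRY_RE.finditer(flat)]


if __name__ == "__main__":
    sample = (
        "Jira not present with version: 3.4.2. Commit:\n"
        "57d821e6aad23b604833272071c8481567363e63 HADOOP-19143. Upgrade commons-cli to\n"
        "1.9.0. (#7126) (#7146) Contributed by Min Yan.\n"
    )
    for jira, sha in missing_fix_versions(sample):
        print(jira, sha)
```

The resulting keys could then be checked by hand in Jira; the sketch does not
talk to Jira itself.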

- The top page of site documentation was not updated for 3.4.2.
  
https://github.com/apache/hadoop/blob/release-3.4.2-RC2/hadoop-project/src/site/markdown/index.md.vm

- Some release artifacts are inconsistent, as Steve pointed out.

  - `hadoop-3.4.2-lean.tar.gz` is not gzipped.

  - `hadoop-3.4.2-aarch64-lean.tar.sha512` should be 
`hadoop-3.4.2-aarch64-lean.tar.gz.sha512`

Masatake Iwasaki

On 8/14/2025 6:12 PM, Ahmar Suhail wrote:

Hey all,

Just a reminder about this. Would be great to get some votes in asap and
hopefully finalise 3.4.2.

Thanks,
Ahmar

On Thu, Aug 7, 2025 at 9:35 PM Ahmar Suhail  wrote:


Hey all,

RC2 for Hadoop 3.4.2 is now available for voting.

There are a couple of things to note:

1/ We have ARM artifacts again! Thanks to Masatake and Wei-Chiu Chuang for
working on the fix.

2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
For S3A, download SDK bundle v2.29.52 from:
https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
and drop it into /share/hadoop/common/lib. Release notes will be updated
with these instructions.


The RC is available at:

https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC2/

The git tag is release-3.4.2-RC2, commit
e1c0dee881820a4d834ec4a4d2c70d0d953bb933

The maven artifacts are staged at:

https://repository.apache.org/content/repositories/orgapachehadoop-1441


You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:

https://dist.apache.org/repos/dist/release/hadoop/common/KEYS


This release has been created off of branch-3.4.

Thanks,
Ahmar









Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-15 Thread Steve Loughran
sorry, been behind on this.

-1 packaging related.

that hadoop-3.4.2-lean.tar.gz file is actually a tar file. Took me a while
to work this down as I just assumed my ant build script was at fault.

file downloads/hadoop-3.4.2-RC2/hadoop-3.4.2-lean.tar.gz
downloads/hadoop-3.4.2-RC2/hadoop-3.4.2-lean.tar.gz: POSIX tar archive
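
The same check that `file` performs here can be scripted for a validation run. This is a rough sketch, relying only on the fact that a gzip stream starts with the two bytes 0x1f 0x8b; the function name `is_gzipped` is made up for illustration.

```python
GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream (RFC 1952)


def is_gzipped(path):
    """Return True if the file at `path` starts with the gzip magic bytes."""
    with open(path, "rb") as f:
        return f.read(2) == GZIP_MAGIC
```

A file that is named `.tar.gz` but is really a plain tar archive, as reported above, would make this return False.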

I have verified that copying the right aws sdk version into tools/lib puts
it on the classpath.

Also found an NPE in the sequence new AzureBlobFileSystem().close() (i.e.
no Initialize)
Filed HADOOP-19650 for that

Also,  there's no sha512 of the ant lean stuff.

Can't get
https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC2/hadoop-3.4.2-aarch64-lean.tar.gz.sha512
to downloads/incoming/hadoop-3.4.2-aarch64-lean.tar.gz.sha512
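
Once a checksum file is available, the comparison itself is simple. The sketch below is an assumption-laden illustration: it does not rely on a particular layout of the `.sha512` file and only checks that the computed hex digest appears somewhere in the published text after whitespace is removed and case is folded. The names `sha512_hex` and `checksum_matches` are hypothetical.

```python
import hashlib


def sha512_hex(path, chunk_size=1 << 20):
    """Compute the SHA-512 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha512()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checksum_matches(artifact_path, checksum_path):
    """Return True if the artifact's digest appears in the checksum file.

    Whitespace is stripped and case is folded first, so digests that are
    printed in groups or in upper case still match (an assumption about
    the possible file layouts, not a statement of the actual format).
    """
    with open(checksum_path, "r", encoding="utf-8") as f:
        published = "".join(f.read().split()).lower()
    return sha512_hex(artifact_path) in published
```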

Actual binaries, maven artifacts, good.

Parquet tests are failing with that buffer release logic which we have now
made optional in our code (HADOOP-18296). I'm wondering if we should have
parquet make vector read optional rather than on by default -but advise
turning on for all cloud stores.

Really unsure what to do there.

On Thu, 14 Aug 2025 at 10:13, Ahmar Suhail  wrote:

> Hey all,
>
> Just a reminder about this. Would be great to get some votes in asap and
> hopefully finalise 3.4.2.
>
> Thanks,
> Ahmar
>
> On Thu, Aug 7, 2025 at 9:35 PM Ahmar Suhail  wrote:
>
> > Hey all,
> >
> > RC2 for Hadoop 3.4.2 is now available for voting.
> >
> > There are a couple of things to note:
> >
> > 1/ We have ARM artifacts again! Thanks to Masatake and Wei-Chiu Chuang
> for
> > working on the fix.
> >
> > 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
> > from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
> > which makes the size of tar > 1GB, and it can no longer be uploaded to
> SVN.
> > For S3A, download SDK bundle v2.29.52 from:
> > https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52
> ,
> > and drop it into /share/hadoop/common/lib. Release notes will be updated
> > with these instructions.
> >
> >
> > The RC is available at:
> >
> > https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC2/
> >
> > The git tag is release-3.4.2-RC2, commit
> > e1c0dee881820a4d834ec4a4d2c70d0d953bb933
> >
> > The maven artifacts are staged at:
> >
> > https://repository.apache.org/content/repositories/orgapachehadoop-1441
> >
> >
> > You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
> >
> > https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
> >
> >
> > This release has been created off of branch-3.4.
> >
> > Thanks,
> > Ahmar
> >
>


Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-14 Thread Xiaoqiao He
Thanks Ahmar for your work on this. Will check it ASAP.

Best Regards,
- He Xiaoqiao

On Thu, Aug 14, 2025 at 5:14 PM Ahmar Suhail  wrote:

> Hey all,
>
> Just a reminder about this. Would be great to get some votes in asap and
> hopefully finalise 3.4.2.
>
> Thanks,
> Ahmar
>
> On Thu, Aug 7, 2025 at 9:35 PM Ahmar Suhail  wrote:
>
> > Hey all,
> >
> > RC2 for Hadoop 3.4.2 is now available for voting.
> >
> > There are a couple of things to note:
> >
> > 1/ We have ARM artifacts again! Thanks to Masatake and Wei-Chiu Chuang
> for
> > working on the fix.
> >
> > 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
> > from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
> > which makes the size of tar > 1GB, and it can no longer be uploaded to
> SVN.
> > For S3A, download SDK bundle v2.29.52 from:
> > https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52
> ,
> > and drop it into /share/hadoop/common/lib. Release notes will be updated
> > with these instructions.
> >
> >
> > The RC is available at:
> >
> > https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC2/
> >
> > The git tag is release-3.4.2-RC2, commit
> > e1c0dee881820a4d834ec4a4d2c70d0d953bb933
> >
> > The maven artifacts are staged at:
> >
> > https://repository.apache.org/content/repositories/orgapachehadoop-1441
> >
> >
> > You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
> >
> > https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
> >
> >
> > This release has been created off of branch-3.4.
> >
> > Thanks,
> > Ahmar
> >
>


Re: [VOTE] Release Apache Hadoop 3.4.2

2025-08-14 Thread Ahmar Suhail
Hey all,

Just a reminder about this. Would be great to get some votes in asap and
hopefully finalise 3.4.2.

Thanks,
Ahmar

On Thu, Aug 7, 2025 at 9:35 PM Ahmar Suhail  wrote:

> Hey all,
>
> RC2 for Hadoop 3.4.2 is now available for voting.
>
> There are a couple of things to note:
>
> 1/ We have ARM artifacts again! Thanks to Masatake and Wei-Chiu Chuang for
> working on the fix.
>
> 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
> from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
> which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
> For S3A, download SDK bundle v2.29.52 from:
> https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
> and drop it into /share/hadoop/common/lib. Release notes will be updated
> with these instructions.
>
>
> The RC is available at:
>
> https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC2/
>
> The git tag is release-3.4.2-RC2, commit
> e1c0dee881820a4d834ec4a4d2c70d0d953bb933
>
> The maven artifacts are staged at:
>
> https://repository.apache.org/content/repositories/orgapachehadoop-1441
>
>
> You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
>
> https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
>
>
> This release has been created off of branch-3.4.
>
> Thanks,
> Ahmar
>


[VOTE] Release Apache Hadoop 3.4.2

2025-08-07 Thread Ahmar Suhail
Hey all,

RC2 for Hadoop 3.4.2 is now available for voting.

There are a couple of things to note:

1/ We have ARM artifacts again! Thanks to Masatake and Wei-Chiu Chuang for
working on the fix.

2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
For S3A, download SDK bundle v2.29.52 from:
https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
and drop it into /share/hadoop/common/lib. Release notes will be updated
with these instructions.


The RC is available at:

https://dist.apache.org/repos/dist/dev/hadoop/3.4.2-RC2/

The git tag is release-3.4.2-RC2, commit
e1c0dee881820a4d834ec4a4d2c70d0d953bb933

The maven artifacts are staged at:

https://repository.apache.org/content/repositories/orgapachehadoop-1441


You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:

https://dist.apache.org/repos/dist/release/hadoop/common/KEYS


This release has been created off of branch-3.4.

Thanks,
Ahmar


Re: [VOTE] Release Apache Hadoop 3.4.2

2025-06-06 Thread Steve Loughran
regarding parquet, the jira there is :
https://github.com/apache/parquet-java/issues/3237

and while the change in that lib is "release the buffers", it hits a
problem with coalesced reads as there the releasing doesn't work.

my fix will be: disable coalescing, keep the code (for now), and add a
path/stream capability to indicate merging.

with a max size for range merging of 2MB, it'll only surface in production
if ORC/parquet had rowgroups of size < 2MB, so not realistic at all. But it
exists.
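
To make the size argument concrete, here is a simplified sketch of range coalescing. It is not Hadoop's implementation; the function name `coalesce` and the `min_seek` threshold are assumptions for illustration, and only the 2MB cap comes from the text above. Nearby ranges are merged into one read as long as the merged span stays within the cap, so several caller ranges end up backed by a single larger read.

```python
def coalesce(ranges, min_seek, max_size):
    """Merge sorted (offset, length) ranges into combined reads.

    Two ranges are merged when the gap between them is below `min_seek`
    and the merged span does not exceed `max_size`. Each result is
    (start, end, member_ranges). Simplified illustration only.
    """
    merged = []
    for offset, length in sorted(ranges):
        if merged:
            start, end, members = merged[-1]
            new_end = max(end, offset + length)
            if offset - end < min_seek and new_end - start <= max_size:
                merged[-1] = (start, new_end, members + [(offset, length)])
                continue
        merged.append((offset, offset + length, [(offset, length)]))
    return merged


MAX_MERGE = 2 * 1024 * 1024  # the 2MB cap mentioned above
groups = coalesce([(0, 1000), (1500, 1000), (5_000_000, 1000)], 4096, MAX_MERGE)
for start, end, members in groups:
    print(start, end, len(members))
```

Here the first two ranges share one merged read while the distant third stays on its own, which illustrates why releasing buffers range by range gets awkward once reads are merged.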

On Fri, 6 Jun 2025 at 09:54, Suhail, Ahmar 
wrote:

> Thanks everyone for testing out this RC.
>
>
> For this RC, the current status is:
>
>
>   *   Buffers allocated by ParquetFileReader.readVectored() are not being
> released. While this is not new, we should root-cause and fix it in case
> it's a Hadoop issue for 3.4.2.
>   *   Need to figure out what to do with the AWS SDK bundle; the transitive
> dependency should ideally be restored. I will discuss with Steve.
>   *   Documentation updates - I will fix.
>   *   PRs are up for the YARN issues which were failing the ARM64 builds,
> thank you Masatake, so the next RC will include ARM64 binaries.
>
>
>
> Once the parquet buffer issue is resolved, I will begin work on RC-2. If
> there are any other issues in this RC that I've missed in the above list
> please let me know so I can be sure to include the fix in the next RC.
>
>
> Thank you,
> Ahmar
>
> 
> From: Masatake Iwasaki 
> Sent: Thursday, June 5, 2025 4:46:06 PM
> To: Wei-Chiu Chuang; Ahmar Suhail
> Cc: [email protected]; [email protected];
> [email protected]; [email protected]
> Subject: RE: [EXTERNAL] [VOTE] Release Apache Hadoop 3.4.2
>
>
>
>
> Hi Wei-Chiu Chuang,
>
> > I really wish we can make ARM64 binaries. Would like to find time to work
> > on the two jiras mentioned.
>
> I submitted PRs for YARN-11712 and YARN-11713.
> Could you check the patches?
>
> Thanks,
> Masatake Iwasaki
>
> On 2025/06/05 1:14, Wei-Chiu Chuang wrote:
> > I really wish we can make ARM64 binaries. Would like to find time to work
> > on the two jiras mentioned.
> >
> > On Wed, May 28, 2025 at 5:25 AM Ahmar Suhail  wrote:
> >
> >> Hey all,
> >>
> >> The first release candidate for Hadoop 3.4.2 is now available for
> voting.
> >>
> >> There are a couple of things to note:
> >>
> >> 1/ No Arm64 artifacts. This is due to previously reported issues:
> >> https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
> >> https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
> >> mean that the build fails on arm64.
> >>
> >> 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
> >> from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
> >> which makes the size of tar > 1GB, and it can no longer be uploaded to
> SVN.
> >> For S3A, download SDK bundle v2.29.52 from:
> >>
> https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
> >> and drop it into /share/hadoop/common/lib. Release notes will be updated
> >> with these instructions.
> >>
> >>
> >> The RC is available at:
> >>
> >> https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.2-RC1/
> >>
> >> The git tag is release-3.4.2-RC1, commit
> >> 09870840ec35b48cd107972eb24d25e8aece04c9
> >>
> >> The maven artifacts are staged at:
> >>
> >> https://repository.apache.org/content/repositories/orgapachehadoop-1437
> >>
> >>
> >> You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98)
> at:
> >>
> >> https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
> >>
> >>
> >> This release has been created off of branch-3.4. Key changes include:
> >>
> >> * S3A: Integration with S3 Analytics Accelerator input stream
> >> * S3A: Support for S3 conditional writes
> >> * ABFS: Deprecation of WASB driver
> >> * ABFS: Support for Non-Hierarchical Namespace Accounts on ABFS Driver
> >>
> >>
> >> This is my first attempt at managing a release, please do test the
> release
> >> and let me know in case of any issues.
> >>
> >> Thanks,
> >> Ahmar
> >>
> >
>
>


Re: [VOTE] Release Apache Hadoop 3.4.2

2025-06-06 Thread Suhail, Ahmar
Thanks everyone for testing out this RC.


For this RC, the current status is:


  *   Buffers allocated by ParquetFileReader.readVectored() are not being 
released. While this is not new, we should root-cause and fix it in case 
it's a Hadoop issue for 3.4.2.
  *   Need to figure out what to do with the AWS SDK bundle; the transitive 
dependency should ideally be restored. I will discuss with Steve.
  *   Documentation updates - I will fix.
  *   PRs are up for the YARN issues which were failing the ARM64 builds, thank 
you Masatake, so the next RC will include ARM64 binaries.



Once the parquet buffer issue is resolved, I will begin work on RC-2. If there 
are any other issues in this RC that I've missed in the above list please let 
me know so I can be sure to include the fix in the next RC.


Thank you,
Ahmar


From: Masatake Iwasaki 
Sent: Thursday, June 5, 2025 4:46:06 PM
To: Wei-Chiu Chuang; Ahmar Suhail
Cc: [email protected]; [email protected]; 
[email protected]; [email protected]
Subject: RE: [EXTERNAL] [VOTE] Release Apache Hadoop 3.4.2




Hi Wei-Chiu Chuang,

> I really wish we can make ARM64 binaries. Would like to find time to work
> on the two jiras mentioned.

I submitted PRs for YARN-11712 and YARN-11713.
Could you check the patches?

Thanks,
Masatake Iwasaki

On 2025/06/05 1:14, Wei-Chiu Chuang wrote:
> I really wish we can make ARM64 binaries. Would like to find time to work
> on the two jiras mentioned.
>
> On Wed, May 28, 2025 at 5:25 AM Ahmar Suhail  wrote:
>
>> Hey all,
>>
>> The first release candidate for Hadoop 3.4.2 is now available for voting.
>>
>> There are a couple of things to note:
>>
>> 1/ No Arm64 artifacts. This is due to previously reported issues:
>> https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
>> https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
>> mean that the build fails on arm64.
>>
>> 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
>> from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
>> which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
>> For S3A, download SDK bundle v2.29.52 from:
>> https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
>> and drop it into /share/hadoop/common/lib. Release notes will be updated
>> with these instructions.
>>
>>
>> The RC is available at:
>>
>> https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.2-RC1/
>>
>> The git tag is release-3.4.2-RC1, commit
>> 09870840ec35b48cd107972eb24d25e8aece04c9
>>
>> The maven artifacts are staged at:
>>
>> https://repository.apache.org/content/repositories/orgapachehadoop-1437
>>
>>
>> You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
>>
>> https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
>>
>>
>> This release has been created off of branch-3.4. Key changes include:
>>
>> * S3A: Integration with S3 Analytics Accelerator input stream
>> * S3A: Support for S3 conditional writes
>> * ABFS: Deprecation of WASB driver
>> * ABFS: Support for Non-Hierarchical Namespace Accounts on ABFS Driver
>>
>>
>> This is my first attempt at managing a release, please do test the release
>> and let me know in case of any issues.
>>
>> Thanks,
>> Ahmar
>>
>



Re: [VOTE] Release Apache Hadoop 3.4.2

2025-06-05 Thread Masatake Iwasaki

Hi Wei-Chiu Chuang,


I really wish we can make ARM64 binaries. Would like to find time to work
on the two jiras mentioned.


I submitted PRs for YARN-11712 and YARN-11713.
Could you check the patches?

Thanks,
Masatake Iwasaki

On 2025/06/05 1:14, Wei-Chiu Chuang wrote:

I really wish we can make ARM64 binaries. Would like to find time to work
on the two jiras mentioned.

On Wed, May 28, 2025 at 5:25 AM Ahmar Suhail  wrote:


Hey all,

The first release candidate for Hadoop 3.4.2 is now available for voting.

There are a couple of things to note:

1/ No Arm64 artifacts. This is due to previously reported issues:
https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
mean that the build fails on arm64.

2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
For S3A, download SDK bundle v2.29.52 from:
https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
and drop it into /share/hadoop/common/lib. Release notes will be updated
with these instructions.


The RC is available at:

https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.2-RC1/

The git tag is release-3.4.2-RC1, commit
09870840ec35b48cd107972eb24d25e8aece04c9

The maven artifacts are staged at:

https://repository.apache.org/content/repositories/orgapachehadoop-1437


You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:

https://dist.apache.org/repos/dist/release/hadoop/common/KEYS


This release has been created off of branch-3.4. Key changes include:

* S3A: Integration with S3 Analytics Accelerator input stream
* S3A: Support for S3 conditional writes
* ABFS: Deprecation of WASB driver
* ABFS: Support for Non-Hierarchical Namespace Accounts on ABFS Driver


This is my first attempt at managing a release, please do test the release
and let me know in case of any issues.

Thanks,
Ahmar









Re: [VOTE] Release Apache Hadoop 3.4.2

2025-06-04 Thread Wei-Chiu Chuang
I really wish we can make ARM64 binaries. Would like to find time to work
on the two jiras mentioned.

On Wed, May 28, 2025 at 5:25 AM Ahmar Suhail  wrote:

> Hey all,
>
> The first release candidate for Hadoop 3.4.2 is now available for voting.
>
> There are a couple of things to note:
>
> 1/ No Arm64 artifacts. This is due to previously reported issues:
> https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
> https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
> mean that the build fails on arm64.
>
> 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
> from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
> which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
> For S3A, download SDK bundle v2.29.52 from:
> https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
> and drop it into /share/hadoop/common/lib. Release notes will be updated
> with these instructions.
>
>
> The RC is available at:
>
> https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.2-RC1/
>
> The git tag is release-3.4.2-RC1, commit
> 09870840ec35b48cd107972eb24d25e8aece04c9
>
> The maven artifacts are staged at:
>
> https://repository.apache.org/content/repositories/orgapachehadoop-1437
>
>
> You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
>
> https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
>
>
> This release has been created off of branch-3.4. Key changes include:
>
> * S3A: Integration with S3 Analytics Accelerator input stream
> * S3A: Support for S3 conditional writes
> * ABFS: Deprecation of WASB driver
> * ABFS: Support for Non-Hierarchical Namespace Accounts on ABFS Driver
>
>
> This is my first attempt at managing a release, please do test the release
> and let me know in case of any issues.
>
> Thanks,
> Ahmar
>


Re: [VOTE] Release Apache Hadoop 3.4.2

2025-06-04 Thread Steve Loughran
ocator.java:160)
>> at
>> org.apache.parquet.hadoop.TestParquetReader.closeAllocator(TestParquetReader.java:175)
>> at
>> java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native
>> Method)
>> at
>> java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>> at
>> java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>> at java.base/java.lang.reflect.Method.invoke(Method.java:566)
>> at
>> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59)
>> at
>> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>> Caused by:
>> org.apache.parquet.bytes.TrackingByteBufferAllocator$ByteBufferAllocationStacktraceException:
>> Set org.apache.parquet.bytes.TrackingByteBufferAllocator.DEBUG = true for
>> more info
>>
>>
>>1. This is very much the vector io stuff.
>>2. It's not a regression since 3.4.0
>>3. but it's indicative of a memory leak. Surprised and Annoyed I
>>hadn't spotted this before...the tests went in last year.
>>
>>
>> On Tue, 3 Jun 2025 at 10:55, Steve Loughran  wrote:
>>
>>>
>>> Ahmar, no need for prs there, it's a "commit then review" repo, as in
>>> other people can revoke if you break things.
>>>
>>> will do the merge and test
>>>
>>> On Mon, 2 Jun 2025 at 16:51, Suhail, Ahmar 
>>> wrote:
>>>
>>>> Steve -  I created a PR on hadoop-release-support with my properties
>>>> file: https://github.com/apache/hadoop-release-support/pull/4
>>>>
>>>>
>>>> Masatake - Yes, sounds good, will update the documentation for the new
>>>> RC if it is created.
>>>>
>>>>
>>>> Mukund - I realised too late that I messed up the commit for that one.
>>>> Will figure out how to fix..
>>>>
>>>> 
>>>> From: Steve Loughran 
>>>> Sent: Monday, June 2, 2025 4:25:00 PM
>>>> To: Ahmar Suhail
>>>> Cc: [email protected]; [email protected];
>>>> [email protected]; [email protected]
>>>> Subject: RE: [EXTERNAL] [VOTE] Release Apache Hadoop 3.4.2
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> can you put up the relevant changes to the hadoop-release-support
>>>> module; I'd like to use it as part of my validation and I'm assuming you
>>>> have that src/releases/release-info-3.4.2.properties file
>>>>
>>>> On Wed, 28 May 2025 at 13:25, Ahmar Suhail wrote:
>>>> Hey all,
>>>>
>>>> The first release candidate for Hadoop 3.4.2 is now available for
>>>> voting.
>>>>
>>>> There are a couple of things to note:
>>>>
>>>> 1/ No Arm64 artifacts. This is due to previously reported issues:
>>>> https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
>>>> https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
>>>> mean that the build fails on arm64.
>>>>
>>>> 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
>>>> from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
>>>> which makes the size of tar > 1GB, and it can no longer be uploaded to
>>>> SVN.
>>>> For S3A, download SDK bundle v2.29.52 from:
>>>> https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52
>>>> ,
>>>> and drop it into /share/hadoop/common/lib. Release notes will be updated
>>>> with these instructions.
>>>>
>>>>
>>>> The RC is available at:
>>>>
>>>> https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.2-RC1/
>>>>
>>>> The git tag is release-3.4.2-RC1, commit
>>>> 09870840ec35b48cd107972eb24d25e8aece04c9
>>>>
>>>> The maven artifacts are staged at:
>>>>
>>>> https://repository.apache.org/content/repositories/orgapachehadoop-1437
>>>>
>>>>
>>>> You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98)
>>>> at:
>>>>
>>>> https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
>>>>
>>>>
>>>> This release has been created off of branch-3.4. Key changes include:
>>>>
>>>> * S3A: Integration with S3 Analytics Accelerator input stream
>>>> * S3A: Support for S3 conditional writes
>>>> * ABFS: Deprecation of WASB driver
>>>> * ABFS: Support for Non-Hierarchical Namespace Accounts on ABFS Driver
>>>>
>>>>
>>>> This is my first attempt at managing a release, please do test the
>>>> release
>>>> and let me know in case of any issues.
>>>>
>>>> Thanks,
>>>> Ahmar
>>>>
>>>


Re: [VOTE] Release Apache Hadoop 3.4.2

2025-06-04 Thread Steve Loughran
d and Annoyed I
>hadn't spotted this before...the tests went in last year.
>
>
> On Tue, 3 Jun 2025 at 10:55, Steve Loughran  wrote:
>
>>
>> Ahmar, no need for prs there, it's a "commit then review" repo, as in
>> other people can revoke if you break things.
>>
>> will do the merge and test
>>
>> On Mon, 2 Jun 2025 at 16:51, Suhail, Ahmar 
>> wrote:
>>
>>> Steve -  I created a PR on hadoop-release-support with my properties
>>> file: https://github.com/apache/hadoop-release-support/pull/4
>>>
>>>
>>> Masatake - Yes, sounds good, will update the documentation for the new
>>> RC if it is created.
>>>
>>>
>>> Mukund - I realised too late that I messed up the commit for that one.
>>> Will figure out how to fix..
>>>
>>> 
>>> From: Steve Loughran 
>>> Sent: Monday, June 2, 2025 4:25:00 PM
>>> To: Ahmar Suhail
>>> Cc: [email protected]; [email protected];
>>> [email protected]; [email protected]
>>> Subject: RE: [EXTERNAL] [VOTE] Release Apache Hadoop 3.4.2
>>>
>>>
>>>
>>>
>>>
>>> can you put up the relevant changes to the hadoop-release-support
>>> module; I'd like to use it as part of my validation and I'm assuming you
>>> have that src/releases/release-info-3.4.2.properties file
>>>
>>> On Wed, 28 May 2025 at 13:25, Ahmar Suhail wrote:
>>> Hey all,
>>>
>>> The first release candidate for Hadoop 3.4.2 is now available for voting.
>>>
>>> There are a couple of things to note:
>>>
>>> 1/ No Arm64 artifacts. This is due to previously reported issues:
>>> https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
>>> https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
>>> mean that the build fails on arm64.
>>>
>>> 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
>>> from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
>>> which makes the size of tar > 1GB, and it can no longer be uploaded to
>>> SVN.
>>> For S3A, download SDK bundle v2.29.52 from:
>>> https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52
>>> ,
>>> and drop it into /share/hadoop/common/lib. Release notes will be updated
>>> with these instructions.
>>>
>>>
>>> The RC is available at:
>>>
>>> https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.2-RC1/
>>>
>>> The git tag is release-3.4.2-RC1, commit
>>> 09870840ec35b48cd107972eb24d25e8aece04c9
>>>
>>> The maven artifacts are staged at:
>>>
>>> https://repository.apache.org/content/repositories/orgapachehadoop-1437
>>>
>>>
>>> You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
>>>
>>> https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
>>>
>>>
>>> This release has been created off of branch-3.4. Key changes include:
>>>
>>> * S3A: Integration with S3 Analytics Accelerator input stream
>>> * S3A: Support for S3 conditional writes
>>> * ABFS: Deprecation of WASB driver
>>> * ABFS: Support for Non-Hierarchical Namespace Accounts on ABFS Driver
>>>
>>>
>>> This is my first attempt at managing a release, please do test the
>>> release
>>> and let me know in case of any issues.
>>>
>>> Thanks,
>>> Ahmar
>>>
>>


Re: [VOTE] Release Apache Hadoop 3.4.2

2025-06-03 Thread Steve Loughran
I'm sort of leaning towards a -1, though the regression is actually in
3.4.0, and surfacing as a failure to release buffers in localfs IO.


[ERROR] org.apache.parquet.hadoop.TestParquetReader.testRangeFiltering[2]
-- Time elapsed: 0.033 s <<< ERROR!
org.apache.parquet.bytes.TrackingByteBufferAllocator$LeakedByteBufferException:
4 ByteBuffer object(s) is/are remained unreleased after closing this
allocator.
at
org.apache.parquet.bytes.TrackingByteBufferAllocator.close(TrackingByteBufferAllocator.java:160)
at
org.apache.parquet.hadoop.TestParquetReader.closeAllocator(TestParquetReader.java:175)
at
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native
Method)
at
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at
java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.base/java.lang.reflect.Method.invoke(Method.java:566)
at
org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59)
at
org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
Caused by:
org.apache.parquet.bytes.TrackingByteBufferAllocator$ByteBufferAllocationStacktraceException:
Set org.apache.parquet.bytes.TrackingByteBufferAllocator.DEBUG = true for
more info


   1. This is very much the vector io stuff.
   2. It's not a regression since 3.4.0
   3. but it's indicative of a memory leak. Surprised and Annoyed I hadn't
   spotted this before...the tests went in last year.
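
For readers unfamiliar with this style of leak detection, the idea behind the failing check can be sketched in a few lines. This is a toy analogue written for illustration, not Parquet's TrackingByteBufferAllocator; the class and exception names are hypothetical. The allocator remembers every buffer it hands out and fails on close if any were never released.

```python
class LeakedBufferError(Exception):
    """Raised when an allocator is closed with buffers still outstanding."""


class TrackingAllocator:
    """Toy allocator that records live buffers and checks them on close."""

    def __init__(self):
        # Holding a reference keeps each buffer alive, so its id stays unique.
        self._live = {}

    def allocate(self, size):
        buf = bytearray(size)
        self._live[id(buf)] = buf
        return buf

    def release(self, buf):
        if self._live.pop(id(buf), None) is None:
            raise ValueError("buffer was not allocated here or already released")

    def close(self):
        if self._live:
            raise LeakedBufferError(
                f"{len(self._live)} buffer(s) remained unreleased after close"
            )


allocator = TrackingAllocator()
first = allocator.allocate(16)
second = allocator.allocate(16)
allocator.release(first)
try:
    allocator.close()
except LeakedBufferError as err:
    print(err)
```

A reader that allocates through such an allocator but never hands the buffers back would trip the check in the same way as the test failure above.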


On Tue, 3 Jun 2025 at 10:55, Steve Loughran  wrote:

>
> Ahmar, no need for prs there, it's a "commit then review" repo, as in
> other people can revoke if you break things.
>
> will do the merge and test
>
> On Mon, 2 Jun 2025 at 16:51, Suhail, Ahmar 
> wrote:
>
>> Steve -  I created a PR on hadoop-release-support with my properties
>> file: https://github.com/apache/hadoop-release-support/pull/4
>>
>>
>> Masatake - Yes, sounds good, will update the documentation for the new RC
>> if it is created.
>>
>>
>> Mukund - I realised too late that I messed up the commit for that one.
>> Will figure out how to fix..
>>
>> 
>> From: Steve Loughran 
>> Sent: Monday, June 2, 2025 4:25:00 PM
>> To: Ahmar Suhail
>> Cc: [email protected]; [email protected];
>> [email protected]; [email protected]
>> Subject: RE: [EXTERNAL] [VOTE] Release Apache Hadoop 3.4.2
>>
>>
>>
>>
>>
>> can you put up the relevant changes to the hadoop-release-support module;
>> I'd like to use it as part of my validation and I'm assuming you have that
>> src/releases/release-info-3.4.2.properties file
>>
>> On Wed, 28 May 2025 at 13:25, Ahmar Suhail wrote:
>> Hey all,
>>
>> The first release candidate for Hadoop 3.4.2 is now available for voting.
>>
>> There are a couple of things to note:
>>
>> 1/ No Arm64 artifacts. This is due to previously reported issues:
>> https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
>> https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
>> mean that the build fails on arm64.
>>
>> 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
>> from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
>> which makes the size of tar > 1GB, and it can no longer be uploaded to
>> SVN.
>> For S3A, download SDK bundle v2.29.52 from:
>> https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
>> and drop it into /share/hadoop/common/lib. Release notes will be updated
>> with these instructions.
>>
>>
>> The RC is available at:
>>
>> https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.2-RC1/
>>
>> The git tag is release-3.4.2-RC1, commit
>> 09870840ec35b48cd107972eb24d25e8aece04c9
>>
>> The maven artifacts are staged at:
>>
>> https://repository.apache.org/content/repositories/orgapachehadoop-1437
>>
>>
>> You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
>>
>> https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
>>
>>
>> This release has been created off of branch-3.4. Key changes include:
>>
>> * S3A: Integration with S3 Analytics Accelerator input stream
>> * S3A: Support for S3 conditional writes
>> * ABFS: Deprecation of WASB driver
>> * ABFS: Support for Non-Hierarchical Namespace Accounts on ABFS Driver
>>
>>
>> This is my first attempt at managing a release, please do test the release
>> and let me know in case of any issues.
>>
>> Thanks,
>> Ahmar
>>
>


Re: [VOTE] Release Apache Hadoop 3.4.2

2025-06-03 Thread Steve Loughran
Ahmar, no need for PRs there; it's a "commit then review" repo, as in other
people can revoke if you break things.

will do the merge and test

On Mon, 2 Jun 2025 at 16:51, Suhail, Ahmar wrote:

> Steve -  I created a PR on hadoop-release-support with my properties file:
> https://github.com/apache/hadoop-release-support/pull/4
>
>
> Masatake - Yes, sounds good, will update the documentation for the new RC
> if it is created.
>
>
> Mukund - I realised too late that I messed up the commit for that one.
> Will figure out how to fix it.
>
> 
> From: Steve Loughran 
> Sent: Monday, June 2, 2025 4:25:00 PM
> To: Ahmar Suhail
> Cc: [email protected]; [email protected];
> [email protected]; [email protected]
> Subject: RE: [EXTERNAL] [VOTE] Release Apache Hadoop 3.4.2
>
>
>
> can you put up the relevant changes to the hadoop-release-support module;
> I'd like to use it as part of my validation and I'm assuming you have that
> src/releases/release-info-3.4.2.properties file
>
> On Wed, 28 May 2025 at 13:25, Ahmar Suhail wrote:
> Hey all,
>
> The first release candidate for Hadoop 3.4.2 is now available for voting.
>
> There are a couple of things to note:
>
> 1/ No Arm64 artifacts. This is due to previously reported issues:
> https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
> https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
> mean that the build fails on arm64.
>
> 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
> from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
> which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
> For S3A, download SDK bundle v2.29.52 from:
> https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
> and drop it into /share/hadoop/common/lib. Release notes will be updated
> with these instructions.
>
>
> The RC is available at:
>
> https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.2-RC1/
>
> The git tag is release-3.4.2-RC1, commit
> 09870840ec35b48cd107972eb24d25e8aece04c9
>
> The maven artifacts are staged at:
>
> https://repository.apache.org/content/repositories/orgapachehadoop-1437
>
>
> You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
>
> https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
>
>
> This release has been created off of branch-3.4. Key changes include:
>
> * S3A: Integration with S3 Analytics Accelerator input stream
> * S3A: Support for S3 conditional writes
> * ABFS: Deprecation of WASB driver
> * ABFS: Support for Non-Hierarchical Namespace Accounts on ABFS Driver
>
>
> This is my first attempt at managing a release, please do test the release
> and let me know in case of any issues.
>
> Thanks,
> Ahmar
>


Re: [VOTE] Release Apache Hadoop 3.4.2

2025-06-02 Thread Suhail, Ahmar
Steve -  I created a PR on hadoop-release-support with my properties file: 
https://github.com/apache/hadoop-release-support/pull/4


Masatake - Yes, sounds good, will update the documentation for the new RC if it 
is created.


Mukund - I realised too late that I messed up the commit for that one. Will 
figure out how to fix it.


From: Steve Loughran 
Sent: Monday, June 2, 2025 4:25:00 PM
To: Ahmar Suhail
Cc: [email protected]; [email protected]; 
[email protected]; [email protected]
Subject: RE: [EXTERNAL] [VOTE] Release Apache Hadoop 3.4.2




can you put up the relevant changes to the hadoop-release-support module; I'd 
like to use it as part of my validation and I'm assuming you have that 
src/releases/release-info-3.4.2.properties file

On Wed, 28 May 2025 at 13:25, Ahmar Suhail wrote:
Hey all,

The first release candidate for Hadoop 3.4.2 is now available for voting.

There are a couple of things to note:

1/ No Arm64 artifacts. This is due to previously reported issues:
https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
mean that the build fails on arm64.

2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
For S3A, download SDK bundle v2.29.52 from:
https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
and drop it into /share/hadoop/common/lib. Release notes will be updated
with these instructions.


The RC is available at:

https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.2-RC1/

The git tag is release-3.4.2-RC1, commit
09870840ec35b48cd107972eb24d25e8aece04c9

The maven artifacts are staged at:

https://repository.apache.org/content/repositories/orgapachehadoop-1437


You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:

https://dist.apache.org/repos/dist/release/hadoop/common/KEYS


This release has been created off of branch-3.4. Key changes include:

* S3A: Integration with S3 Analytics Accelerator input stream
* S3A: Support for S3 conditional writes
* ABFS: Deprecation of WASB driver
* ABFS: Support for Non-Hierarchical Namespace Accounts on ABFS Driver


This is my first attempt at managing a release, please do test the release
and let me know in case of any issues.

Thanks,
Ahmar


Re: [VOTE] Release Apache Hadoop 3.4.2

2025-06-02 Thread Steve Loughran
can you put up the relevant changes to the hadoop-release-support module;
I'd like to use it as part of my validation and I'm assuming you have
that src/releases/release-info-3.4.2.properties file

On Wed, 28 May 2025 at 13:25, Ahmar Suhail  wrote:

> Hey all,
>
> The first release candidate for Hadoop 3.4.2 is now available for voting.
>
> There are a couple of things to note:
>
> 1/ No Arm64 artifacts. This is due to previously reported issues:
> https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
> https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
> mean that the build fails on arm64.
>
> 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
> from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
> which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
> For S3A, download SDK bundle v2.29.52 from:
> https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
> and drop it into /share/hadoop/common/lib. Release notes will be updated
> with these instructions.
>
>
> The RC is available at:
>
> https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.2-RC1/
>
> The git tag is release-3.4.2-RC1, commit
> 09870840ec35b48cd107972eb24d25e8aece04c9
>
> The maven artifacts are staged at:
>
> https://repository.apache.org/content/repositories/orgapachehadoop-1437
>
>
> You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
>
> https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
>
>
> This release has been created off of branch-3.4. Key changes include:
>
> * S3A: Integration with S3 Analytics Accelerator input stream
> * S3A: Support for S3 conditional writes
> * ABFS: Deprecation of WASB driver
> * ABFS: Support for Non-Hierarchical Namespace Accounts on ABFS Driver
>
>
> This is my first attempt at managing a release, please do test the release
> and let me know in case of any issues.
>
> Thanks,
> Ahmar
>


Re: [VOTE] Release Apache Hadoop 3.4.2

2025-06-02 Thread Masatake Iwasaki

* git_jira_fix_version_check.py reported some inconsistencies between commits 
and JIRA.


I updated JIRA's "Fix Version/s" and "Resolution" to match the commit status.

Since HADOOP-19580 was committed to branch-3.4.2 but is not in RC1,
it will be included if RC2 is created.
Its "Fix Version/s" should be updated later.

https://issues.apache.org/jira/browse/HADOOP-19580

On 2025/06/02 18:47, Masatake Iwasaki wrote:

Thanks for driving the release, Ahmar.

I found some issues in the documentation.
While they are not worth sinking RC1 over, it would be nice to fix them if we cut 
RC2.

* The top page of the site documentation appears to describe the changes in 3.4.1.
   
https://github.com/apache/hadoop/blob/release-3.4.2-RC1/hadoop-project/src/site/markdown/index.md.vm

* git_jira_fix_version_check.py reported some inconsistencies between commits 
and JIRA.
   
https://github.com/apache/hadoop/tree/release-3.4.2-RC1/dev-support/git-jira-validation


   Jira/Git commit message diff starting: ##
   WARN: Jira not found.    Commit: 09870840ec35b48cd107972eb24d25e8aece04c9 updates SDK dependency to provided (#7688)
   WARN: Jira not found.    Commit: 02b4849886c943be635af521714934b5bf6c32c1 Preparing for 3.4.2 release
   Jira not present with version: 3.4.2.    Commit: 7f170e9788f8074091f98527cdd9552acd1d05db HADOOP-19548: [ABFS] Fix Logging in FSDataInputStream to Mention Correct Buffersize (#7642) (#7660)
   Jira is not resolved yet?    Commit: f3ec55b5911d1e5e7da438c64f614a59a7db62a3 HADOOP-19516. S3A: SDK reads content twice during PUT to S3 Express store.
   Commit seems reverted.   Commit: 9d3082e130227fdd05216eb4a0ecd374bce447f2 Revert "HDFS-14884. Add sanity check that zone key equals feinfo key while setting Xattrs. Contributed by Mukul Kumar Singh." (#7529)
   Jira not present with version: 3.4.2.    Commit: c7d292ecd1078f72e234ae1e84e70d7ee280ad92 HADOOP-19488 fix temporary directory creation (branch-3.4) (#7530)
   Jira not present with version: 3.4.2.    Commit: a5fb37e6547e152fabbc47520b9c8f7fc448a1df HADOOP-18940. [ABFS] Remove commons IOUtils.close() from AbfsOutputStream (#7506)
   Commit seems reverted.   Commit: 474f5343a4676832cf514ff37c03d25915472752 Revert "YARN-11793: Remove grizzly-http-* dependencies from Hadoop to avoid transitive inclusions."
   Jira not present with version: 3.4.2.    Commit: 7e682e0f976694bf3030309f73ac2264694f6d70 YARN-11793: Remove grizzly-http-* dependencies from Hadoop to avoid transitive inclusions.
   Jira not present with version: 3.4.2.    Commit: c122ff33b82a727ac1f6c83760efcb7fd5c0bac7 [YARN-11776] Handle NPE in the RMDelegationTokenIdentifier if localServiceAddress is null (#7431) Contributed by Abhey Rana.
   Jira not present with version: 3.4.2.    Commit: b3ae9d1eb6841a6c9f9928a917cc50818a319f71 HADOOP-19152. Do not hard code security providers. (#6739) (#7439)
   WARN: Jira not found.    Commit: 04ca53410beb18bbd279219275f59f04e6df293a Integrate analytics accelerator into S3A. (#7471)
   Jira not present with version: 3.4.2.    Commit: fd445c215459f2b7604e2cd383af98c040861cc2 HADOOP-19407: Update 3.4.1 docs landing page to highlight changes shipped in the release.
   Jira not present with version: 3.4.2.    Commit: 7d699e5ffb7dee613ac6bc6bccb529ef6bb4a2a6 HADOOP-19401: Improve error message when OS can't identify the current user.
   Jira not present with version: 3.4.2.    Commit: 57d821e6aad23b604833272071c8481567363e63 HADOOP-19143. Upgrade commons-cli to 1.9.0. (#7126) (#7146) Contributed by Min Yan.
   Jira not present with version: 3.4.2.    Commit: 0b74fbe9479a88b154c5d5d5acc5920c4a50e65d HDFS-17637. [BackPort] Fix spotbugs in HttpFSFileSystem#getXAttr (#7099) (#7195) Contributed by Hualong Zhang.
   Found first commit hash after which git history is redundant. commit: 23f45d0b26d4a883e7a29278df0ec75bc5c1fda3
   Exiting successfully
   Jira/Git commit message diff completed: ##

   Any resolved Jira with fixVersion 3.4.2 but corresponding commit not present
   Starting diff: ##
   HADOOP-19580 is marked resolved with fixVersion 3.4.2 but no corresponding commit found
   HADOOP-19512 is marked resolved with fixVersion 3.4.2 but no corresponding commit found
   HADOOP-19492 is marked resolved with fixVersion 3.4.2 but no corresponding commit found
   HADOOP-19491 is marked resolved with fixVersion 3.4.2 but no corresponding commit found
   HADOOP-19382 is marked resolved with fixVersion 3.4.2 but no corresponding commit found
   Completed diff: ##


I will vote after some more local testing.

Thanks,
Masatake Iwasaki


On 2025/05/28 21:24, Ahmar Suhail wrote:

Hey all,

The first release candidate for Hadoop 3.4.2 is now available for voting.

There are a cou

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-06-02 Thread Masatake Iwasaki

Thanks for driving the release, Ahmar.

I found some issues in the documentation.
While they are not worth sinking RC1 over, it would be nice to fix them if we cut 
RC2.

* The top page of the site documentation appears to describe the changes in 3.4.1.
  
https://github.com/apache/hadoop/blob/release-3.4.2-RC1/hadoop-project/src/site/markdown/index.md.vm

* git_jira_fix_version_check.py reported some inconsistencies between commits 
and JIRA.
  
https://github.com/apache/hadoop/tree/release-3.4.2-RC1/dev-support/git-jira-validation


  Jira/Git commit message diff starting: ##
  WARN: Jira not found.    Commit: 09870840ec35b48cd107972eb24d25e8aece04c9 updates SDK dependency to provided (#7688)
  WARN: Jira not found.    Commit: 02b4849886c943be635af521714934b5bf6c32c1 Preparing for 3.4.2 release
  Jira not present with version: 3.4.2.    Commit: 7f170e9788f8074091f98527cdd9552acd1d05db HADOOP-19548: [ABFS] Fix Logging in FSDataInputStream to Mention Correct Buffersize (#7642) (#7660)
  Jira is not resolved yet?    Commit: f3ec55b5911d1e5e7da438c64f614a59a7db62a3 HADOOP-19516. S3A: SDK reads content twice during PUT to S3 Express store.
  Commit seems reverted.   Commit: 9d3082e130227fdd05216eb4a0ecd374bce447f2 Revert "HDFS-14884. Add sanity check that zone key equals feinfo key while setting Xattrs. Contributed by Mukul Kumar Singh." (#7529)
  Jira not present with version: 3.4.2.    Commit: c7d292ecd1078f72e234ae1e84e70d7ee280ad92 HADOOP-19488 fix temporary directory creation (branch-3.4) (#7530)
  Jira not present with version: 3.4.2.    Commit: a5fb37e6547e152fabbc47520b9c8f7fc448a1df HADOOP-18940. [ABFS] Remove commons IOUtils.close() from AbfsOutputStream (#7506)
  Commit seems reverted.   Commit: 474f5343a4676832cf514ff37c03d25915472752 Revert "YARN-11793: Remove grizzly-http-* dependencies from Hadoop to avoid transitive inclusions."
  Jira not present with version: 3.4.2.    Commit: 7e682e0f976694bf3030309f73ac2264694f6d70 YARN-11793: Remove grizzly-http-* dependencies from Hadoop to avoid transitive inclusions.
  Jira not present with version: 3.4.2.    Commit: c122ff33b82a727ac1f6c83760efcb7fd5c0bac7 [YARN-11776] Handle NPE in the RMDelegationTokenIdentifier if localServiceAddress is null (#7431) Contributed by Abhey Rana.
  Jira not present with version: 3.4.2.    Commit: b3ae9d1eb6841a6c9f9928a917cc50818a319f71 HADOOP-19152. Do not hard code security providers. (#6739) (#7439)
  WARN: Jira not found.    Commit: 04ca53410beb18bbd279219275f59f04e6df293a Integrate analytics accelerator into S3A. (#7471)
  Jira not present with version: 3.4.2.    Commit: fd445c215459f2b7604e2cd383af98c040861cc2 HADOOP-19407: Update 3.4.1 docs landing page to highlight changes shipped in the release.
  Jira not present with version: 3.4.2.    Commit: 7d699e5ffb7dee613ac6bc6bccb529ef6bb4a2a6 HADOOP-19401: Improve error message when OS can't identify the current user.
  Jira not present with version: 3.4.2.    Commit: 57d821e6aad23b604833272071c8481567363e63 HADOOP-19143. Upgrade commons-cli to 1.9.0. (#7126) (#7146) Contributed by Min Yan.
  Jira not present with version: 3.4.2.    Commit: 0b74fbe9479a88b154c5d5d5acc5920c4a50e65d HDFS-17637. [BackPort] Fix spotbugs in HttpFSFileSystem#getXAttr (#7099) (#7195) Contributed by Hualong Zhang.
  Found first commit hash after which git history is redundant. commit: 23f45d0b26d4a883e7a29278df0ec75bc5c1fda3
  Exiting successfully
  Jira/Git commit message diff completed: ##

  Any resolved Jira with fixVersion 3.4.2 but corresponding commit not present
  Starting diff: ##
  HADOOP-19580 is marked resolved with fixVersion 3.4.2 but no corresponding commit found
  HADOOP-19512 is marked resolved with fixVersion 3.4.2 but no corresponding commit found
  HADOOP-19492 is marked resolved with fixVersion 3.4.2 but no corresponding commit found
  HADOOP-19491 is marked resolved with fixVersion 3.4.2 but no corresponding commit found
  HADOOP-19382 is marked resolved with fixVersion 3.4.2 but no corresponding commit found
  Completed diff: ##
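
For anyone triaging a report like the one above, a small script can group the entries by status. This is only a sketch: it assumes each entry has been rejoined onto a single line of the form "<status> Commit: <40-hex sha> <subject>", and the names parse_entries and tally are made up for illustration; they are not part of git_jira_fix_version_check.py.

```python
import re
from collections import Counter

# One report entry: status text, the literal "Commit:", a 40-character SHA,
# then the commit subject. The trailing summary line uses lowercase "commit:"
# and is intentionally not matched.
ENTRY = re.compile(
    r"^\s*(?P<status>.+?)\s*Commit:\s*(?P<sha>[0-9a-f]{40})\s+(?P<subject>.*)$"
)


def parse_entries(report: str) -> list[tuple[str, str, str]]:
    """Return (status, sha, subject) for every line that looks like an entry."""
    entries = []
    for line in report.splitlines():
        match = ENTRY.match(line)
        if match:
            entries.append(
                (match["status"], match["sha"], match["subject"].strip())
            )
    return entries


def tally(report: str) -> Counter:
    """Count how many commits fall under each status message."""
    return Counter(status for status, _, _ in parse_entries(report))
```

Calling tally() on the pasted report text gives a per-status count, which makes it easier to see how many commits only need a "Fix Version/s" update in JIRA.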


I will vote after some more local testing.

Thanks,
Masatake Iwasaki


On 2025/05/28 21:24, Ahmar Suhail wrote:

Hey all,

The first release candidate for Hadoop 3.4.2 is now available for voting.

There are a couple of things to note:

1/ No Arm64 artifacts. This is due to previously reported issues:
https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
mean that the build fails on arm64.

2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
from hadoop-3.4.2.tar.gz. This is because the SDK bundl

Re: [VOTE] Release Apache Hadoop 3.4.2

2025-05-30 Thread Mukund Madhav Thakur
Thanks for starting this! I tried accessing the S3 bucket using the RC by
dropping in the bundle jar and it works. Will test more!

On Wed, May 28, 2025 at 7:25 AM Ahmar Suhail  wrote:

> Hey all,
>
> The first release candidate for Hadoop 3.4.2 is now available for voting.
>
> There are a couple of things to note:
>
> 1/ No Arm64 artifacts. This is due to previously reported issues:
> https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
> https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
> mean that the build fails on arm64.
>
> 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
> from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
> which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
> For S3A, download SDK bundle v2.29.52 from:
> https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
> and drop it into /share/hadoop/common/lib. Release notes will be updated
> with these instructions.
>
>
> The RC is available at:
>
> https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.2-RC1/
>
> The git tag is release-3.4.2-RC1, commit
> 09870840ec35b48cd107972eb24d25e8aece04c9
>
> The maven artifacts are staged at:
>
> https://repository.apache.org/content/repositories/orgapachehadoop-1437
>
>
> You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
>
> https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
>
>
> This release has been created off of branch-3.4. Key changes include:
>
> * S3A: Integration with S3 Analytics Accelerator input stream
> * S3A: Support for S3 conditional writes
> * ABFS: Deprecation of WASB driver
> * ABFS: Support for Non-Hierarchical Namespace Accounts on ABFS Driver
>
>
> This is my first attempt at managing a release, please do test the release
> and let me know in case of any issues.
>
> Thanks,
> Ahmar
>


Re: [VOTE] Release Apache Hadoop 3.4.2

2025-05-28 Thread Xiaoqiao He
Thanks Ahmar for your work. Will try to verify and give feedback here.

Best Regards,
- He Xiaoqiao

On Wed, May 28, 2025 at 8:56 PM Ahmar Suhail  wrote:
>
> Hey all,
>
> The first release candidate for Hadoop 3.4.2 is now available for voting.
>
> There are a couple of things to note:
>
> 1/ No Arm64 artifacts. This is due to previously reported issues:
> https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
> https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
> mean that the build fails on arm64.
>
> 2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
> from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
> which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
> For S3A, download SDK bundle v2.29.52 from:
> https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
> and drop it into /share/hadoop/common/lib. Release notes will be updated
> with these instructions.
>
>
> The RC is available at:
>
> https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.2-RC1/
>
> The git tag is release-3.4.2-RC1, commit
> 09870840ec35b48cd107972eb24d25e8aece04c9
>
> The maven artifacts are staged at:
>
> https://repository.apache.org/content/repositories/orgapachehadoop-1437
>
>
> You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:
>
> https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
>
>
> This release has been created off of branch-3.4. Key changes include:
>
> * S3A: Integration with S3 Analytics Accelerator input stream
> * S3A: Support for S3 conditional writes
> * ABFS: Deprecation of WASB driver
> * ABFS: Support for Non-Hierarchical Namespace Accounts on ABFS Driver
>
>
> This is my first attempt at managing a release, please do test the release
> and let me know in case of any issues.
>
> Thanks,
> Ahmar




[VOTE] Release Apache Hadoop 3.4.2

2025-05-28 Thread Ahmar Suhail
Hey all,

The first release candidate for Hadoop 3.4.2 is now available for voting.

There are a couple of things to note:

1/ No Arm64 artifacts. This is due to previously reported issues:
https://issues.apache.org/jira/projects/YARN/issues/YARN-11712 and
https://issues.apache.org/jira/projects/YARN/issues/YARN-11713, which
mean that the build fails on arm64.

2/ Relevant for anyone testing S3A: We've removed the AWS SDK bundle
from hadoop-3.4.2.tar.gz. This is because the SDK bundle is now ~600MB,
which makes the size of tar > 1GB, and it can no longer be uploaded to SVN.
For S3A, download SDK bundle v2.29.52 from:
https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.29.52,
and drop it into /share/hadoop/common/lib. Release notes will be updated
with these instructions.
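
A rough sketch of the "download and drop in" step, for testers who want to script it. Two things here are assumptions and not from this mail: that the jar is fetched from Maven Central's standard repository layout (the mvnrepository.com link above is a landing page, not a direct download), and that /share/hadoop/common/lib is relative to the directory where the tarball was unpacked. The function names are illustrative only.

```python
from pathlib import Path
import urllib.request

# Assumption: Maven Central's standard layout; the thread only links the
# mvnrepository.com landing page for the artifact.
MAVEN_CENTRAL = "https://repo1.maven.org/maven2"
SDK_VERSION = "2.29.52"  # bundle version named in the announcement


def bundle_url(version: str = SDK_VERSION) -> str:
    """Build the direct download URL for the AWS SDK v2 bundle jar."""
    return (
        f"{MAVEN_CENTRAL}/software/amazon/awssdk/bundle/"
        f"{version}/bundle-{version}.jar"
    )


def bundle_target(hadoop_home: str, version: str = SDK_VERSION) -> Path:
    """Path of the jar inside an unpacked Hadoop distribution."""
    lib_dir = Path(hadoop_home) / "share" / "hadoop" / "common" / "lib"
    return lib_dir / f"bundle-{version}.jar"


def install_bundle(hadoop_home: str, version: str = SDK_VERSION) -> Path:
    """Download the bundle into share/hadoop/common/lib (needs network)."""
    target = bundle_target(hadoop_home, version)
    target.parent.mkdir(parents=True, exist_ok=True)
    urllib.request.urlretrieve(bundle_url(version), str(target))
    return target
```

For example, install_bundle("/path/to/hadoop-3.4.2") would place bundle-2.29.52.jar next to the other common libraries; the path shown is a placeholder.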


The RC is available at:

https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.2-RC1/

The git tag is release-3.4.2-RC1, commit
09870840ec35b48cd107972eb24d25e8aece04c9
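
One way to confirm that the tag really points at this commit, sketched in Python. It assumes a local clone of apache/hadoop with tags fetched and git on the PATH; resolve_tag and same_commit are illustrative helper names, not part of any release tooling.

```python
import subprocess

EXPECTED_COMMIT = "09870840ec35b48cd107972eb24d25e8aece04c9"  # from this mail


def resolve_tag(repo_dir: str, tag: str) -> str:
    """Return the commit SHA a tag points at, peeling annotated tags."""
    result = subprocess.run(
        ["git", "-C", repo_dir, "rev-parse", f"{tag}^{{commit}}"],
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout.strip()


def same_commit(resolved: str, expected: str = EXPECTED_COMMIT) -> bool:
    """Compare two SHAs, ignoring case and surrounding whitespace."""
    return resolved.strip().lower() == expected.strip().lower()

# Typical use, with a placeholder path to a local clone:
#   same_commit(resolve_tag("/path/to/hadoop", "release-3.4.2-RC1"))
```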

The maven artifacts are staged at:

https://repository.apache.org/content/repositories/orgapachehadoop-1437


You can find my public key (02085AFB652F796A3B01D11FD737A6F52281FA98) at:

https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
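
A possible way to check that the downloaded KEYS file contains the fingerprint quoted above. This is a sketch under assumptions: gpg must be installed and recent enough to support --show-keys, and the helper names are made up for illustration. The parser relies on GnuPG's colon-delimited listing, where "fpr" records carry the fingerprint in the tenth field.

```python
import subprocess

EXPECTED_FPR = "02085AFB652F796A3B01D11FD737A6F52281FA98"  # from this mail


def fingerprints_from_colons(listing: str) -> list[str]:
    """Extract fingerprints from `gpg --with-colons` output ("fpr" records)."""
    fingerprints = []
    for line in listing.splitlines():
        fields = line.split(":")
        if fields[0] == "fpr" and len(fields) > 9:
            fingerprints.append(fields[9].upper())
    return fingerprints


def keys_file_contains(keys_path: str, fingerprint: str = EXPECTED_FPR) -> bool:
    """List the keys in a KEYS file without importing them and look for one."""
    listing = subprocess.run(
        ["gpg", "--show-keys", "--with-colons", keys_path],
        check=True,
        capture_output=True,
        text=True,
    ).stdout
    return fingerprint.upper() in fingerprints_from_colons(listing)
```

This only confirms the key is present in KEYS; verifying the artifact signatures themselves is a separate step.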


This release has been created off of branch-3.4. Key changes include:

* S3A: Integration with S3 Analytics Accelerator input stream
* S3A: Support for S3 conditional writes
* ABFS: Deprecation of WASB driver
* ABFS: Support for Non-Hierarchical Namespace Accounts on ABFS Driver


This is my first attempt at managing a release, please do test the release
and let me know in case of any issues.

Thanks,
Ahmar