Thanks steve, for helping with the signup. Hi Matt, I was able to create a JIRA for the bug report: https://issues.apache.org/jira/browse/HADOOP-19624. Once you are able to signup, please feel free to assign this to yourself. Since you have already made significant progress, please feel free to raise the PR against this JIRA. We will surely help with review and check-in.
Thanks again for all the efforts. Anuj On Tue, Jul 15, 2025 at 3:22 PM Steve Loughran <ste...@cloudera.com.invalid> wrote: > do the signup form for hadoop and say "as discussed on common dev" as > reason ... the signup is there to keep spam accounts out > > On Wed, 9 Jul 2025 at 05:21, Matt <m...@fastmail.to> wrote: > > > CC'ing common-dev as that list has more activity > > > > On 7/7/25 20:39, Matt wrote: > > > Hi Hadoop team! > > > > > > I'm not sure where to report this and the Jira board does not allow > > > for public sign up so I figured I'd start here. I found a thread leak > > > in the ABFS driver that causes OutOfMemoryErrors in Hive Metastore > > > environments -- specifically in this part of the code base: > > > > > > > > > https://github.com/apache/hadoop/blob/f099f08d922689dd2bd641bbbbd7c29c451463df/hadoop-tools/hadoop-azure/src/main/java/org/apache/hadoop/fs/azurebfs/services/AbfsClientThrottlingAnalyzer.java#L127 > > > > > > > > > What seems to be the issue is that the timer tasks are cleaned up but > > > the timer threads themselves are never actually cleaned up. This will > > > eventually lead to an OOM since nothing is collecting these. I was > > > able to reproduce this locally in 3.3.6 and 3.4.1 but I believe that > > > it would affect any version that relies on autothrottling for ABFS. > > > > > > I was also able to make a quick fix as well as confirm a workaround -- > > > the long term fix would be to include `timer.cancel()` and > > > `timer.purge()` in a method for AbfsClientThrottlingAnalyzer.java. The > > > short term workaround is to disable autothrottling and rely on Azure > > > to throttle the connections as needed with the below configuration. > > > > > > ``` > > > > > > <property> > > > <name>fs.azure.enable.autothrottling</name> > > > <value>false</value> > > > </property> > > > > > > ``` > > > > > > I'm happy to share my fix and test results but I'm not quite sure who > > > to share them with -- any direction is greatly appreciated! > > > > > > Thank you, > > > Matt > > > > > > > --------------------------------------------------------------------- > > To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org > > For additional commands, e-mail: common-dev-h...@hadoop.apache.org > > > > >