Re: [VOTE] Release Apache Hadoop 3.1.4 (RC2)

2020-07-10 Thread Gabor Bota
Yes, sure. I'll do another RC for next week. Thank you all for working on this! On Thu, Jul 9, 2020 at 8:20 AM Masatake Iwasaki wrote: > > Hi Gabor Bota, > > I committed the fix of YARN-10347 to branch-3.1. > I think this should be blocker for 3.1.4. > Could you cherry-pick it to branch-3.1.4

Re: [VOTE] Release Apache Hadoop 3.1.4 (RC2)

2020-07-09 Thread Masatake Iwasaki
Hi Gabor Bota, I committed the fix for YARN-10347 to branch-3.1. I think this should be a blocker for 3.1.4. Could you cherry-pick it to branch-3.1.4 and cut a new RC? Thanks, Masatake Iwasaki On 2020/07/08 23:31, Masatake Iwasaki wrote: Thanks Steve and Prabhu for the information. The cause

Re: [VOTE] Release Apache Hadoop 3.1.4 (RC2)

2020-07-08 Thread Masatake Iwasaki
Thanks Steve and Prabhu for the information. The cause turned out to be locking in CapacityScheduler#reinitialize. I think the method is called after transitioning to the active state if RM-HA is enabled. I filed YARN-10347 and created a PR. Masatake Iwasaki On 2020/07/08 16:33, Prabhu Joseph

Re: [VOTE] Release Apache Hadoop 3.1.4 (RC2)

2020-07-08 Thread Prabhu Joseph
Hi Masatake, The thread is waiting for a ReadLock; we need to check what the other thread holding the WriteLock is blocked on. Can you get three consecutive complete jstack dumps of the ResourceManager during the issue? >> I got no issue if RM-HA is disabled. Looks like RM is not able to access Zookeeper
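
To illustrate the symptom described above (a thread stuck waiting for a ReadLock while another thread holds the WriteLock), here is a minimal standalone sketch using java.util.concurrent.locks.ReentrantReadWriteLock. It is not the CapacityScheduler code; the class name ReadLockBlockedDemo, the method name, and the latch standing in for "whatever the writer is blocked on" are all assumptions made for illustration.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical demo class, not part of Hadoop.
public class ReadLockBlockedDemo {

    /**
     * Starts a writer thread that takes the write lock and then stalls,
     * and reports whether the calling thread could take the read lock
     * within timeoutMs while the writer still holds the write lock.
     */
    public static boolean readerAcquiresWhileWriterHolds(long timeoutMs)
            throws InterruptedException {
        ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
        CountDownLatch writerHasLock = new CountDownLatch(1);
        CountDownLatch releaseWriter = new CountDownLatch(1);

        Thread writer = new Thread(() -> {
            lock.writeLock().lock();
            try {
                writerHasLock.countDown();
                // Stand-in for whatever the write-lock holder is blocked on.
                releaseWriter.await();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            } finally {
                lock.writeLock().unlock();
            }
        }, "writer");
        writer.start();
        writerHasLock.await();

        // A plain readLock().lock() would hang here; use a timed attempt
        // so the demo terminates.
        boolean acquired = lock.readLock().tryLock(timeoutMs, TimeUnit.MILLISECONDS);
        if (acquired) {
            lock.readLock().unlock();
        }

        // Let the writer finish so the thread exits cleanly.
        releaseWriter.countDown();
        writer.join();
        return acquired;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("reader acquired: " + readerAcquiresWhileWriterHolds(200));
    }
}
```

In a thread dump of such a process, the reader shows up waiting on the lock while the writer's stack shows what it is actually stuck on, which is why looking at the write-lock holder in the jstack output is the useful next step.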

Re: [VOTE] Release Apache Hadoop 3.1.4 (RC2)

2020-07-06 Thread Steve Loughran
hmm YARN-9341 went through all of the yarn lock code -it's in 3.3 but not in 3.1. And we do not want to attempt to backport 175KB of lock acquire/release code, do we? anyone in yarn-dev got any thoughts here? On Sun, 5 Jul 2020 at 22:14, Masatake Iwasaki wrote: > Thanks for putting this up,

Re: [VOTE] Release Apache Hadoop 3.1.4 (RC2)

2020-07-05 Thread Masatake Iwasaki
Thanks for putting this up, Gabor Bota. I'm testing RC2 on a 3-node Docker cluster with NN-HA and RM-HA enabled. ResourceManager reproducibly blocks on submitApplication while launching example MR jobs. Has anyone run into the same issue? The same configuration worked for 3.1.3. I got no

Re: [VOTE] Release Apache Hadoop 3.1.4 (RC2)

2020-07-02 Thread Steve Loughran
+1, with the instruction "warn everyone about the guava update possibly breaking things at run time" With the key issues being * code compiled with the new guava release will not link against the older releases, even without any changes in the source files. * this includes hadoop-common

Re: [VOTE] Release Apache Hadoop 3.1.4 (RC2)

2020-07-02 Thread Steve Loughran
Mukund, thank you for running these tests. Both of them are things we've fixed, and in both cases the problems were in the tests, not the production code. On Wed, 1 Jul 2020 at 14:22, Mukund Madhav Thakur wrote: > Compile the distribution using mvn package -Pdist -DskipTests > -Dmaven.javadoc.skip=true

Re: [VOTE] Release Apache Hadoop 3.1.4 (RC2)

2020-07-01 Thread Mukund Madhav Thakur
Compiled the distribution using mvn package -Pdist -DskipTests -Dmaven.javadoc.skip=true -DskipShade and ran some hadoop fs commands. All good there. Then I ran the hadoop-aws tests and saw the following failures: [ERROR] Failures: [ERROR]

[VOTE] Release Apache Hadoop 3.1.4 (RC2)

2020-06-26 Thread Gabor Bota
Hi folks, I have put together a release candidate (RC2) for Hadoop 3.1.4. The RC is available at: http://people.apache.org/~gabota/hadoop-3.1.4-RC2/ The RC tag in git is here: https://github.com/apache/hadoop/releases/tag/release-3.1.4-RC2 The maven artifacts are staged at