[jira] [Commented] (HDFS-12516) Suppress the fsnamesystem lock warning on nn startup
[ https://issues.apache.org/jira/browse/HDFS-12516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16179419#comment-16179419 ] Ajay Kumar commented on HDFS-12516: --- [~anu],[~arpitagarwal] thanks for review and commit. > Suppress the fsnamesystem lock warning on nn startup > > > Key: HDFS-12516 > URL: https://issues.apache.org/jira/browse/HDFS-12516 > Project: Hadoop HDFS > Issue Type: Improvement >Reporter: Ajay Kumar >Assignee: Ajay Kumar > Fix For: 3.1.0 > > Attachments: HDFS-12516.01.patch, HDFS-12516.02.patch, > HDFS-12516.03.patch > > > Whenever FsNameSystemLock is held for more than configured value of > {{dfs.namenode.write-lock-reporting-threshold-ms}}, we log stacktrace and an > entry in metrics. Loading FSImage from disk will usually cross this > threshold. We can suppress this FsNamesystem lock warning on NameNode startup. > {code} > 17/09/20 21:41:39 INFO namenode.FSNamesystem: FSNamesystem write lock held > for 7159 ms via > java.lang.Thread.getStackTrace(Thread.java:1552) > org.apache.hadoop.util.StringUtils.getStackTrace(StringUtils.java:945) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.writeUnlock(FSNamesystem.java:1659) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFSImage(FSNamesystem.java:1074) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFromDisk(FSNamesystem.java:703) > org.apache.hadoop.hdfs.server.namenode.NameNode.loadNamesystem(NameNode.java:688) > org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:752) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:992) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:976) > org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1701) > org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1769) > Number of suppressed write-lock reports: 0 > Longest write-lock held interval: 7159 > {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Commented] (HDFS-12516) Suppress the fsnamesystem lock warning on nn startup
[ https://issues.apache.org/jira/browse/HDFS-12516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178519#comment-16178519 ] Hudson commented on HDFS-12516: --- SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #12963 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/12963/]) HDFS-12516. Suppress the fsnamesystem lock warning on nn startup. (aengineer: rev d0b2c5850b523a3888b2fadcfcdf6edbed33f221) * (edit) hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSNamesystem.java * (edit) hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/TestFSNamesystemLock.java * (edit) hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSNamesystemLock.java > Suppress the fsnamesystem lock warning on nn startup > > > Key: HDFS-12516 > URL: https://issues.apache.org/jira/browse/HDFS-12516 > Project: Hadoop HDFS > Issue Type: Improvement >Reporter: Ajay Kumar >Assignee: Ajay Kumar > Fix For: 3.1.0 > > Attachments: HDFS-12516.01.patch, HDFS-12516.02.patch, > HDFS-12516.03.patch > > > Whenever FsNameSystemLock is held for more than configured value of > {{dfs.namenode.write-lock-reporting-threshold-ms}}, we log stacktrace and an > entry in metrics. Loading FSImage from disk will usually cross this > threshold. We can suppress this FsNamesystem lock warning on NameNode startup. > {code} > 17/09/20 21:41:39 INFO namenode.FSNamesystem: FSNamesystem write lock held > for 7159 ms via > java.lang.Thread.getStackTrace(Thread.java:1552) > org.apache.hadoop.util.StringUtils.getStackTrace(StringUtils.java:945) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.writeUnlock(FSNamesystem.java:1659) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFSImage(FSNamesystem.java:1074) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFromDisk(FSNamesystem.java:703) > org.apache.hadoop.hdfs.server.namenode.NameNode.loadNamesystem(NameNode.java:688) > org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:752) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:992) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:976) > org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1701) > org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1769) > Number of suppressed write-lock reports: 0 > Longest write-lock held interval: 7159 > {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Commented] (HDFS-12516) Suppress the fsnamesystem lock warning on nn startup
[ https://issues.apache.org/jira/browse/HDFS-12516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178378#comment-16178378 ] Hadoop QA commented on HDFS-12516: -- | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem || Runtime || Comment || | {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 0m 20s{color} | {color:blue} Docker mode activated. {color} | || || || || {color:brown} Prechecks {color} || | {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s{color} | {color:green} The patch does not contain any @author tags. {color} | | {color:green}+1{color} | {color:green} test4tests {color} | {color:green} 0m 0s{color} | {color:green} The patch appears to include 1 new or modified test files. {color} | || || || || {color:brown} trunk Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 14m 8s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 0s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 41s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} mvnsite {color} | {color:green} 1m 4s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 1m 56s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 44s{color} | {color:green} trunk passed {color} | || || || || {color:brown} Patch Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 1m 15s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 17s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javac {color} | {color:green} 1m 17s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 40s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} mvnsite {color} | {color:green} 1m 11s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} whitespace {color} | {color:green} 0m 0s{color} | {color:green} The patch has no whitespace issues. {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 2m 13s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 45s{color} | {color:green} the patch passed {color} | || || || || {color:brown} Other Tests {color} || | {color:red}-1{color} | {color:red} unit {color} | {color:red} 98m 47s{color} | {color:red} hadoop-hdfs in the patch failed. {color} | | {color:green}+1{color} | {color:green} asflicense {color} | {color:green} 0m 16s{color} | {color:green} The patch does not generate ASF License warnings. {color} | | {color:black}{color} | {color:black} {color} | {color:black}127m 55s{color} | {color:black} {color} | \\ \\ || Reason || Tests || | Failed junit tests | hadoop.hdfs.server.datanode.TestDirectoryScanner | | | hadoop.hdfs.TestReadStripedFileWithMissingBlocks | | | hadoop.hdfs.server.namenode.TestReencryption | \\ \\ || Subsystem || Report/Notes || | Docker | Image:yetus/hadoop:71bbb86 | | JIRA Issue | HDFS-12516 | | JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12888758/HDFS-12516.03.patch | | Optional Tests | asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle | | uname | Linux 027765252d8b 3.13.0-129-generic #178-Ubuntu SMP Fri Aug 11 12:48:20 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux | | Build tool | maven | | Personality | /testptch/hadoop/patchprocess/precommit/personality/provided.sh | | git revision | trunk / 415e5a1 | | Default Java | 1.8.0_144 | | findbugs | v3.1.0-RC1 | | unit | https://builds.apache.org/job/PreCommit-HDFS-Build/21331/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt | | Test Results | https://builds.apache.org/job/PreCommit-HDFS-Build/21331/testReport/ | | modules | C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs | | Console output | https://builds.apache.org/job/PreCommit-HDFS-Build/21331/console | | Powered by | Apache Yetus 0.6.0-SNAPSHOT http://yetus.apache.org | This message was automatically generated. > Suppress the fsnamesystem lock warning on nn startup > > > Key: HDFS-12516 > URL: https://issues.apache.org/jira/browse/HDFS-12516 > Project: Hadoop HDFS > Issue Type: Improvement >Reporter: Ajay Kumar >Assignee: Ajay Kumar >
[jira] [Commented] (HDFS-12516) Suppress the fsnamesystem lock warning on nn startup
[ https://issues.apache.org/jira/browse/HDFS-12516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178343#comment-16178343 ] Anu Engineer commented on HDFS-12516: - [~ajayydv] Thanks for updating the patch. +1, v3 patch pending jenkins. > Suppress the fsnamesystem lock warning on nn startup > > > Key: HDFS-12516 > URL: https://issues.apache.org/jira/browse/HDFS-12516 > Project: Hadoop HDFS > Issue Type: Improvement >Reporter: Ajay Kumar >Assignee: Ajay Kumar > Attachments: HDFS-12516.01.patch, HDFS-12516.02.patch, > HDFS-12516.03.patch > > > Whenever FsNameSystemLock is held for more than configured value of > {{dfs.namenode.write-lock-reporting-threshold-ms}}, we log stacktrace and an > entry in metrics. Loading FSImage from disk will usually cross this > threshold. We can suppress this FsNamesystem lock warning on NameNode startup. > {code} > 17/09/20 21:41:39 INFO namenode.FSNamesystem: FSNamesystem write lock held > for 7159 ms via > java.lang.Thread.getStackTrace(Thread.java:1552) > org.apache.hadoop.util.StringUtils.getStackTrace(StringUtils.java:945) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.writeUnlock(FSNamesystem.java:1659) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFSImage(FSNamesystem.java:1074) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFromDisk(FSNamesystem.java:703) > org.apache.hadoop.hdfs.server.namenode.NameNode.loadNamesystem(NameNode.java:688) > org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:752) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:992) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:976) > org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1701) > org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1769) > Number of suppressed write-lock reports: 0 > Longest write-lock held interval: 7159 > {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Commented] (HDFS-12516) Suppress the fsnamesystem lock warning on nn startup
[ https://issues.apache.org/jira/browse/HDFS-12516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178339#comment-16178339 ] Ajay Kumar commented on HDFS-12516: --- [~anu], you are right. I think this is what [~arpitagarwal] suggested initially. Made the change in patch v3. > Suppress the fsnamesystem lock warning on nn startup > > > Key: HDFS-12516 > URL: https://issues.apache.org/jira/browse/HDFS-12516 > Project: Hadoop HDFS > Issue Type: Improvement >Reporter: Ajay Kumar >Assignee: Ajay Kumar > Attachments: HDFS-12516.01.patch, HDFS-12516.02.patch, > HDFS-12516.03.patch > > > Whenever FsNameSystemLock is held for more than configured value of > {{dfs.namenode.write-lock-reporting-threshold-ms}}, we log stacktrace and an > entry in metrics. Loading FSImage from disk will usually cross this > threshold. We can suppress this FsNamesystem lock warning on NameNode startup. > {code} > 17/09/20 21:41:39 INFO namenode.FSNamesystem: FSNamesystem write lock held > for 7159 ms via > java.lang.Thread.getStackTrace(Thread.java:1552) > org.apache.hadoop.util.StringUtils.getStackTrace(StringUtils.java:945) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.writeUnlock(FSNamesystem.java:1659) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFSImage(FSNamesystem.java:1074) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFromDisk(FSNamesystem.java:703) > org.apache.hadoop.hdfs.server.namenode.NameNode.loadNamesystem(NameNode.java:688) > org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:752) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:992) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:976) > org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1701) > org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1769) > Number of suppressed write-lock reports: 0 > Longest write-lock held interval: 7159 > {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Commented] (HDFS-12516) Suppress the fsnamesystem lock warning on nn startup
[ https://issues.apache.org/jira/browse/HDFS-12516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178000#comment-16178000 ] Anu Engineer commented on HDFS-12516: - Forgive me if my code reading is wrong. This is more of a question. It looks to me that you might not need this line change. {{if (logReport && !suppressWriteLockReport)}} Here what I am thinking, the only way {{logReport}} can be true is if {{needReport}} is true, but {{needReport}} is dependent on {{supressWriteReportLockReport}}. As I said, this is based on a casual code reading so please check and confirm. > Suppress the fsnamesystem lock warning on nn startup > > > Key: HDFS-12516 > URL: https://issues.apache.org/jira/browse/HDFS-12516 > Project: Hadoop HDFS > Issue Type: Improvement >Reporter: Ajay Kumar >Assignee: Ajay Kumar > Attachments: HDFS-12516.01.patch, HDFS-12516.02.patch > > > Whenever FsNameSystemLock is held for more than configured value of > {{dfs.namenode.write-lock-reporting-threshold-ms}}, we log stacktrace and an > entry in metrics. Loading FSImage from disk will usually cross this > threshold. We can suppress this FsNamesystem lock warning on NameNode startup. > {code} > 17/09/20 21:41:39 INFO namenode.FSNamesystem: FSNamesystem write lock held > for 7159 ms via > java.lang.Thread.getStackTrace(Thread.java:1552) > org.apache.hadoop.util.StringUtils.getStackTrace(StringUtils.java:945) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.writeUnlock(FSNamesystem.java:1659) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFSImage(FSNamesystem.java:1074) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFromDisk(FSNamesystem.java:703) > org.apache.hadoop.hdfs.server.namenode.NameNode.loadNamesystem(NameNode.java:688) > org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:752) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:992) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:976) > org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1701) > org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1769) > Number of suppressed write-lock reports: 0 > Longest write-lock held interval: 7159 > {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Commented] (HDFS-12516) Suppress the fsnamesystem lock warning on nn startup
[ https://issues.apache.org/jira/browse/HDFS-12516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177348#comment-16177348 ] Hadoop QA commented on HDFS-12516: -- | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem || Runtime || Comment || | {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 0m 13s{color} | {color:blue} Docker mode activated. {color} | || || || || {color:brown} Prechecks {color} || | {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s{color} | {color:green} The patch does not contain any @author tags. {color} | | {color:green}+1{color} | {color:green} test4tests {color} | {color:green} 0m 0s{color} | {color:green} The patch appears to include 1 new or modified test files. {color} | || || || || {color:brown} trunk Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 15m 5s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 0s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 45s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} mvnsite {color} | {color:green} 1m 2s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 1m 55s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 48s{color} | {color:green} trunk passed {color} | || || || || {color:brown} Patch Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 1m 2s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 0m 47s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javac {color} | {color:green} 0m 47s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 36s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} mvnsite {color} | {color:green} 0m 51s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} whitespace {color} | {color:green} 0m 0s{color} | {color:green} The patch has no whitespace issues. {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 1m 47s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 38s{color} | {color:green} the patch passed {color} | || || || || {color:brown} Other Tests {color} || | {color:red}-1{color} | {color:red} unit {color} | {color:red} 99m 43s{color} | {color:red} hadoop-hdfs in the patch failed. {color} | | {color:green}+1{color} | {color:green} asflicense {color} | {color:green} 0m 18s{color} | {color:green} The patch does not generate ASF License warnings. {color} | | {color:black}{color} | {color:black} {color} | {color:black}127m 51s{color} | {color:black} {color} | \\ \\ || Reason || Tests || | Failed junit tests | hadoop.hdfs.server.datanode.TestDataNodeVolumeFailureReporting | | | hadoop.hdfs.server.datanode.TestBlockScanner | | | hadoop.hdfs.server.namenode.TestReencryptionWithKMS | | | hadoop.hdfs.server.blockmanagement.TestBlockStatsMXBean | \\ \\ || Subsystem || Report/Notes || | Docker | Image:yetus/hadoop:71bbb86 | | JIRA Issue | HDFS-12516 | | JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12888576/HDFS-12516.02.patch | | Optional Tests | asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle | | uname | Linux f735c2a86ad5 3.13.0-129-generic #178-Ubuntu SMP Fri Aug 11 12:48:20 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux | | Build tool | maven | | Personality | /testptch/hadoop/patchprocess/precommit/personality/provided.sh | | git revision | trunk / 8d29bf5 | | Default Java | 1.8.0_144 | | findbugs | v3.1.0-RC1 | | unit | https://builds.apache.org/job/PreCommit-HDFS-Build/21311/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt | | Test Results | https://builds.apache.org/job/PreCommit-HDFS-Build/21311/testReport/ | | modules | C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs | | Console output | https://builds.apache.org/job/PreCommit-HDFS-Build/21311/console | | Powered by | Apache Yetus 0.6.0-SNAPSHOT http://yetus.apache.org | This message was automatically generated. > Suppress the fsnamesystem lock warning on nn startup > > > Key: HDFS-12516 > URL: https://issues.apache.org/jira/browse/HDFS-12516 > Project: Hadoop HDFS > Issue Type:
[jira] [Commented] (HDFS-12516) Suppress the fsnamesystem lock warning on nn startup
[ https://issues.apache.org/jira/browse/HDFS-12516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177187#comment-16177187 ] Arpit Agarwal commented on HDFS-12516: -- +1 pending Jenkins. > Suppress the fsnamesystem lock warning on nn startup > > > Key: HDFS-12516 > URL: https://issues.apache.org/jira/browse/HDFS-12516 > Project: Hadoop HDFS > Issue Type: Improvement >Reporter: Ajay Kumar >Assignee: Ajay Kumar > Attachments: HDFS-12516.01.patch, HDFS-12516.02.patch > > > Whenever FsNameSystemLock is held for more than configured value of > {{dfs.namenode.write-lock-reporting-threshold-ms}}, we log stacktrace and an > entry in metrics. Loading FSImage from disk will usually cross this > threshold. We can suppress this FsNamesystem lock warning on NameNode startup. > {code} > 17/09/20 21:41:39 INFO namenode.FSNamesystem: FSNamesystem write lock held > for 7159 ms via > java.lang.Thread.getStackTrace(Thread.java:1552) > org.apache.hadoop.util.StringUtils.getStackTrace(StringUtils.java:945) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.writeUnlock(FSNamesystem.java:1659) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFSImage(FSNamesystem.java:1074) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFromDisk(FSNamesystem.java:703) > org.apache.hadoop.hdfs.server.namenode.NameNode.loadNamesystem(NameNode.java:688) > org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:752) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:992) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:976) > org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1701) > org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1769) > Number of suppressed write-lock reports: 0 > Longest write-lock held interval: 7159 > {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Commented] (HDFS-12516) Suppress the fsnamesystem lock warning on nn startup
[ https://issues.apache.org/jira/browse/HDFS-12516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177167#comment-16177167 ] Ajay Kumar commented on HDFS-12516: --- [~arpitagarwal] thanks for review. Patch v2 has suggested changes. > Suppress the fsnamesystem lock warning on nn startup > > > Key: HDFS-12516 > URL: https://issues.apache.org/jira/browse/HDFS-12516 > Project: Hadoop HDFS > Issue Type: Improvement >Reporter: Ajay Kumar >Assignee: Ajay Kumar > Attachments: HDFS-12516.01.patch, HDFS-12516.02.patch > > > Whenever FsNameSystemLock is held for more than configured value of > {{dfs.namenode.write-lock-reporting-threshold-ms}}, we log stacktrace and an > entry in metrics. Loading FSImage from disk will usually cross this > threshold. We can suppress this FsNamesystem lock warning on NameNode startup. > {code} > 17/09/20 21:41:39 INFO namenode.FSNamesystem: FSNamesystem write lock held > for 7159 ms via > java.lang.Thread.getStackTrace(Thread.java:1552) > org.apache.hadoop.util.StringUtils.getStackTrace(StringUtils.java:945) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.writeUnlock(FSNamesystem.java:1659) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFSImage(FSNamesystem.java:1074) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFromDisk(FSNamesystem.java:703) > org.apache.hadoop.hdfs.server.namenode.NameNode.loadNamesystem(NameNode.java:688) > org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:752) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:992) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:976) > org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1701) > org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1769) > Number of suppressed write-lock reports: 0 > Longest write-lock held interval: 7159 > {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Commented] (HDFS-12516) Suppress the fsnamesystem lock warning on nn startup
[ https://issues.apache.org/jira/browse/HDFS-12516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176755#comment-16176755 ] Arpit Agarwal commented on HDFS-12516: -- The change looks great! Minor comments: # The method {{writeUnlockFS(String opName, boolean suppressWriteLockReport)}} can be combined into writeUnlock as there is no other caller. # We can update the check in writeUnlockFS so suppressWriteLockReport is rolled into needReport e.g. {code} final boolean needReport = !suppressWriteLockReport && coarseLock.getWriteHoldCount() == 1 && coarseLock.isWriteLockedByCurrentThread(); {code} Also the test case seems to pass even without the suppressWriteLockReport check in writeUnlockFS. Didn't debug it further. > Suppress the fsnamesystem lock warning on nn startup > > > Key: HDFS-12516 > URL: https://issues.apache.org/jira/browse/HDFS-12516 > Project: Hadoop HDFS > Issue Type: Improvement >Reporter: Ajay Kumar >Assignee: Ajay Kumar > Attachments: HDFS-12516.01.patch > > > Whenever FsNameSystemLock is held for more than configured value of > {{dfs.lock.suppress.warning.interval}}, we log stacktrace and an entry in > metrics. Loading FSImage from disk will usually cross this threshold. We can > suppress this FsNamesystem lock warning on NameNode startup. > {code} > 17/09/20 21:41:39 INFO namenode.FSNamesystem: FSNamesystem write lock held > for 7159 ms via > java.lang.Thread.getStackTrace(Thread.java:1552) > org.apache.hadoop.util.StringUtils.getStackTrace(StringUtils.java:945) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.writeUnlock(FSNamesystem.java:1659) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFSImage(FSNamesystem.java:1074) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFromDisk(FSNamesystem.java:703) > org.apache.hadoop.hdfs.server.namenode.NameNode.loadNamesystem(NameNode.java:688) > org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:752) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:992) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:976) > org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1701) > org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1769) > Number of suppressed write-lock reports: 0 > Longest write-lock held interval: 7159 > {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Commented] (HDFS-12516) Suppress the fsnamesystem lock warning on nn startup
[ https://issues.apache.org/jira/browse/HDFS-12516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176647#comment-16176647 ] Ajay Kumar commented on HDFS-12516: --- Test failures are unrelated. All 3 failed tests passes locally. In jenkins 2 were timed out while 3rd one has connection refused error. > Suppress the fsnamesystem lock warning on nn startup > > > Key: HDFS-12516 > URL: https://issues.apache.org/jira/browse/HDFS-12516 > Project: Hadoop HDFS > Issue Type: Improvement >Reporter: Ajay Kumar >Assignee: Ajay Kumar > Attachments: HDFS-12516.01.patch > > > Whenever FsNameSystemLock is held for more than configured value of > {{dfs.lock.suppress.warning.interval}}, we log stacktrace and an entry in > metrics. Loading FSImage from disk will usually cross this threshold. We can > suppress this FsNamesystem lock warning on NameNode startup. > {code} > 17/09/20 21:41:39 INFO namenode.FSNamesystem: FSNamesystem write lock held > for 7159 ms via > java.lang.Thread.getStackTrace(Thread.java:1552) > org.apache.hadoop.util.StringUtils.getStackTrace(StringUtils.java:945) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.writeUnlock(FSNamesystem.java:1659) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFSImage(FSNamesystem.java:1074) > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.loadFromDisk(FSNamesystem.java:703) > org.apache.hadoop.hdfs.server.namenode.NameNode.loadNamesystem(NameNode.java:688) > org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:752) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:992) > org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:976) > org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1701) > org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1769) > Number of suppressed write-lock reports: 0 > Longest write-lock held interval: 7159 > {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Commented] (HDFS-12516) Suppress the fsnamesystem lock warning on nn startup
[ https://issues.apache.org/jira/browse/HDFS-12516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16176013#comment-16176013 ] Hadoop QA commented on HDFS-12516: -- | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem || Runtime || Comment || | {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 0m 12s{color} | {color:blue} Docker mode activated. {color} | || || || || {color:brown} Prechecks {color} || | {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s{color} | {color:green} The patch does not contain any @author tags. {color} | | {color:green}+1{color} | {color:green} test4tests {color} | {color:green} 0m 0s{color} | {color:green} The patch appears to include 1 new or modified test files. {color} | || || || || {color:brown} trunk Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 15m 4s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 0m 48s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 39s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} mvnsite {color} | {color:green} 0m 56s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 1m 39s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 41s{color} | {color:green} trunk passed {color} | || || || || {color:brown} Patch Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 0m 49s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 0m 48s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javac {color} | {color:green} 0m 48s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 35s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} mvnsite {color} | {color:green} 0m 52s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} whitespace {color} | {color:green} 0m 0s{color} | {color:green} The patch has no whitespace issues. {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 1m 47s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 38s{color} | {color:green} the patch passed {color} | || || || || {color:brown} Other Tests {color} || | {color:red}-1{color} | {color:red} unit {color} | {color:red} 96m 24s{color} | {color:red} hadoop-hdfs in the patch failed. {color} | | {color:green}+1{color} | {color:green} asflicense {color} | {color:green} 0m 23s{color} | {color:green} The patch does not generate ASF License warnings. {color} | | {color:black}{color} | {color:black} {color} | {color:black}123m 33s{color} | {color:black} {color} | \\ \\ || Reason || Tests || | Failed junit tests | hadoop.hdfs.server.namenode.TestReencryptionWithKMS | | | hadoop.hdfs.server.blockmanagement.TestUnderReplicatedBlocks | | | hadoop.hdfs.server.datanode.TestDataNodeVolumeFailureReporting | \\ \\ || Subsystem || Report/Notes || | Docker | Image:yetus/hadoop:71bbb86 | | JIRA Issue | HDFS-12516 | | JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12888433/HDFS-12516.01.patch | | Optional Tests | asflicense compile javac javadoc mvninstall mvnsite unit findbugs checkstyle | | uname | Linux 802dde5f8151 3.13.0-123-generic #172-Ubuntu SMP Mon Jun 26 18:04:35 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux | | Build tool | maven | | Personality | /testptch/hadoop/patchprocess/precommit/personality/provided.sh | | git revision | trunk / 53be075 | | Default Java | 1.8.0_144 | | findbugs | v3.1.0-RC1 | | unit | https://builds.apache.org/job/PreCommit-HDFS-Build/21296/artifact/patchprocess/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt | | Test Results | https://builds.apache.org/job/PreCommit-HDFS-Build/21296/testReport/ | | modules | C: hadoop-hdfs-project/hadoop-hdfs U: hadoop-hdfs-project/hadoop-hdfs | | Console output | https://builds.apache.org/job/PreCommit-HDFS-Build/21296/console | | Powered by | Apache Yetus 0.6.0-SNAPSHOT http://yetus.apache.org | This message was automatically generated. > Suppress the fsnamesystem lock warning on nn startup > > > Key: HDFS-12516 > URL: https://issues.apache.org/jira/browse/HDFS-12516 > Project: Hadoop HDFS > Issue Type: Improvement >Reporter: Ajay Kumar >