[jira] [Commented] (RATIS-692) RaftStorageDirectory.tryLock throws a very deep IOException
[ https://issues.apache.org/jira/browse/RATIS-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16958407#comment-16958407 ]

Hadoop QA commented on RATIS-692:

| (x) *-1 overall* |

|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 0m 16s | Docker mode activated. |
|| || || || Prechecks ||
| 0 | findbugs | 0m 0s | Findbugs executables are not available. |
| +1 | @author | 0m 0s | The patch does not contain any @author tags. |
| +1 | test4tests | 0m 0s | The patch appears to include 1 new or modified test files. |
|| || || || master Compile Tests ||
| 0 | mvndep | 0m 55s | Maven dependency ordering for branch |
| +1 | mvninstall | 2m 12s | master passed |
| +1 | compile | 0m 50s | master passed |
| +1 | checkstyle | 0m 18s | master passed |
| +1 | javadoc | 0m 40s | master passed |
|| || || || Patch Compile Tests ||
| 0 | mvndep | 0m 5s | Maven dependency ordering for patch |
| +1 | mvninstall | 0m 59s | the patch passed |
| +1 | compile | 0m 50s | the patch passed |
| +1 | javac | 0m 50s | the patch passed |
| +1 | checkstyle | 0m 10s | the patch passed |
| +1 | whitespace | 0m 0s | The patch has no whitespace issues. |
| +1 | javadoc | 0m 34s | the patch passed |
|| || || || Other Tests ||
| -1 | unit | 32m 19s | root in the patch failed. |
| +1 | asflicense | 0m 17s | The patch does not generate ASF License warnings. |
| | | 40m 41s | |

|| Reason || Tests ||
| Failed junit tests | ratis.logservice.server.TestMetaServer |
| | ratis.grpc.TestRaftAsyncExceptionWithGrpc |
| | ratis.grpc.TestServerRestartWithGrpc |
| | ratis.grpc.TestRaftOutputStreamWithGrpc |
| | ratis.grpc.TestRaftServerWithGrpc |
| | ratis.grpc.TestWatchRequestWithGrpc |
| | ratis.grpc.TestLeaderElectionWithGrpc |
| | ratis.examples.filestore.TestFileStoreWithGrpc |
| | ratis.examples.filestore.TestFileStoreAsyncWithGrpc |

|| Subsystem || Report/Notes ||
| Docker | Client=19.03.4 Server=19.03.4 Image:yetus/ratis:date2019-10-23 |
| JIRA Issue | RATIS-692 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12982035/r692_20191003.patch |
| Optional Tests | dupname asflicense javac javadoc unit findbugs checkstyle compile |
| uname | Linux 72d4e94dd3dc 4.15.0-54-generic #58-Ubuntu SMP Mon Jun 24 10:55:24 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /home/jenkins/jenkins-slave/workspace/PreCommit-RATIS-Build/yetus-personality.sh |
| git revision | master / 55cbfbb |
| maven | version: Apache Maven 3.6.2 (40f52333136460af0dc0d7232c0dc0bcf0d9e117; 2019-08-27T15:06:16Z) |
| Default Java | 1.8.0_222 |
| unit | https://builds.apache.org/job/PreCommit-RATIS-Build/1102/artifact/out/patch-unit-root.txt |
| Test Results | https://builds.apache.org/job/PreCommit-RATIS-Build/1102/testReport/ |
| Max. process+thread count | 1285 (vs. ulimit of 5000) |
| modules | C: ratis-common ratis-server U: . |
| Console output | https://builds.apache.org/job/PreCommit-RATIS-Build/1102/console |
| Powered by | Apache Yetus 0.8.0 http://yetus.apache.org |

This message was automatically generated.

> RaftStorageDirectory.tryLock throws a very deep IOException
>
[jira] [Commented] (RATIS-692) RaftStorageDirectory.tryLock throws a very deep IOException
[ https://issues.apache.org/jira/browse/RATIS-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16958324#comment-16958324 ]

Jitendra Nath Pandey commented on RATIS-692:

+1 for the patch, pending jenkins.

> RaftStorageDirectory.tryLock throws a very deep IOException
> ---
>
> Key: RATIS-692
> URL: https://issues.apache.org/jira/browse/RATIS-692
> Project: Ratis
> Issue Type: Sub-task
> Components: server
> Reporter: Clay B.
> Assignee: Tsz-wo Sze
> Priority: Major
> Labels: namazu
> Attachments: r692_20190928.patch, r692_20191002.patch, r692_20191003.patch
>
>
> Working with our Namazu infrastructure, the first issue I hit when dialing up the faulty I/O injection rate is as follows:
> {code}
> 2019-09-27 14:13:45 ERROR RaftStorageDirectory:336 - Failed to acquire lock on /home/vagrant/test_data/data0_slowed/64656d6f-5261-6674-4772-6f7570313233/in_use.lock. If this storage directory is mounted via NFS, ensure that the appropriate nfs lock services are running.
> java.io.IOException: Input/output error
>   at java.io.RandomAccessFile.writeBytes(Native Method)
>   at java.io.RandomAccessFile.write(RandomAccessFile.java:512)
>   at org.apache.ratis.server.storage.RaftStorageDirectory.tryLock(RaftStorageDirectory.java:327)
>   at org.apache.ratis.server.storage.RaftStorageDirectory.lock(RaftStorageDirectory.java:291)
>   at org.apache.ratis.server.storage.RaftStorageDirectory.analyzeStorage(RaftStorageDirectory.java:264)
>   at org.apache.ratis.server.storage.RaftStorage.analyzeAndRecoverStorage(RaftStorage.java:100)
>   at org.apache.ratis.server.storage.RaftStorage.<init>(RaftStorage.java:63)
>   at org.apache.ratis.server.impl.ServerState.<init>(ServerState.java:109)
>   at org.apache.ratis.server.impl.RaftServerImpl.<init>(RaftServerImpl.java:110)
>   at org.apache.ratis.server.impl.RaftServerProxy.lambda$newRaftServerImpl$2(RaftServerProxy.java:208)
>   at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1590)
>   at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>   at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>   at java.lang.Thread.run(Thread.java:748)
> Exception in thread "main" java.io.IOException: Input/output error
>   at java.io.RandomAccessFile.writeBytes(Native Method)
>   at java.io.RandomAccessFile.write(RandomAccessFile.java:512)
>   at org.apache.ratis.server.storage.RaftStorageDirectory.tryLock(RaftStorageDirectory.java:327)
>   at org.apache.ratis.server.storage.RaftStorageDirectory.lock(RaftStorageDirectory.java:291)
>   at org.apache.ratis.server.storage.RaftStorageDirectory.analyzeStorage(RaftStorageDirectory.java:264)
>   at org.apache.ratis.server.storage.RaftStorage.analyzeAndRecoverStorage(RaftStorage.java:100)
>   at org.apache.ratis.server.storage.RaftStorage.<init>(RaftStorage.java:63)
>   at org.apache.ratis.server.impl.ServerState.<init>(ServerState.java:109)
>   at org.apache.ratis.server.impl.RaftServerImpl.<init>(RaftServerImpl.java:110)
>   at org.apache.ratis.server.impl.RaftServerProxy.lambda$newRaftServerImpl$2(RaftServerProxy.java:208)
>   at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1590)
>   at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>   at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>   at java.lang.Thread.run(Thread.java:748)
> {code}
> It looks like the call chain does not re-try anywhere however.

--
This message was sent by Atlassian Jira (v8.3.4#803005)
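
The report's observation is that an IOException raised while writing the lock file propagates through every caller without a retry. As a minimal sketch of what a bounded retry around lock acquisition could look like, the Java below uses only standard java.io and java.nio calls. It is an illustration under stated assumptions, not the content of the attached r692 patches and not Ratis API: the class LockRetrySketch, the method lockWithRetry, and the parameters maxAttempts and sleepMillis are all hypothetical names.

```java
import java.io.File;
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.channels.FileLock;
import java.nio.channels.OverlappingFileLockException;
import java.nio.charset.StandardCharsets;

// Hypothetical sketch; it does not reproduce the r692_*.patch attachments.
final class LockRetrySketch {
  private LockRetrySketch() {}

  /**
   * Tries to lock lockFile and write a marker into it, retrying on IOException.
   * The caller owns the returned lock and must release it and close its channel.
   */
  static FileLock lockWithRetry(File lockFile, int maxAttempts, long sleepMillis)
      throws IOException, InterruptedException {
    if (maxAttempts < 1) {
      throw new IllegalArgumentException("maxAttempts must be >= 1");
    }
    IOException last = null;
    for (int attempt = 1; attempt <= maxAttempts; attempt++) {
      RandomAccessFile file = null;
      try {
        file = new RandomAccessFile(lockFile, "rws");
        final FileLock lock = file.getChannel().tryLock();
        if (lock == null) {
          throw new IOException(lockFile + " is locked by another process");
        }
        // The reported stack trace fails in RandomAccessFile.write inside tryLock,
        // so the write is kept inside the retried section.
        file.write(("attempt " + attempt + "\n").getBytes(StandardCharsets.UTF_8));
        return lock;
      } catch (OverlappingFileLockException e) {
        // This JVM already holds the lock; retrying cannot help.
        closeQuietly(file);
        throw new IOException(lockFile + " is already locked by this JVM", e);
      } catch (IOException e) {
        last = e;
        closeQuietly(file); // closing the file also releases a lock taken in this attempt
        if (attempt < maxAttempts) {
          Thread.sleep(sleepMillis);
        }
      }
    }
    throw new IOException(
        "Failed to lock " + lockFile + " after " + maxAttempts + " attempts", last);
  }

  private static void closeQuietly(RandomAccessFile file) {
    if (file != null) {
      try {
        file.close();
      } catch (IOException ignored) {
        // Best effort: the original failure is what gets reported.
      }
    }
  }

  public static void main(String[] args) throws Exception {
    File lockFile = File.createTempFile("in_use", ".lock");
    lockFile.deleteOnExit();
    FileLock lock = lockWithRetry(lockFile, 3, 100L);
    try {
      System.out.println("lock valid: " + lock.isValid());
    } finally {
      lock.release();
      lock.channel().close();
    }
  }
}
```

Whether retrying is the right response depends on the failure: a transient injected I/O fault may clear on the next attempt, while a lock held by another process will not, which is why the sketch caps the attempts and keeps the last IOException as the cause.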
[jira] [Commented] (RATIS-692) RaftStorageDirectory.tryLock throws a very deep IOException
[ https://issues.apache.org/jira/browse/RATIS-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16958266#comment-16958266 ]

Hadoop QA commented on RATIS-692:

| (x) *-1 overall* |

|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 0m 0s | Docker mode activated. |
| -1 | patch | 0m 6s | RATIS-692 does not apply to master. Rebase required? Wrong Branch? See https://yetus.apache.org/documentation/0.8.0/precommit-patchnames for help. |

|| Subsystem || Report/Notes ||
| JIRA Issue | RATIS-692 |
| Console output | https://builds.apache.org/job/PreCommit-RATIS-Build/1101/console |
| Powered by | Apache Yetus 0.8.0 http://yetus.apache.org |

This message was automatically generated.

> RaftStorageDirectory.tryLock throws a very deep IOException
> ---
>
> Key: RATIS-692
> URL: https://issues.apache.org/jira/browse/RATIS-692
> Project: Ratis
> Issue Type: Sub-task
> Components: server
> Reporter: Clay B.
> Assignee: Tsz-wo Sze
> Priority: Major
> Labels: namazu
> Attachments: image.png, r692_20190928.patch, r692_20191002.patch, r692_20191003.patch
[jira] [Commented] (RATIS-692) RaftStorageDirectory.tryLock throws a very deep IOException
[ https://issues.apache.org/jira/browse/RATIS-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16943241#comment-16943241 ]

Hadoop QA commented on RATIS-692:

| (x) *-1 overall* |

|| Vote || Subsystem || Runtime || Comment ||
| 0 | reexec | 0m 17s | Docker mode activated. |
|| || || || Prechecks ||
| 0 | findbugs | 0m 0s | Findbugs executables are not available. |
| +1 | @author | 0m 0s | The patch does not contain any @author tags. |
| +1 | test4tests | 0m 0s | The patch appears to include 1 new or modified test files. |
|| || || || master Compile Tests ||
| 0 | mvndep | 0m 56s | Maven dependency ordering for branch |
| +1 | mvninstall | 2m 22s | master passed |
| +1 | compile | 0m 53s | master passed |
| +1 | checkstyle | 0m 19s | master passed |
| +1 | javadoc | 0m 41s | master passed |
|| || || || Patch Compile Tests ||
| 0 | mvndep | 0m 6s | Maven dependency ordering for patch |
| +1 | mvninstall | 1m 2s | the patch passed |
| +1 | compile | 0m 52s | the patch passed |
| +1 | javac | 0m 52s | the patch passed |
| +1 | checkstyle | 0m 11s | the patch passed |
| +1 | whitespace | 0m 0s | The patch has no whitespace issues. |
| +1 | javadoc | 0m 35s | the patch passed |
|| || || || Other Tests ||
| -1 | unit | 17m 58s | root in the patch failed. |
| +1 | asflicense | 0m 19s | The patch does not generate ASF License warnings. |
| | | 26m 50s | |

|| Reason || Tests ||
| Failed junit tests | ratis.netty.TestRaftReconfigurationWithNetty |
| | ratis.grpc.TestWatchRequestWithGrpc |
| | ratis.examples.filestore.TestFileStoreWithGrpc |

|| Subsystem || Report/Notes ||
| Docker | Client=19.03.1 Server=19.03.1 Image:yetus/ratis:date2019-10-02 |
| JIRA Issue | RATIS-692 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12982035/r692_20191003.patch |
| Optional Tests | dupname asflicense javac javadoc unit findbugs checkstyle compile |
| uname | Linux 6ed74d209442 4.15.0-54-generic #58-Ubuntu SMP Mon Jun 24 10:55:24 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /home/jenkins/jenkins-slave/workspace/PreCommit-RATIS-Build/yetus-personality.sh |
| git revision | master / ecef287 |
| maven | version: Apache Maven 3.6.2 (40f52333136460af0dc0d7232c0dc0bcf0d9e117; 2019-08-27T15:06:16Z) |
| Default Java | 1.8.0_222 |
| unit | https://builds.apache.org/job/PreCommit-RATIS-Build/1026/artifact/out/patch-unit-root.txt |
| Test Results | https://builds.apache.org/job/PreCommit-RATIS-Build/1026/testReport/ |
| Max. process+thread count | 2693 (vs. ulimit of 5000) |
| modules | C: ratis-common ratis-server U: . |
| Console output | https://builds.apache.org/job/PreCommit-RATIS-Build/1026/console |
| Powered by | Apache Yetus 0.8.0 http://yetus.apache.org |

This message was automatically generated.

> RaftStorageDirectory.tryLock throws a very deep IOException
> ---
>
> Key: RATIS-692
> URL: https://issues.apache.org/jira/browse/RATIS-692
> Project: Ratis
> Issue Type: Sub-task
> Components: server
> Reporter: Clay B.
>
[jira] [Commented] (RATIS-692) RaftStorageDirectory.tryLock throws a very deep IOException
[ https://issues.apache.org/jira/browse/RATIS-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16943214#comment-16943214 ]

Tsz-wo Sze commented on RATIS-692:

r692_20191002.patch: fixes checkstyle warnings.

> RaftStorageDirectory.tryLock throws a very deep IOException
> ---
>
> Key: RATIS-692
> URL: https://issues.apache.org/jira/browse/RATIS-692
> Project: Ratis
> Issue Type: Sub-task
> Components: server
> Reporter: Clay B.
> Assignee: Tsz-wo Sze
> Priority: Major
> Attachments: r692_20190928.patch, r692_20191002.patch, r692_20191003.patch
[jira] [Commented] (RATIS-692) RaftStorageDirectory.tryLock throws a very deep IOException
[ https://issues.apache.org/jira/browse/RATIS-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16942879#comment-16942879 ]

Tsz-wo Sze commented on RATIS-692:

Thanks [~clayb] for testing the patch.

r692_20191002.patch: some refactoring so that the code can be shared with RATIS-696.

> RaftStorageDirectory.tryLock throws a very deep IOException
> ---
>
> Key: RATIS-692
> URL: https://issues.apache.org/jira/browse/RATIS-692
> Project: Ratis
> Issue Type: Sub-task
> Components: server
> Reporter: Clay B.
> Assignee: Tsz-wo Sze
> Priority: Major
> Attachments: r692_20190928.patch, r692_20191002.patch
[jira] [Commented] (RATIS-692) RaftStorageDirectory.tryLock throws a very deep IOException
[ https://issues.apache.org/jira/browse/RATIS-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16940046#comment-16940046 ]

Clay B. commented on RATIS-692:

Thanks [~szetszwo]; this solved this issue in my tests.

> RaftStorageDirectory.tryLock throws a very deep IOException
> ---
>
> Key: RATIS-692
> URL: https://issues.apache.org/jira/browse/RATIS-692
> Project: Ratis
> Issue Type: Sub-task
> Components: server
> Reporter: Clay B.
> Assignee: Tsz-wo Sze
> Priority: Major
> Attachments: r692_20190928.patch