[ https://issues.apache.org/jira/browse/KAFKA-19289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
felixzh updated KAFKA-19289:
----------------------------
    Description: On a read-only disk failure, logManager.handleLogDirFailure(dir), called from ReplicaManager.scala, does not catch the exception correctly. This interrupts the failure-handling process, so leadership for partitions of topics with multiple replicas does not fail over.

  was:
On a read-only disk failure, logManager.handleLogDirFailure(dir), called from ReplicaManager.scala, does not catch the exception correctly. This interrupts the failure-handling process, so leadership for partitions of topics with multiple replicas does not fail over.

kernel: print_req_error: I/O error, dev sdc, sector 2536082402
kernel: sd 1:0:7:0: [sdc] tag#14 FAILED Result: hostbyte=DID_RESET driverbyte=DRIVER_OK
kernel: sd 1:0:7:0: [sdc] tag#14 CDB: Write(16) 8a 00 00 00 00 00 97 29 90 4a 00 00 04 00 00 00

java.io.IOException: Read-only file system
    at java.io.UnixFileSystem.createFileExclusively(Native Method)
    at java.io.File.createNewFile(File.java:1012)
    at kafka.log.AbstractIndex.<init>(AbstractIndex.scala:113)
    at kafka.log.OffsetIndex.<init>(OffsetIndex.scala:54)
    at kafka.log.LazyOffsetIndex.get(OffsetIndex.scala:238)
    at kafka.log.LogSegment.offsetIndex(LogSegment.scala:64)
    at kafka.log.LogSegment$$anonfun$closeHandlers$1.apply$mcV$sp(LogSegment.scala:612)
    at kafka.utils.CoreUtils$.swallow(CoreUtils.scala:88)
    at kafka.log.LogSegment.closeHandlers(LogSegment.scala:612)
    at kafka.log.Log$$anonfun$closeHandlers$2.apply(Log.scala:803)
    at kafka.log.Log$$anonfun$closeHandlers$2.apply(Log.scala:803)
    at scala.collection.Iterator$class.foreach(Iterator.scala:891)
    at scala.collection.AbstractIterator.foreach(Iterator.scala:1334)
    at scala.collection.IterableLike$class.foreach(IterableLike.scala:72)
    at scala.collection.AbstractIterable.foreach(Iterable.scala:54)
    at kafka.log.Log.closeHandlers(Log.scala:803)
    at kafka.log.LogManager$$anonfun$handleLogDirFailure$5.apply(LogManager.scala:210)
    at kafka.log.LogManager$$anonfun$handleLogDirFailure$5.apply(LogManager.scala:207)
    at scala.collection.immutable.List.foreach(List.scala:392)
    at kafka.log.LogManager.handleLogDirFailure(LogManager.scala:207)
    at kafka.server.ReplicaManager.handleLogDirFailure(ReplicaManager.scala:1489)
    at kafka.server.ReplicaManager$LogDirFailureHandler.doWork(ReplicaManager.scala:210)
    at kafka.utils.ShutdownableThread.run(ShutdownableThread.scala:89)


> Read only disk failure, ReplicaManager.scala's
> logManager.handleLogDirFailure(dir) not capturing exception correctly,
> interrupt process causing topic with multiple copies partition leader
> failure not switching.
> --------------------------------------------------------------------------------
>
>                 Key: KAFKA-19289
>                 URL: https://issues.apache.org/jira/browse/KAFKA-19289
>             Project: Kafka
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 2.7.2, 3.2.3, 3.8.0, 3.9.0, 4.0.0
>            Reporter: felixzh
>            Assignee: felixzh
>            Priority: Major
>
> On a read-only disk failure, logManager.handleLogDirFailure(dir), called
> from ReplicaManager.scala, does not catch the exception correctly. This
> interrupts the failure-handling process, so leadership for partitions of
> topics with multiple replicas does not fail over.
>
>



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
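
The failure mode described in this issue can be illustrated with a minimal, self-contained sketch. This is not Kafka's code: the class LogDirFailureSketch, the PartitionLog type, the methods handleFragile and handleGuarded, and the "leader switch triggered" step are all hypothetical names chosen for illustration. The sketch only assumes the general pattern the report describes, where an exception thrown while closing one log aborts the loop and the follow-up step never runs.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class LogDirFailureSketch {

    /** Hypothetical stand-in for one partition log on the failed directory. */
    static final class PartitionLog {
        final String name;
        final boolean onReadOnlyDisk;

        PartitionLog(String name, boolean onReadOnlyDisk) {
            this.name = name;
            this.onReadOnlyDisk = onReadOnlyDisk;
        }

        /** Simulates a close that needs to create a file on the disk. */
        void closeHandlers() throws IOException {
            if (onReadOnlyDisk) {
                throw new IOException("Read-only file system");
            }
        }
    }

    /** Fragile variant: the first IOException aborts the loop and skips the final step. */
    static List<String> handleFragile(List<PartitionLog> logs) throws IOException {
        List<String> steps = new ArrayList<>();
        for (PartitionLog log : logs) {
            log.closeHandlers(); // an exception here propagates out of the method
            steps.add("closed " + log.name);
        }
        steps.add("leader switch triggered"); // hypothetical follow-up step
        return steps;
    }

    /** Guarded variant: each close is isolated, so the final step always runs. */
    static List<String> handleGuarded(List<PartitionLog> logs) {
        List<String> steps = new ArrayList<>();
        for (PartitionLog log : logs) {
            try {
                log.closeHandlers();
                steps.add("closed " + log.name);
            } catch (IOException e) {
                // Record the failure and continue with the remaining logs.
                steps.add("close failed for " + log.name + ": " + e.getMessage());
            }
        }
        steps.add("leader switch triggered"); // hypothetical follow-up step
        return steps;
    }

    public static void main(String[] args) {
        List<PartitionLog> logs = Arrays.asList(
                new PartitionLog("topic-0", true),
                new PartitionLog("topic-1", false));
        try {
            handleFragile(logs);
        } catch (IOException e) {
            System.out.println("fragile handler aborted: " + e.getMessage());
        }
        System.out.println(handleGuarded(logs));
    }
}
```

Whether the actual fix should catch the exception per log, per segment, or around the whole handler is a design decision for the Kafka code base. The sketch only shows why an uncaught exception in the middle of the handling sequence can prevent later steps from running.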