Apache9 commented on PR #5443:
URL: https://github.com/apache/hbase/pull/5443#issuecomment-1751987507
```
2023-10-08T06:42:20,756 ERROR
[RS:0;c85796445aa7:36275.replicationSource.wal-reader.c85796445aa7%2C36275%2C1696747321009-1696747340579-1,1-c85796445aa7,36275,1696747321009
{}] regionserver.ReplicationSource(452): Unexpected exception in
RS:0;c85796445aa7:36275.replicationSource.wal-reader.c85796445aa7%2C36275%2C1696747321009-1696747340579-1,1-c85796445aa7,36275,1696747321009
currentPath=hdfs://localhost:32875/user/jenkins/test-data/daf7aa3e-9117-b174-d093-9e8787b5dfb7/WALs/c85796445aa7,36275,1696747321009/c85796445aa7%2C36275%2C1696747321009-1696747340579-1.1696747340583.syncrep
java.lang.AssertionError: null
at
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.lastAttempt(WALEntryStream.java:303)
~[classes/:?]
at
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.tryAdvanceEntry(WALEntryStream.java:369)
~[classes/:?]
at
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.hasNext(WALEntryStream.java:129)
~[classes/:?]
at
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceWALReader.run(ReplicationSourceWALReader.java:147)
~[classes/:?]
2023-10-08T06:42:20,757 ERROR
[RS:0;c85796445aa7:36275.replicationSource.wal-reader.c85796445aa7%2C36275%2C1696747321009-1696747340579-1,1-c85796445aa7,36275,1696747321009
{}] regionserver.HRegionServer(2389): ***** ABORTING region server
c85796445aa7,36275,1696747321009: Unexpected exception in
RS:0;c85796445aa7:36275.replicationSource.wal-reader.c85796445aa7%2C36275%2C1696747321009-1696747340579-1,1-c85796445aa7,36275,1696747321009
*****
java.lang.AssertionError: null
at
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.lastAttempt(WALEntryStream.java:303)
~[classes/:?]
at
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.tryAdvanceEntry(WALEntryStream.java:369)
~[classes/:?]
at
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.hasNext(WALEntryStream.java:129)
~[classes/:?]
at
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceWALReader.run(ReplicationSourceWALReader.java:147)
~[classes/:?]
```
Really strange, it fails in assertion in lastAttemp, where we assume that
beingWritten must be false...
```
case EOF_AND_RESET:
case EOF_AND_RESET_COMPRESSION:
if (beingWritten) {
// just sleep a bit and retry to see if there are new entries
coming since the file is
// still being written
return HasNext.RETRY;
}
// no more entries in this log file, and the file is already closed,
i.e, rolled
// Before dequeuing, we should always get one more attempt at
reading.
// This is in case more entries came in after we opened the reader,
and the log is rolled
// while we were reading. See HBASE-6758
return lastAttempt();
```
If beingWritten == true, we will not call lastAttempt, but when calling
lastAttempt, the file is being written again?
Need to dig more...
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]