Apache9 commented on PR #5443:
URL: https://github.com/apache/hbase/pull/5443#issuecomment-1751987507

   ```
   2023-10-08T06:42:20,756 ERROR 
[RS:0;c85796445aa7:36275.replicationSource.wal-reader.c85796445aa7%2C36275%2C1696747321009-1696747340579-1,1-c85796445aa7,36275,1696747321009
 {}] regionserver.ReplicationSource(452): Unexpected exception in 
RS:0;c85796445aa7:36275.replicationSource.wal-reader.c85796445aa7%2C36275%2C1696747321009-1696747340579-1,1-c85796445aa7,36275,1696747321009
 
currentPath=hdfs://localhost:32875/user/jenkins/test-data/daf7aa3e-9117-b174-d093-9e8787b5dfb7/WALs/c85796445aa7,36275,1696747321009/c85796445aa7%2C36275%2C1696747321009-1696747340579-1.1696747340583.syncrep
   java.lang.AssertionError: null
        at 
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.lastAttempt(WALEntryStream.java:303)
 ~[classes/:?]
        at 
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.tryAdvanceEntry(WALEntryStream.java:369)
 ~[classes/:?]
        at 
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.hasNext(WALEntryStream.java:129)
 ~[classes/:?]
        at 
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceWALReader.run(ReplicationSourceWALReader.java:147)
 ~[classes/:?]
   2023-10-08T06:42:20,757 ERROR 
[RS:0;c85796445aa7:36275.replicationSource.wal-reader.c85796445aa7%2C36275%2C1696747321009-1696747340579-1,1-c85796445aa7,36275,1696747321009
 {}] regionserver.HRegionServer(2389): ***** ABORTING region server 
c85796445aa7,36275,1696747321009: Unexpected exception in 
RS:0;c85796445aa7:36275.replicationSource.wal-reader.c85796445aa7%2C36275%2C1696747321009-1696747340579-1,1-c85796445aa7,36275,1696747321009
 *****
   java.lang.AssertionError: null
        at 
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.lastAttempt(WALEntryStream.java:303)
 ~[classes/:?]
        at 
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.tryAdvanceEntry(WALEntryStream.java:369)
 ~[classes/:?]
        at 
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.hasNext(WALEntryStream.java:129)
 ~[classes/:?]
        at 
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceWALReader.run(ReplicationSourceWALReader.java:147)
 ~[classes/:?]
   ```
   
   Really strange, it fails in assertion in lastAttemp, where we assume that 
beingWritten must be false...
   
   ```
         case EOF_AND_RESET:
         case EOF_AND_RESET_COMPRESSION:
           if (beingWritten) {
             // just sleep a bit and retry to see if there are new entries 
coming since the file is
             // still being written
             return HasNext.RETRY;
           }
           // no more entries in this log file, and the file is already closed, 
i.e, rolled
           // Before dequeuing, we should always get one more attempt at 
reading.
           // This is in case more entries came in after we opened the reader, 
and the log is rolled
           // while we were reading. See HBASE-6758
           return lastAttempt();
   ```
   
   If beingWritten == true, we will not call lastAttempt, but when calling 
lastAttempt, the file is being written again?
   
   Need to dig more...


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to