Duo Zhang created HBASE-27162:
---------------------------------

             Summary: Could get malformed WAL PB messge when replication if WAL 
value compression is enabled
                 Key: HBASE-27162
                 URL: https://issues.apache.org/jira/browse/HBASE-27162
             Project: HBase
          Issue Type: Bug
          Components: Replication, wal
            Reporter: Duo Zhang


https://nightlies.apache.org/hbase/HBase-Flaky-Tests/master/3752/hbase-server/target/surefire-reports/org.apache.hadoop.hbase.replication.regionserver.TestReplicationValueCompressedWAL-output.txt

{noformat}
2022-06-26T08:41:27,359 DEBUG 
[RS_REFRESH_PEER-regionserver/jenkins-hbase19:0-0.replicationSource,2.replicationSource.wal-reader.jenkins-hbase19.apache.org%2C44659%2C1656232862372,2]
 wal.ProtobufLogReader(448): Encountered a malformed edit, seeking back to last 
good position in file, from 65537 to 65536
java.io.EOFException: Partial PB while reading WAL, probably an unexpected EOF, 
ignoring. current offset=65537
        at 
org.apache.hadoop.hbase.regionserver.wal.ProtobufLogReader.readNext(ProtobufLogReader.java:390)
 ~[classes/:?]
        at 
org.apache.hadoop.hbase.regionserver.wal.ReaderBase.next(ReaderBase.java:104) 
~[classes/:?]
        at 
org.apache.hadoop.hbase.regionserver.wal.ReaderBase.next(ReaderBase.java:92) 
~[classes/:?]
        at 
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.readNextEntryAndRecordReaderPosition(WALEntryStream.java:264)
 ~[classes/:?]
        at 
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.tryAdvanceEntry(WALEntryStream.java:178)
 ~[classes/:?]
        at 
org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.hasNext(WALEntryStream.java:103)
 ~[classes/:?]
        at 
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceWALReader.readWALEntries(ReplicationSourceWALReader.java:212)
 ~[classes/:?]
        at 
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceWALReader.run(ReplicationSourceWALReader.java:148)
 ~[classes/:?]
{noformat}

Actually, we should still have a lot of edits after this point. This could  be 
a critical problem as we will hang replication forever...



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to