[
https://issues.apache.org/jira/browse/HBASE-27162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17558960#comment-17558960
]
Duo Zhang commented on HBASE-27162:
-----------------------------------
[~apurtell] FYI.
> Could get malformed WAL PB messge when replication if WAL value compression
> is enabled
> --------------------------------------------------------------------------------------
>
> Key: HBASE-27162
> URL: https://issues.apache.org/jira/browse/HBASE-27162
> Project: HBase
> Issue Type: Bug
> Components: Replication, wal
> Reporter: Duo Zhang
> Priority: Major
>
> https://nightlies.apache.org/hbase/HBase-Flaky-Tests/master/3752/hbase-server/target/surefire-reports/org.apache.hadoop.hbase.replication.regionserver.TestReplicationValueCompressedWAL-output.txt
> {noformat}
> 2022-06-26T08:41:27,359 DEBUG
> [RS_REFRESH_PEER-regionserver/jenkins-hbase19:0-0.replicationSource,2.replicationSource.wal-reader.jenkins-hbase19.apache.org%2C44659%2C1656232862372,2]
> wal.ProtobufLogReader(448): Encountered a malformed edit, seeking back to
> last good position in file, from 65537 to 65536
> java.io.EOFException: Partial PB while reading WAL, probably an unexpected
> EOF, ignoring. current offset=65537
> at
> org.apache.hadoop.hbase.regionserver.wal.ProtobufLogReader.readNext(ProtobufLogReader.java:390)
> ~[classes/:?]
> at
> org.apache.hadoop.hbase.regionserver.wal.ReaderBase.next(ReaderBase.java:104)
> ~[classes/:?]
> at
> org.apache.hadoop.hbase.regionserver.wal.ReaderBase.next(ReaderBase.java:92)
> ~[classes/:?]
> at
> org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.readNextEntryAndRecordReaderPosition(WALEntryStream.java:264)
> ~[classes/:?]
> at
> org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.tryAdvanceEntry(WALEntryStream.java:178)
> ~[classes/:?]
> at
> org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.hasNext(WALEntryStream.java:103)
> ~[classes/:?]
> at
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceWALReader.readWALEntries(ReplicationSourceWALReader.java:212)
> ~[classes/:?]
> at
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceWALReader.run(ReplicationSourceWALReader.java:148)
> ~[classes/:?]
> {noformat}
> Actually, we should still have a lot of edits after this point. This could
> be a critical problem as we will hang replication forever...
--
This message was sent by Atlassian Jira
(v8.20.7#820007)