Hernan Gelaf-Romer created HBASE-29744:
------------------------------------------
Summary: Incremental backups fail with FileNotFoundException when
trying to process WAL files from RegionServers that were added to the cluster
after the last successful backup.
Key: HBASE-29744
URL: https://issues.apache.org/jira/browse/HBASE-29744
Project: HBase
Issue Type: Bug
Components: backup&restore
Reporter: Hernan Gelaf-Romer
Incremental backups can fail with a FileNotFoundException when trying to
process Write-Ahead Log (WAL) files from RegionServers that were added to the
cluster after the last successful backup.
The issue occurs in BackupLogCleaner.canDeleteFile(), which checks timestamp
boundaries (stored in the backup system table) to determine if WAL files are
safe to delete. When no boundary exists for a RegionServer address, the cleaner
incorrectly assumes that the server's WALs are safe to delete and returns true.
This situation arises when a new RegionServer is added between backups. The new
server generates WAL files for tables, but because no backup has completed
since the server joined, no timestamp boundary has been recorded for it. As a
result, the cleaner may delete these WAL files before the next incremental
backup can process them, and that backup then fails with a
FileNotFoundException.
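
The decision described above can be illustrated with a minimal, self-contained
sketch. This is not the actual BackupLogCleaner code: the class name, the
method names, the use of a plain Map in place of the backup system table, and
the "timestamp <= boundary" comparison are all assumptions made for
illustration. It only contrasts the reported behavior (a missing boundary means
"delete") with a more conservative alternative (a missing boundary means
"keep"), which is one possible direction rather than a confirmed fix.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration only; not HBase source code.
public class WalCleanerSketch {

    // Mirrors the reported behavior: if no boundary is recorded for the
    // server, the WAL is treated as deletable.
    static boolean canDeleteReported(Map<String, Long> boundaries,
                                     String serverAddress,
                                     long walTimestamp) {
        Long boundary = boundaries.get(serverAddress);
        if (boundary == null) {
            return true; // new RegionServer: WAL is deleted before any backup sees it
        }
        // Assumed rule: WALs at or before the boundary were already backed up.
        return walTimestamp <= boundary;
    }

    // Conservative alternative: an unknown server's WALs are kept until a
    // backup records a boundary for that server.
    static boolean canDeleteConservative(Map<String, Long> boundaries,
                                         String serverAddress,
                                         long walTimestamp) {
        Long boundary = boundaries.get(serverAddress);
        if (boundary == null) {
            return false; // no boundary yet: retain the WAL
        }
        return walTimestamp <= boundary;
    }

    public static void main(String[] args) {
        // Boundaries as of the last successful backup (hypothetical values).
        Map<String, Long> boundaries = new HashMap<>();
        boundaries.put("rs1.example.com:16020", 1_000L);

        // A RegionServer added after that backup has no entry.
        String newServer = "rs2.example.com:16020";

        System.out.println(canDeleteReported(boundaries, newServer, 1_500L));     // prints "true"
        System.out.println(canDeleteConservative(boundaries, newServer, 1_500L)); // prints "false"
    }
}
```

In the sketch, both variants behave identically for a server that already has
a boundary; they differ only for a server the last backup never saw, which is
the case this issue describes.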