Alexey Serbin created KUDU-3453:
-----------------------------------
Summary: Fine-grained anchoring for WAL segments during tablet
copying
Key: KUDU-3453
URL: https://issues.apache.org/jira/browse/KUDU-3453
Project: Kudu
Issue Type: Improvement
Components: tablet, tserver
Reporter: Alexey Serbin
Tablet copying is a provision to implement the process of automatic tablet
re-replication in Kudu. When the system catalog (Kudu master) detects that a
tablet replica is no longer available, it automatically re-replicates a tablet
to a destination tablet server using another healthy tablet replica in the
cluster as the source.
When copying a tablet from one tablet server to another, the source tablet
copying session "anchors" WAL segments to be transfered to the destination
server, so they are not GC-ed by the tablet maintenance operation when they are
no longer needed locally, but the tablet copy session is still in progress.
The anchored WAL segments are releases all at once when the tablet copying
session completes with success of failure. However, there might be long
running tablet copying sessions, and with high data ingest rate, the source
tablet replica might accumulate huge amount of WAL data which isn't relevant at
both the source and the destination server.
To prevent accumulation of WAL data for long-running tablet copying sessions,
it's necessary to update the WAL anchors in a more granular manner, e.g.
un-anchor a segment once it has been successfully copied and persisted by the
client tablet copying session.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)