Saketa Chalamchala created HDDS-15393:
-----------------------------------------
Summary: Implement DAG Diff with multi-stage sequential read
Key: HDDS-15393
URL: https://issues.apache.org/jira/browse/HDDS-15393
Project: Apache Ozone
Issue Type: Sub-task
Reporter: Saketa Chalamchala
Assignee: Saketa Chalamchala
Implement a multi-stage scan for an optimized DAG based diff:
1. Sequential scan of `toSnapshot` diff SSTs via K-way merge reads key entries
into a `newList` and track the diff candidates.
2. Full table scan of `toSnapshot.directoryTable` to build intermediate column
families used for `toSnapshot` key's path resolution.
3. Full table scan of `fromSnapshot.directoryTable` to build intermediate
column families used for `fromSnapshot` key's path resolution and `oldList`
population.
4. Batch point lookups (`multiGet`) of `fromSnapshot.fileTable` for diff
candidates and read then into `oldList`.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]