Joey Pereira created FLINK-17971:
------------------------------------
Summary: Speed up RocksDB bulk loading with SST generation and
ingestion
Key: FLINK-17971
URL: https://issues.apache.org/jira/browse/FLINK-17971
Project: Flink
Issue Type: Improvement
Components: Runtime / State Backends
Reporter: Joey Pereira
RocksDB provides an API for creating SST files and ingesting them directly into
RocksDB:
[https://github.com/facebook/rocksdb/wiki/Creating-and-Ingesting-SST-files]
Using this method for bulk loading data into RocksDB may provide a significant
performance increase, specifically for paths doing inserts such as full
savepoint recovery and state migrations. This is one method of optimizing bulk
loads, as described in https://issues.apache.org/jira/browse/FLINK-17288
This was discussed on the user maillist:
[http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/RocksDB-savepoint-recovery-performance-improvements-td35238.html]
A draft PR is here: [https://github.com/apache/flink/pull/12345/]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)