Nicolas Carlot created KAFKA-9880:
-------------------------------------

             Summary: Error while range compacting during bulk loading of FIFO 
compacted RocksDB Store
                 Key: KAFKA-9880
                 URL: https://issues.apache.org/jira/browse/KAFKA-9880
             Project: Kafka
          Issue Type: Bug
          Components: streams
    Affects Versions: 2.4.1
            Reporter: Nicolas Carlot


When restoring a non-empty RocksDB state store that is customized to use 
FIFO compaction, the following exception is thrown:

 
{code:java}
exception thrown by the KStream process is:
org.apache.kafka.streams.errors.ProcessorStateException: Error while range compacting during restoring  store merge_store
        at org.apache.kafka.streams.state.internals.RocksDBStore$SingleColumnFamilyAccessor.toggleDbForBulkLoading(RocksDBStore.java:615) ~[kafka-stream-router.jar:?]
        at org.apache.kafka.streams.state.internals.RocksDBStore.toggleDbForBulkLoading(RocksDBStore.java:398) ~[kafka-stream-router.jar:?]
        at org.apache.kafka.streams.state.internals.RocksDBStore$RocksDBBatchingRestoreCallback.onRestoreStart(RocksDBStore.java:644) ~[kafka-stream-router.jar:?]
        at org.apache.kafka.streams.processor.internals.CompositeRestoreListener.onRestoreStart(CompositeRestoreListener.java:59) ~[kafka-stream-router.jar:?]
        at org.apache.kafka.streams.processor.internals.StateRestorer.restoreStarted(StateRestorer.java:76) ~[kafka-stream-router.jar:?]
        at org.apache.kafka.streams.processor.internals.StoreChangelogReader.startRestoration(StoreChangelogReader.java:211) ~[kafka-stream-router.jar:?]
        at org.apache.kafka.streams.processor.internals.StoreChangelogReader.initialize(StoreChangelogReader.java:185) ~[kafka-stream-router.jar:?]
        at org.apache.kafka.streams.processor.internals.StoreChangelogReader.restore(StoreChangelogReader.java:81) ~[kafka-stream-router.jar:?]
        at org.apache.kafka.streams.processor.internals.TaskManager.updateNewAndRestoringTasks(TaskManager.java:389) ~[kafka-stream-router.jar:?]
        at org.apache.kafka.streams.processor.internals.StreamThread.runOnce(StreamThread.java:769) ~[kafka-stream-router.jar:?]
        at org.apache.kafka.streams.processor.internals.StreamThread.runLoop(StreamThread.java:698) ~[kafka-stream-router.jar:?]
        at org.apache.kafka.streams.processor.internals.StreamThread.run(StreamThread.java:671) [kafka-stream-router.jar:?]
Caused by: org.rocksdb.RocksDBException: Target level exceeds number of levels
        at org.rocksdb.RocksDB.compactRange(Native Method) ~[kafka-stream-router.jar:?]
        at org.rocksdb.RocksDB.compactRange(RocksDB.java:2636) ~[kafka-stream-router.jar:?]
        at org.apache.kafka.streams.state.internals.RocksDBStore$SingleColumnFamilyAccessor.toggleDbForBulkLoading(RocksDBStore.java:613) ~[kafka-stream-router.jar:?]
        ... 11 more
{code}
 

 

Compaction is configured through an implementation of RocksDBConfigSetter. The 
exception is gone as soon as I remove the following:

 
{code:java}
CompactionOptionsFIFO fifoOptions = new CompactionOptionsFIFO();
fifoOptions.setMaxTableFilesSize(maxSize);
fifoOptions.setAllowCompaction(true);
options.setCompactionOptionsFIFO(fifoOptions);
options.setCompactionStyle(CompactionStyle.FIFO);
{code}
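
For context, the snippet above lives inside a RocksDBConfigSetter. A minimal sketch of such a setter (the class name, the maxSize constant and the close handling are illustrative, not my exact code) looks like this; it is registered via StreamsConfig.ROCKSDB_CONFIG_SETTER_CLASS_CONFIG:

{code:java}
import java.util.Map;
import org.apache.kafka.streams.state.RocksDBConfigSetter;
import org.rocksdb.CompactionOptionsFIFO;
import org.rocksdb.CompactionStyle;
import org.rocksdb.Options;

public class FifoCompactionConfigSetter implements RocksDBConfigSetter {

    // Illustrative cap on total SST size before FIFO compaction drops the oldest files
    private static final long MAX_TABLE_FILES_SIZE = 512 * 1024 * 1024L;

    private CompactionOptionsFIFO fifoOptions;

    @Override
    public void setConfig(final String storeName, final Options options, final Map<String, Object> configs) {
        fifoOptions = new CompactionOptionsFIFO();
        fifoOptions.setMaxTableFilesSize(MAX_TABLE_FILES_SIZE);
        fifoOptions.setAllowCompaction(true);
        options.setCompactionOptionsFIFO(fifoOptions);
        options.setCompactionStyle(CompactionStyle.FIFO);
    }

    @Override
    public void close(final String storeName, final Options options) {
        // release the native options object allocated in setConfig
        if (fifoOptions != null) {
            fifoOptions.close();
        }
    }
}
{code}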
 

 

Bulk loading works fine when the store is non-existent or empty. The error occurs 
only once the store holds a certain amount of data; my guess is that it happens 
once the number of SST levels has increased.
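
If that guess is right, the failure should be reproducible with plain RocksDB outside of Streams, since the restore path calls the deprecated compactRange overload with changeLevel=true and targetLevel=1 while a FIFO-compacted store only has a single level. A rough standalone sketch (the path and the explicit flush are illustrative, I have not distilled my exact setup down to this):

{code:java}
import org.rocksdb.CompactionStyle;
import org.rocksdb.FlushOptions;
import org.rocksdb.Options;
import org.rocksdb.RocksDB;
import org.rocksdb.RocksDBException;

public class FifoCompactRangeRepro {

    public static void main(final String[] args) throws RocksDBException {
        RocksDB.loadLibrary();
        try (final Options options = new Options()
                .setCreateIfMissing(true)
                .setCompactionStyle(CompactionStyle.FIFO);
             final RocksDB db = RocksDB.open(options, "/tmp/fifo-compact-range-repro")) {
            db.put("key".getBytes(), "value".getBytes());
            // Force at least one SST file so the store is no longer "empty"
            db.flush(new FlushOptions().setWaitForFlush(true));
            // Same deprecated overload the Streams bulk-loading code uses:
            // changeLevel=true, targetLevel=1, targetPathId=0.
            // Expected to fail with: RocksDBException: Target level exceeds number of levels
            db.compactRange(true, 1, 0);
        }
    }
}
{code}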

I'm currently using a forked version of Kafka 2.4.1 that customizes the 
RocksDBStore class with the following modification as a workaround:
{code:java}
@Override
@SuppressWarnings("deprecation")
public void toggleDbForBulkLoading() {
    try {
        db.compactRange(columnFamily, true, 1, 0);
    } catch (final RocksDBException e) {
        try {
            if (columnFamily.getDescriptor().getOptions().compactionStyle() != CompactionStyle.FIFO) {
                throw new ProcessorStateException("Error while range compacting while restoring store " + name, e);
            } else {
                log.warn("Compaction of store " + name + " for bulk loading failed. Will continue without compacted store, which will be slower.", e);
            }
        } catch (final RocksDBException e1) {
            throw new ProcessorStateException("Error while range compacting during restoring store " + name, e);
        }
    }
}
{code}
I'm not very proud of this workaround, but it suits my use cases well.
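
A slightly cleaner shape for the same patch might be to check the compaction style up front and skip the range compaction instead of reacting to the exception. A sketch against the same class (same db, columnFamily, name and log fields as above):

{code:java}
@Override
@SuppressWarnings("deprecation")
public void toggleDbForBulkLoading() {
    try {
        // A FIFO-compacted store uses a single level, so moving files to level 1 can never succeed;
        // skip the range compaction entirely and accept a slower, non-compacted restore.
        if (columnFamily.getDescriptor().getOptions().compactionStyle() == CompactionStyle.FIFO) {
            log.warn("Skipping range compaction of store " + name + " for bulk loading (FIFO compaction).");
            return;
        }
        db.compactRange(columnFamily, true, 1, 0);
    } catch (final RocksDBException e) {
        throw new ProcessorStateException("Error while range compacting during restoring store " + name, e);
    }
}
{code}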

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
