Sergey Menshikov created KAFKA-9598: ---------------------------------------
Summary: RocksDB exception when grouping dynamically appearing topics into a KTable Key: KAFKA-9598 URL: https://issues.apache.org/jira/browse/KAFKA-9598 Project: Kafka Issue Type: Bug Components: streams Affects Versions: 2.4.0, 2.2.0 Reporter: Sergey Menshikov Attachments: exception-details.txt A streams application, consumer a number of topics via a whitelisted regex. The topics appear dynamically, generated from dynamically appearing MongoDB collections by debezium MongoDB source driver. The development is running on debezium docker images (Debezium 0.9 and Debezium 1.0 -> Kafka 2.2.0 and 2.4.0), single instance of Kafka, Connect and the streams consumer app. As the MongoDB driver provides only deltas of the changes, to collect full record for each key, the code creates KTable which is then transformed into a KStream for further joining with other KTables and Global KTables. The following piece of code results in the exception when a new topic is added: {code:java} Pattern tResultPattern = Pattern.compile(config.getProperty("mongodb_source_prefix")+".tr[0-9a-fA-F]{32}"); KStream<String, JsonNode> tResultsTempStream = builder.stream(tResultPattern, Consumed.with(stringSerde, jsonSerde)); KTable<String, JsonNode> tResultsTempTable = tResultsTempStream.groupByKey(Grouped.with(stringSerde,jsonSerde)) .reduce((aggValue, newValue) -> mergeNodes(aggValue,newValue)); // mergeNodes is a Json traverse/merger procedure KStream<String, JsonNode> tResults = tResultsTempTable.toStream(); {code} kconsumer_1 | Exception in thread "split-reader-client3-StreamThread-1" org.apache.kafka.streams.errors.ProcessorStateException: Error opening store KSTREAM-REDUCE-STATE-STORE-0000000032 at location /tmp/split-reader3/10_0/rocksdb/KSTREAM-REDUCE-STATE-STORE-0000000032 ... kconsumer_1 | Caused by: org.rocksdb.RocksDBException: lock : /tmp/split-reader3/10_0/rocksdb/KSTREAM-REDUCE-STATE-STORE-0000000032/LOCK: No locks available Kstore 10_0 contains tr[0-9a-fA-F]\{32} records, I checked. more details about exception are in the attached file. The exception is no longer present when I use an intermediate topic instead: {code:java} Pattern tResultPattern = Pattern.compile(config.getProperty("mongodb_source_prefix")+".tr[0-9a-fA-F]{32}");{code} {code:java} KStream<String, JsonNode> tResultsTempStream = builder.stream(tResultPattern, Consumed.with(stringSerde, jsonSerde)); tResultsTempStream.transform(trTransformer::new).to(config.getProperty("tr_intermediate_topic_name"),Produced.with(stringSerde, jsonSerde)); // trTransformer adds topic name into value Json, in previous snippet it was done in the pipeline after grouping/streaming KStream<String, JsonNode> tResultsTempStream2 = builder.stream(config.getProperty("tr_intermediate_topic_name"), Consumed.with(stringSerde, jsonSerde)); KTable<String, JsonNode> tResultsTempTable = tResultsTempStream2.groupByKey(Grouped.with(stringSerde,jsonSerde)) .reduce((aggValue, newValue) -> mergeNodes(aggValue,newValue)); KStream<String, JsonNode> tResults = tResultsTempTable.toStream(); {code} If making KTable from multiple whitelisted topics is something that is outside of scope of Kafka Streams, perhaps it would make sense to mention it in the docs. -- This message was sent by Atlassian Jira (v8.3.4#803005)