[ 
https://issues.apache.org/jira/browse/FLINK-21121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315298#comment-17315298
 ] 

binguo commented on FLINK-21121:
--------------------------------

Thank you for your reply. If I understand correctly, the state-processor-api 
dependency must be added to the cluster, and this problem occurs when it is 
only compiled in with Maven. [~tomas.witzany]
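
One quick way to check whether the class named in the exception is visible to a 
given JVM is to try loading it by name. The sketch below is only an 
illustration: the class `ClasspathCheck` and the helper `isOnClasspath` are 
hypothetical names, and the check uses the classloader of the JVM it runs in, 
which may differ from the user-code classloader Flink uses on the cluster.

```java
final class ClasspathCheck {

    // Hypothetical helper: returns true if the named class can be loaded
    // (without initializing it) by this class's own classloader.
    static boolean isOnClasspath(String className) {
        try {
            Class.forName(className, false, ClasspathCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Class name taken from the exception message in this issue.
        String name = "org.apache.flink.state.api.output.TaggedOperatorSubtaskState";
        System.out.println(name + " loadable: " + isOnClasspath(name));
    }
}
```

Running this with and without the state processor jar on the classpath should 
show whether the class is reachable in that environment.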

> TaggedOperatorSubtaskState is missing when creating a new savepoint using 
> state processor api
> ---------------------------------------------------------------------------------------------
>
>                 Key: FLINK-21121
>                 URL: https://issues.apache.org/jira/browse/FLINK-21121
>             Project: Flink
>          Issue Type: Bug
>          Components: API / State Processor
>    Affects Versions: 1.11.0
>            Reporter: binguo
>            Priority: Major
>
> I am getting the following exception when using the Flink State Processor 
> API to write a new savepoint:
> {code:java}
> java.lang.Exception: Exception while creating StreamOperatorStateContext.
>     at org.apache.flink.streaming.api.operators.StreamTaskStateInitializerImpl.streamOperatorStateContext(StreamTaskStateInitializerImpl.java:204)
>     at org.apache.flink.streaming.api.operators.AbstractStreamOperator.initializeState(AbstractStreamOperator.java:247)
>     at org.apache.flink.streaming.runtime.tasks.OperatorChain.initializeStateAndOpenOperators(OperatorChain.java:290)
>     at org.apache.flink.streaming.runtime.tasks.StreamTask.lambda$beforeInvoke$0(StreamTask.java:473)
>     at org.apache.flink.streaming.runtime.tasks.StreamTaskActionExecutor$SynchronizedStreamTaskActionExecutor.runThrowing(StreamTaskActionExecutor.java:92)
>     at org.apache.flink.streaming.runtime.tasks.StreamTask.beforeInvoke(StreamTask.java:469)
>     at org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:522)
>     at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:721)
>     at org.apache.flink.runtime.taskmanager.Task.run(Task.java:546)
>     at java.lang.Thread.run(Thread.java:748)
> Caused by: org.apache.flink.util.FlinkException: Could not restore operator state backend for StreamSource_e8ea6e352a1a627513ffbd4573fa1628_(1/1) from any of the 1 provided restore options.
>     at org.apache.flink.streaming.api.operators.BackendRestorerProcedure.createAndRestore(BackendRestorerProcedure.java:135)
>     at org.apache.flink.streaming.api.operators.StreamTaskStateInitializerImpl.operatorStateBackend(StreamTaskStateInitializerImpl.java:265)
>     at org.apache.flink.streaming.api.operators.StreamTaskStateInitializerImpl.streamOperatorStateContext(StreamTaskStateInitializerImpl.java:152)
>     ... 9 more
> Caused by: org.apache.flink.runtime.state.BackendBuildingException: Failed when trying to restore operator state backend
>     at org.apache.flink.runtime.state.DefaultOperatorStateBackendBuilder.build(DefaultOperatorStateBackendBuilder.java:86)
>     at org.apache.flink.contrib.streaming.state.RocksDBStateBackend.createOperatorStateBackend(RocksDBStateBackend.java:552)
>     at org.apache.flink.streaming.api.operators.StreamTaskStateInitializerImpl.lambda$operatorStateBackend$0(StreamTaskStateInitializerImpl.java:256)
>     at org.apache.flink.streaming.api.operators.BackendRestorerProcedure.attemptCreateAndRestore(BackendRestorerProcedure.java:142)
>     at org.apache.flink.streaming.api.operators.BackendRestorerProcedure.createAndRestore(BackendRestorerProcedure.java:121)
>     ... 11 more
> Caused by: java.lang.IllegalStateException: Missing value for the key 'org.apache.flink.state.api.output.TaggedOperatorSubtaskState'
>     at org.apache.flink.util.LinkedOptionalMap.unwrapOptionals(LinkedOptionalMap.java:190)
>     at org.apache.flink.api.java.typeutils.runtime.kryo.KryoSerializerSnapshot.restoreSerializer(KryoSerializerSnapshot.java:86)
>     at java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:193)
>     at java.util.Spliterators$ArraySpliterator.forEachRemaining(Spliterators.java:948)
>     at java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:481)
>     at java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:471)
>     at java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:545)
>     at java.util.stream.AbstractPipeline.evaluateToArrayNode(AbstractPipeline.java:260)
>     at java.util.stream.ReferencePipeline.toArray(ReferencePipeline.java:438)
>     at org.apache.flink.api.common.typeutils.NestedSerializersSnapshotDelegate.snapshotsToRestoreSerializers(NestedSerializersSnapshotDelegate.java:225)
>     at org.apache.flink.api.common.typeutils.NestedSerializersSnapshotDelegate.getRestoredNestedSerializers(NestedSerializersSnapshotDelegate.java:83)
>     at org.apache.flink.api.common.typeutils.CompositeTypeSerializerSnapshot.restoreSerializer(CompositeTypeSerializerSnapshot.java:204)
>     at org.apache.flink.runtime.state.StateSerializerProvider.previousSchemaSerializer(StateSerializerProvider.java:189)
>     at org.apache.flink.runtime.state.StateSerializerProvider.currentSchemaSerializer(StateSerializerProvider.java:164)
>     at org.apache.flink.runtime.state.RegisteredOperatorStateBackendMetaInfo.getPartitionStateSerializer(RegisteredOperatorStateBackendMetaInfo.java:113)
>     at org.apache.flink.runtime.state.OperatorStateRestoreOperation.restore(OperatorStateRestoreOperation.java:94)
>     at org.apache.flink.runtime.state.DefaultOperatorStateBackendBuilder.build(DefaultOperatorStateBackendBuilder.java:83)
>     ... 15 more
> {code}
> My Java code:
> {code:java}
> @Override
> public void createNewSavepoint(ExecutionEnvironment env, String savepointPath,
>         StateBackend stateBackend, ParameterTool config) {
>     String savepointOutputPath = config.get(EapSavepointConstants.EAP_SAVEPOINT_OUTPUT_PATH);
>     int maxParallelism = config.getInt(EapSavepointConstants.EAP_SAVEPOINT_MAX_PARALLELISM);
>     Long windowTimeSize = config.getLong(EapSavepointConstants.WINDOW_TIME_SIZE);
>     TumblingProcessingTimeWindows processTimeWindows =
>             TumblingProcessingTimeWindows.of(Time.seconds(windowTimeSize));
>     try {
>         ExistingSavepoint existingSavepoint = Savepoint.load(env, savepointPath, stateBackend);
>         // Read the Kafka source's union list state (topic partition to offset tuples).
>         DataSet<Tuple2<KafkaTopicPartition, Long>> kafkaListState = existingSavepoint.readUnionState(
>                 OperatorUidAndNameConstants.KAFKA_SOURCE_UID,
>                 StateNameConstants.KAFKA_OFFSET_STATE_NAME,
>                 KafkaStateUtils.createTypeInformation(),
>                 KafkaStateUtils.createStateDescriptorSerializer(env.getConfig()));
>         logger.info("Print kafka offset");
>         kafkaListState.print();
>         // Note: kafkaTransformation is not defined in this snippet.
>         Savepoint.create(stateBackend, maxParallelism)
>                 .withOperator(OperatorUidAndNameConstants.KAFKA_SOURCE_UID, kafkaTransformation)
>                 .write(savepointOutputPath);
>     } catch (IOException e) {
>         logger.error("Savepoint load: " + e.getMessage());
>         e.printStackTrace();
>     } catch (Exception e) {
>         logger.error("print state: " + e.getMessage());
>         e.printStackTrace();
>     }
> }
>
> // KafkaStateUtils.java
> public class KafkaStateUtils {
>     /**
>      * Creates the state serializer for the Kafka topic partition to offset tuple.
>      * An explicit state serializer with KryoSerializer is needed because otherwise
>      * users cannot use the 'disableGenericTypes' property with KafkaConsumer.
>      *
>      * @param executionConfig execution config passed to the KryoSerializer
>      * @return tuple serializer for (KafkaTopicPartition, Long)
>      */
>     public static TupleSerializer<Tuple2<KafkaTopicPartition, Long>> createStateDescriptorSerializer(
>             ExecutionConfig executionConfig) {
>         // The explicit serializer keeps compatibility with GenericTypeInformation
>         // and allows users to call disableGenericTypes.
>         TypeSerializer<?>[] fieldSerializers = new TypeSerializer<?>[]{
>                 new KryoSerializer<>(KafkaTopicPartition.class, executionConfig),
>                 LongSerializer.INSTANCE
>         };
>         @SuppressWarnings("unchecked")
>         Class<Tuple2<KafkaTopicPartition, Long>> tupleClass =
>                 (Class<Tuple2<KafkaTopicPartition, Long>>) (Class<?>) Tuple2.class;
>         return new TupleSerializer<>(tupleClass, fieldSerializers);
>     }
>
>     public static TypeInformation<Tuple2<KafkaTopicPartition, Long>> createTypeInformation() {
>         return TypeInformation.of(new TypeHint<Tuple2<KafkaTopicPartition, Long>>() {});
>     }
> }
> {code}
> After remote debugging, I found that the value for 
> `org.apache.flink.state.api.output.TaggedOperatorSubtaskState` could not be 
> parsed in `org.apache.flink.util.LinkedOptionalMapSerializer#readOptionalMap`.
>
> Personally, I think that `TaggedOperatorSubtaskState` should implement 
> `CompositeStateHandle`. Please give some suggestions, thank you.
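
The `Missing value for the key` error in the trace is consistent with a class 
name that was recorded in the savepoint's serializer snapshot but cannot be 
resolved when the snapshot is read back. The following is a simplified, 
self-contained model of that behaviour, not Flink's actual `LinkedOptionalMap` 
code; the class `OptionalMapSketch` is hypothetical and only borrows the method 
names from the trace for readability.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;

final class OptionalMapSketch {

    // Simplified model: resolve each recorded class name. A name that cannot
    // be loaded is kept as a key with an empty value instead of failing here.
    static Map<String, Optional<Class<?>>> readOptionalMap(String... classNames) {
        Map<String, Optional<Class<?>>> result = new LinkedHashMap<>();
        for (String name : classNames) {
            Optional<Class<?>> value;
            try {
                Class<?> loaded = Class.forName(name);
                value = Optional.of(loaded);
            } catch (ClassNotFoundException e) {
                value = Optional.empty();
            }
            result.put(name, value);
        }
        return result;
    }

    // Simplified model: the failure only surfaces when every value is required.
    static Map<String, Class<?>> unwrapOptionals(Map<String, Optional<Class<?>>> map) {
        Map<String, Class<?>> unwrapped = new LinkedHashMap<>();
        for (Map.Entry<String, Optional<Class<?>>> entry : map.entrySet()) {
            Class<?> value = entry.getValue().orElseThrow(() ->
                    new IllegalStateException(
                            "Missing value for the key '" + entry.getKey() + "'"));
            unwrapped.put(entry.getKey(), value);
        }
        return unwrapped;
    }

    public static void main(String[] args) {
        Map<String, Optional<Class<?>>> read = readOptionalMap(
                "java.lang.String",
                "org.apache.flink.state.api.output.TaggedOperatorSubtaskState");
        try {
            System.out.println("Resolved: " + unwrapOptionals(read).keySet());
        } catch (IllegalStateException e) {
            // Reached when the Flink class is not on this JVM's classpath.
            System.out.println(e.getMessage());
        }
    }
}
```

In this model the read step tolerates an unresolvable class and the error only 
appears later, at the point where all values are required, which matches the 
position of `unwrapOptionals` in the trace.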



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
