Catlyn Kong created BEAM-8112:
---------------------------------

             Summary: Support passing stateBackend through pipeline options in 
python sdks
                 Key: BEAM-8112
                 URL: https://issues.apache.org/jira/browse/BEAM-8112
             Project: Beam
          Issue Type: New Feature
          Components: runner-flink
            Reporter: Catlyn Kong


Currently the only way for python sdks to instruct flink to use a StateBackend 
different than the default (MemoryStateBackend) would be to specify 
state.backend in flink-conf.yaml, which creates the limitation of using the 
same statebackend for every job running on the same flink cluster. Ideally we 
should be able to pass it in to flink runner through PipelineOptions. Here's 
the error it spits out when I flag  --state_backend=RocksDBStateBackend:

 
{code:java}
RuntimeError: Pipeline failed in state FAILED: 
com.fasterxml.jackson.databind.exc.InvalidDefinitionException: Cannot construct 
instance of `org.apache.flink.runtime.state.StateBackend` (no Creators, like 
default construct, exist): abstract types either need to be mapped to concrete 
types, have custom deserializer, or contain additional type information
 at [Source: (String)""RocksDBStateBackend""; line: 1, column: 1]
{code}
Acceptance Criteria:

Flink StateBackend is configurable via command line options from python.

 



--
This message was sent by Atlassian Jira
(v8.3.2#803003)

Reply via email to