[ https://issues.apache.org/jira/browse/FLINK-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16410753#comment-16410753 ]

ASF GitHub Bot commented on FLINK-9060:
---------------------------------------

GitHub user sihuazhou opened a pull request:

    https://github.com/apache/flink/pull/5751

    [FLINK-9060][state] Deleting state using KeyedStateBackend.getKeys() throws Exception

    ## What is the purpose of the change
    
    This PR fixes the exception that is thrown when state is deleted while iterating over the keys returned by `KeyedStateBackend.getKeys()`.
    
    ## Brief change log
    
      - copy the result of `getKeys()` into a list to avoid the concurrent-modification problem.
    
    ## Verifying this change
    
      - *add a unit test in `StateBackendTest#testConcurrentModificationWithGetKeys()` to verify this*
    
    ## Does this pull request potentially affect one of the following parts:
    
      - Dependencies (does it add or upgrade a dependency): (no)
      - The public API, i.e., is any changed class annotated with `@Public(Evolving)`: (no)
      - The serializers: (no)
      - The runtime per-record code paths (performance sensitive): (no)
      - Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Yarn/Mesos, ZooKeeper: (no)
      - The S3 file system connector: (no)
    
    ## Documentation
     no


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/sihuazhou/flink deletingStateUsingKeyedStateBackendGetKeys

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/flink/pull/5751.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #5751
    
----
commit cba3a32f3af16ee92676b1e5b82b21af6fee610d
Author: sihuazhou <summerleafs@...>
Date:   2018-03-23T03:20:42Z

    fix concurrency risk in HeapKeyedStateBackend#getKeys().

----


> Deleting state using KeyedStateBackend.getKeys() throws Exception
> -----------------------------------------------------------------
>
>                 Key: FLINK-9060
>                 URL: https://issues.apache.org/jira/browse/FLINK-9060
>             Project: Flink
>          Issue Type: Bug
>          Components: State Backends, Checkpointing
>            Reporter: Aljoscha Krettek
>            Assignee: Sihua Zhou
>            Priority: Blocker
>             Fix For: 1.5.0
>
>
> Adding this test to {{StateBackendTestBase}} showcases the problem:
> {code}
> @Test
> public void testConcurrentModificationWithGetKeys() throws Exception {
>       AbstractKeyedStateBackend<Integer> backend = createKeyedBackend(IntSerializer.INSTANCE);
>       try {
>               ListStateDescriptor<String> listStateDescriptor =
>                       new ListStateDescriptor<>("foo", StringSerializer.INSTANCE);
>               backend.setCurrentKey(1);
>               backend
>                       .getPartitionedState(VoidNamespace.INSTANCE, VoidNamespaceSerializer.INSTANCE, listStateDescriptor)
>                       .add("Hello");
>               backend.setCurrentKey(2);
>               backend
>                       .getPartitionedState(VoidNamespace.INSTANCE, VoidNamespaceSerializer.INSTANCE, listStateDescriptor)
>                       .add("Ciao");
>               Stream<Integer> keys = backend
>                       .getKeys(listStateDescriptor.getName(), VoidNamespace.INSTANCE);
>               keys.forEach((key) -> {
>                       backend.setCurrentKey(key);
>                       try {
>                               backend
>                                       .getPartitionedState(
>                                               VoidNamespace.INSTANCE,
>                                               VoidNamespaceSerializer.INSTANCE,
>                                               listStateDescriptor)
>                                       .clear();
>                       } catch (Exception e) {
>                               e.printStackTrace();
>                       }
>               });
>       }
>       finally {
>               IOUtils.closeQuietly(backend);
>               backend.dispose();
>       }
> }
> {code}
> This should work because one of the use cases of {{getKeys()}} and {{applyToAllKeys()}} is to perform an operation for every key, which includes deleting the state for that key.
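
The failure mode in the test above, and the fix named in the change log (copying the keys before handing them out), can be illustrated without Flink. What follows is only a minimal sketch, assuming a plain java.util.HashMap as a stand-in for the heap state table. GetKeysSnapshotSketch, newState, liveKeys, snapshotKeys and clearAllThrows are hypothetical names; they are not part of the Flink API or of this pull request.

```java
import java.util.ArrayList;
import java.util.ConcurrentModificationException;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Stream;

public class GetKeysSnapshotSketch {

    /** Hypothetical stand-in for keyed state: key -> value. */
    public static Map<Integer, String> newState() {
        Map<Integer, String> state = new HashMap<>();
        state.put(1, "Hello");
        state.put(2, "Ciao");
        return state;
    }

    /** Streams the keys directly from the live map (the problematic variant). */
    public static Stream<Integer> liveKeys(Map<Integer, String> state) {
        return state.keySet().stream();
    }

    /** Copies the keys into a list first, so the stream no longer reads the map. */
    public static Stream<Integer> snapshotKeys(Map<Integer, String> state) {
        List<Integer> copy = new ArrayList<>(state.keySet());
        return copy.stream();
    }

    /** Removes every key seen in the stream; reports whether that failed. */
    public static boolean clearAllThrows(Map<Integer, String> state, Stream<Integer> keys) {
        try {
            keys.forEach(state::remove);
            return false;
        } catch (ConcurrentModificationException e) {
            // The map was structurally modified while its own keys were traversed.
            return true;
        }
    }

    public static void main(String[] args) {
        Map<Integer, String> live = newState();
        System.out.println("live view throws: " + clearAllThrows(live, liveKeys(live)));

        Map<Integer, String> copied = newState();
        System.out.println("snapshot throws: " + clearAllThrows(copied, snapshotKeys(copied)));
    }
}
```

The snapshot costs one extra copy of the key set per call. In exchange, callers can modify or delete state while they iterate over the keys.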



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
