[
https://issues.apache.org/jira/browse/STORM-3583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ethan Li updated STORM-3583:
----------------------------
Description:
Supervisor relies on AsyncLocalizer to download blobs from blob store.
AsyncLocalizer uses downloadService pool to process CompletableFuture objects
in parallel.
We have noticed a case that while the downloading task is waiting for a thread
to execute, new assignment changes will try to release the slot by dereferences
all of the related local resources.
However, reading local resources assumes two base blob downloading task have
been completed which is not always true.
was:
The supervisor should not restart if blobstore does not have a file it is
trying to localize. Supervisor should simply assume that the blob will be
available soon but no need for daemon to keel over.
> Localizer should not cause supervisor restart on FileNotFoundException
> ----------------------------------------------------------------------
>
> Key: STORM-3583
> URL: https://issues.apache.org/jira/browse/STORM-3583
> Project: Apache Storm
> Issue Type: Bug
> Components: storm-server
> Reporter: Kishor Patil
> Assignee: Rui Li
> Priority: Major
> Labels: pull-request-available
> Fix For: 2.2.0
>
> Time Spent: 4h 50m
> Remaining Estimate: 0h
>
> Supervisor relies on AsyncLocalizer to download blobs from blob store.
> AsyncLocalizer uses downloadService pool to process CompletableFuture objects
> in parallel.
> We have noticed a case that while the downloading task is waiting for a
> thread to execute, new assignment changes will try to release the slot by
> dereferences all of the related local resources.
> However, reading local resources assumes two base blob downloading task have
> been completed which is not always true.
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)