GitHub user tdas opened a pull request:
https://github.com/apache/spark/pull/6559
[SPARK-7958] Handled exception in StreamingContext.start() to prevent
leaking of actors
StreamingContext.start() can throw an exception if
DStream.validateAtStart() fails (say, the checkpoint directory is not set for
StateDStream). By then, JobScheduler, JobGenerator, and ReceiverTracker have
already started, along with their actors. Those cannot be shut down, because
the only way to do that is to call StreamingContext.stop(), which cannot be
called because the context has not been marked as ACTIVE.
The solution in this PR is to stop the internal scheduler if start() throws
an exception, and to mark the context as STOPPED.
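
The stop-on-failure pattern described above can be sketched roughly as
follows. This is a minimal, self-contained illustration and not the code in
this PR: SketchScheduler, SketchContext, and ContextState are hypothetical
stand-ins for the real Spark classes, and the failure is simulated by a
constructor flag.

```scala
import scala.util.control.NonFatal

// Hypothetical lifecycle states; names are illustrative, not Spark's own enum.
object ContextState extends Enumeration {
  val Initialized, Active, Stopped = Value
}

// Hypothetical stand-in for the internal scheduler and the components it owns.
class SketchScheduler(failOnValidate: Boolean) {
  @volatile var running = false

  def start(): Unit = {
    running = true // internal components (and their actors) are now live
    if (failOnValidate) {
      // Simulates a validation failure that surfaces after startup has begun.
      throw new IllegalStateException("checkpoint directory not set")
    }
  }

  def stop(): Unit = { running = false }
}

// Hypothetical stand-in for the context that owns the scheduler.
class SketchContext(val scheduler: SketchScheduler) {
  private var state = ContextState.Initialized

  def getState: ContextState.Value = synchronized { state }

  def start(): Unit = synchronized {
    require(state == ContextState.Initialized, s"cannot start from state $state")
    try {
      scheduler.start()
      state = ContextState.Active
    } catch {
      case NonFatal(e) =>
        // Clean up whatever was started, record the terminal state, and
        // rethrow so the caller still sees the original failure.
        scheduler.stop()
        state = ContextState.Stopped
        throw e
    }
  }
}
```

With this shape, a failed start() leaves nothing running and the context in
a terminal state, so the caller does not need a separate stop() call to
release resources.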
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/tdas/spark SPARK-7958
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/6559.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #6559
----
commit 790b617ff34848183054c5eb83d016217a25ada8
Author: Tathagata Das <[email protected]>
Date: 2015-06-01T20:31:20Z
Handled exception in StreamingContext.start()
----