[
https://issues.apache.org/jira/browse/SAMZA-1455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16201372#comment-16201372
]
Jagadish commented on SAMZA-1455:
---------------------------------
The fix is straightforward:
1. Cleanly close the coordinatorStream producer/ consumer in the JobRunner.
2. The finally block that tears these down is an obvious candidate for this.
> Shutdown coordinator stream producers and consumers cleanly in JobRunner
> ------------------------------------------------------------------------
>
> Key: SAMZA-1455
> URL: https://issues.apache.org/jira/browse/SAMZA-1455
> Project: Samza
> Issue Type: Bug
> Reporter: Jagadish
>
> Currently, we do not cleanly close the producer and consumer in the
> JobRunner. This means that, any exception happening in the JobRunner will
> simply exit the main thread but not call tear-down the producers/consumers.
> For producers and consumers that spawn non-daemon threads (for example, a
> KafkaConsumer), this has the effect of not shutting down the JVM cleanly.
> In our production clusters, We have observed that JVM processes
> (corresponding to the JobRunner) do not shut-down. Often, these processes
> hold on to deleted file handles leading to multiple resource leaks.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)