[
https://issues.apache.org/jira/browse/HIVE-15893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15866506#comment-15866506
]
Xuefu Zhang edited comment on HIVE-15893 at 2/17/17 6:47 PM:
-------------------------------------------------------------
[~lirui], I didn't mean HIVE-15860 will provide a solution that solves the
problem described here, which is about detecting issues in the driver. I was
saying that with the job monitoring thread monitor jobs submitted to the driver
and the fix in HIVE-15860, maybe the problem is mitigated or avoided. If this
is true, then we might not need the proposal here. This needs further
investigation though.
was (Author: xuefuz):
[~lirui], I didn't mean HIVE-15860 will provide a solution that solves the
problem described here, which is about detecting issues in the driver. I was
saying that with the job monitoring thread monitor jobs submitted to the driver
and the fix here, maybe the problem is mitigated or avoided. If this is true,
then we might not need the proposal here. This needs further investigation
though.
> Followup on HIVE-15671
> ----------------------
>
> Key: HIVE-15893
> URL: https://issues.apache.org/jira/browse/HIVE-15893
> Project: Hive
> Issue Type: Improvement
> Components: Spark
> Affects Versions: 2.2.0
> Reporter: Xuefu Zhang
> Assignee: Xuefu Zhang
>
> In HIVE-15671, we fixed a type where server.connect.timeout is used in the
> place of client.connect.timeout. This might solve some potential problems,
> but the original problem reported in HIVE-15671 might still exist. (Not sure
> if HIVE-15860 helps). Here is the proposal suggested by Marcelo:
> {quote}
> bq: server detecting a driver problem after it has connected back to the
> server.
> Hmm. That is definitely not any of the "connect" timeouts, which probably
> means it isn't configured and is just using netty's default (which is
> probably no timeout?). Would probably need something using
> io.netty.handler.timeout.IdleStateHandler, and also some periodic "ping" so
> that the connection isn't torn down without reason.
> {quote}
> We will use this JIRA to track the issue.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)