[
https://issues.apache.org/jira/browse/HDFS-9095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14901293#comment-14901293
]
James Clampffer commented on HDFS-9095:
---------------------------------------
Agree with bob about making the CMakeLists as robust as possible, otherwise +1
on the patch. Getting in the basics for logging is very nice as well.
Re: In RpcConnection methods, should we be calling into the handler while
holding the lock on the engine state? If any method there does synchronous I/O
or hangs for any reason, the whole Rpc system locks up.
This was done to avoid using a std::recursive_mutex because right now that
handler only gets called from OnRecvCompleted. I don't think the handler is
going to be changing much unless we start using multiple connections from a
single RpcEngine. Lock contention is one of the things I hope to start
profiling soon; if the overhead is negligible I'll switch that back to a
recursive_mutex and grab the lock in the handler as well (I'll file a jira if
that's the case).
> RPC client should fail gracefully when the connection is timed out or reset
> ---------------------------------------------------------------------------
>
> Key: HDFS-9095
> URL: https://issues.apache.org/jira/browse/HDFS-9095
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Components: hdfs-client
> Reporter: Haohui Mai
> Assignee: Haohui Mai
> Attachments: HDFS-9095.000.patch
>
>
> The RPC client should fail gracefully when the connection is timed out or
> reset. instead of bailing out.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)