[
https://issues.apache.org/jira/browse/HADOOP-11574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14335407#comment-14335407
]
Ray Chiang commented on HADOOP-11574:
-------------------------------------
I like the user-centric definitions above. Then for each type of error, such
as:
- DNS/UnknownHostException
- RPC/RemoteException
- SecurityException
We can see where it's deficient in the context of each user.
As with most of our log messages, I might worry a bit about finding the right
balance of giving notification and filling the logs too much.
> Uber-JIRA: improve Hadoop network resilience & diagnostics
> ----------------------------------------------------------
>
> Key: HADOOP-11574
> URL: https://issues.apache.org/jira/browse/HADOOP-11574
> Project: Hadoop Common
> Issue Type: Task
> Components: net
> Affects Versions: 2.6.0
> Reporter: Steve Loughran
> Labels: supportability
>
> Improve Hadoop's resilience to bad network conditions/problems, including
> * improving recognition of problem states
> * improving diagnostics
> * better handling of IPv6 addresses, even if the protocol is unsupported.
> * better behaviour client-side when there are connectivity problems. (i.e
> while some errors you can spin on, DNS failures are not on the list)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)