[ 
https://issues.apache.org/jira/browse/HADOOP-11802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14509652#comment-14509652
 ] 

Colin Patrick McCabe commented on HADOOP-11802:
-----------------------------------------------

bq. extra imports in DataXceiver, though really you probably meant to add the 
@Private annotation and just forgot.

fixed

bq. Add a newline in the DSW C file change, break the new POLLHUP check to the 
next line (like the other if you changed)

ok

bq. Adding a link to the webpage reference (along with mentioning portability / 
Cygwin) would also be nice, since I wondered why we didn't have to catch yet 
more poll errors.

I added a comment explaining why we check for POLLHUP.

bq. Typo "repsponse" in DataXceiver

fixed

bq. We typically have used a singleton to do fault injection, would be good to 
be consistent since it doesn't look like we need per-instance injection. See 
DataNodeFaultInjector, probably the best home.

OK.  That would eliminate the need to make the DataXceiver class public, which 
would be nice.
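
For anyone unfamiliar with the singleton injection pattern being discussed, here 
is a rough sketch. All names below (ExampleFaultInjector, 
beforeSendShmResponse, handleRequest) are hypothetical and chosen for 
illustration; this is not the actual DataNodeFaultInjector API or the code in 
the patch.

```java
import java.io.IOException;

// Hypothetical names throughout; not the real DataNodeFaultInjector API.
class ExampleFaultInjector {
  // One process-wide instance; production code always goes through get().
  private static volatile ExampleFaultInjector instance =
      new ExampleFaultInjector();

  static ExampleFaultInjector get() {
    return instance;
  }

  // Tests swap in a subclass and restore the default afterwards.
  static void set(ExampleFaultInjector injector) {
    instance = injector;
  }

  // No-op by default, so the hook does nothing outside of tests.
  void beforeSendShmResponse() throws IOException {
  }

  // Stand-in for the code path under test.
  static String handleRequest() {
    try {
      get().beforeSendShmResponse();
      return "ok";
    } catch (IOException e) {
      return "failed: " + e.getMessage();
    }
  }

  public static void main(String[] args) {
    System.out.println(handleRequest());
    ExampleFaultInjector original = get();
    set(new ExampleFaultInjector() {
      @Override
      void beforeSendShmResponse() throws IOException {
        throw new IOException("injected failure");
      }
    });
    try {
      System.out.println(handleRequest());
    } finally {
      set(original);  // always restore the default injector
    }
  }
}
```

Because the hook is reached through a static accessor, the class under test 
does not need a public constructor or setter just for the test's benefit.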

bq. Good fix on the javadoc for allocSlot, but mind adding the blockId param 
doc too for full coverage?

Hey, I'm trying to make incremental changes here :)  Fixed.

bq. The Throwable catch, it subsumes the IOException catch, so can we just 
delete it? I think the more specific name of the exception will be printed by 
its toString.

ok
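
To illustrate the point about the Throwable catch, here is a small standalone 
sketch. CatchThrowableExample, ThrowingAction and runAndDescribe are made-up 
names, not code from the patch. It only relies on the fact that 
Throwable#toString includes the concrete class name.

```java
import java.io.IOException;

// Hypothetical example class; not taken from the patch.
class CatchThrowableExample {
  // Minimal functional interface for an action that may throw.
  interface ThrowingAction {
    void run() throws Exception;
  }

  // Runs the action and describes any failure.
  static String runAndDescribe(ThrowingAction action) {
    try {
      action.run();
      return "ok";
    } catch (Throwable t) {
      // This also handles IOException, so a separate catch (IOException e)
      // block with the same body would be redundant. String concatenation
      // calls t.toString(), which includes the specific class name.
      return "failed: " + t;
    }
  }

  public static void main(String[] args) {
    // prints "failed: java.io.IOException: disk error"
    System.out.println(runAndDescribe(() -> {
      throw new IOException("disk error");
    }));
    // prints "failed: java.lang.IllegalStateException: bad state"
    System.out.println(runAndDescribe(() -> {
      throw new IllegalStateException("bad state");
    }));
  }
}
```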

bq. Param indentation in TestSCCache#checkNumberOfSeg... is inconsistent, I 
think we typically do double indent?

ok

bq. TestSCCache, the comment "Remove the failure injector" should be moved up a 
few lines

Let me just get rid of that, since the log message says the same thing.

> DomainSocketWatcher thread terminates sometimes after there is an I/O error 
> during requestShortCircuitShm
> ---------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-11802
>                 URL: https://issues.apache.org/jira/browse/HADOOP-11802
>             Project: Hadoop Common
>          Issue Type: Bug
>    Affects Versions: 2.7.0
>            Reporter: Eric Payne
>            Assignee: Colin Patrick McCabe
>         Attachments: HADOOP-11802.001.patch, HADOOP-11802.002.patch, 
> HADOOP-11802.003.patch
>
>
> In {{DataXceiver#requestShortCircuitShm}}, we attempt to recover from some 
> errors by closing the {{DomainSocket}}.  However, this violates the invariant 
> that the domain socket should never be closed when it is being managed by the 
> {{DomainSocketWatcher}}.  Instead, we should call {{shutdown}} on the 
> {{DomainSocket}}.  When this bug hits, it terminates the 
> {{DomainSocketWatcher}} thread.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
