[ https://issues.apache.org/jira/browse/HDFS-6227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jing Zhao updated HDFS-6227:
----------------------------
Attachment: HDFS-6227.000.patch
I'm not familiar with the ShortCircuitCache code. I'm wondering whether forcing
the ShortCircuitCache to purge/close the replica when a reader hits an exception
while reading would solve the issue, since DFSInputStream can handle the retry
logic. The uploaded patch demonstrates the idea. In my local test, this appears
to let other readers continue.
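
To make the idea concrete, here is a minimal, self-contained sketch of the
purge-on-error pattern. It is not the attached patch and does not use the real
HDFS classes: Replica, ReplicaCache, purge and readWithRetry are hypothetical
names made up for illustration, and the retry loop only stands in for the retry
logic that DFSInputStream already has.

```java
import java.io.IOException;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

// Hypothetical stand-ins for illustration; none of these are real HDFS classes.
final class PurgeOnErrorSketch {

    /** A cached handle to a block replica (hypothetical). */
    interface Replica {
        int read(byte[] buf) throws IOException;
        void close();
    }

    /** Minimal cache keyed by block id (hypothetical, not ShortCircuitCache). */
    static final class ReplicaCache {
        private final Map<Long, Replica> replicas = new ConcurrentHashMap<>();
        private final Function<Long, Replica> opener;

        ReplicaCache(Function<Long, Replica> opener) {
            this.opener = opener;
        }

        /** Return the cached replica, opening a new one if none is cached. */
        Replica fetch(long blockId) {
            return replicas.computeIfAbsent(blockId, opener);
        }

        /**
         * Drop and close the replica, but only if it is still the cached one,
         * so a replacement opened by another reader is left alone.
         */
        void purge(long blockId, Replica bad) {
            if (replicas.remove(blockId, bad)) {
                bad.close();
            }
        }
    }

    /** Read once per attempt; on failure purge the replica and try again. */
    static int readWithRetry(ReplicaCache cache, long blockId, byte[] buf,
                             int maxAttempts) throws IOException {
        IOException last = null;
        for (int attempt = 0; attempt < maxAttempts; attempt++) {
            Replica replica = cache.fetch(blockId);
            try {
                return replica.read(buf);
            } catch (IOException e) {
                // e.g. ClosedChannelException: evict so the next fetch reopens.
                cache.purge(blockId, replica);
                last = e;
            }
        }
        throw last != null ? last : new IOException("no read attempts made");
    }
}
```

In this sketch, a reader that fails evicts only the replica it was using, so
the next fetch (by the same reader or another one) opens a fresh replica rather
than reusing the broken one.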
[~cmccabe], thanks for taking this. Please let me know if I can help test your
fix.
> Short circuit read failed due to ClosedChannelException
> -------------------------------------------------------
>
> Key: HDFS-6227
> URL: https://issues.apache.org/jira/browse/HDFS-6227
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 2.4.0
> Reporter: Jing Zhao
> Assignee: Colin Patrick McCabe
> Attachments: HDFS-6227.000.patch,
> ShortCircuitReadInterruption.test.patch
>
>
> While running tests in a single node cluster, where short circuit read is
> enabled and multiple threads may read the same file concurrently, one of the
> reads hit a ClosedChannelException and failed. See the comment for the full
> exception trace.