[jira] [Commented] (HDFS-12943) Consistent Reads from Standby Node

Chen Liang (JIRA) Wed, 19 Dec 2018 14:03:24 -0800


    [ 
https://issues.apache.org/jira/browse/HDFS-12943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725379#comment-16725379
 ]


Chen Liang commented on HDFS-12943:
-----------------------------------

Hi [~brahmareddy],

Some more notes to add:
1. getHAServiceState() only gets called when initialization of client proxies 
(and of course when existing proxies failed and client reinitialize them). In 
regular operation, this call will not happen so it should not be a concern in 
benchmarks.
2. I tried the unit test you shared locally with Observer read 
enabled/disabled. I did not see difference in terms of mkdir time, it has been 
about 2ms the whole time regardless. I saw some degradation on get content 
summary though. But this is due to that the unit test is doing mkdir -> 
getContentSummary -> getFileStatus -> repeat. So the client is constantly 
switching between write and read, and thus constantly switching between 
proxies(NNs). This is not the IO pattern Observer is mainly targeting for, and 
probably the worst case for Observer read because every single 
getContentSummary call here could potentially trigger Observer catch up wait.

> Consistent Reads from Standby Node
> ----------------------------------
>
>                 Key: HDFS-12943
>                 URL: https://issues.apache.org/jira/browse/HDFS-12943
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>          Components: hdfs
>            Reporter: Konstantin Shvachko
>            Priority: Major
>         Attachments: ConsistentReadsFromStandbyNode.pdf, 
> ConsistentReadsFromStandbyNode.pdf, HDFS-12943-001.patch, 
> HDFS-12943-002.patch, TestPlan-ConsistentReadsFromStandbyNode.pdf
>
>
> StandbyNode in HDFS is a replica of the active NameNode. The states of the 
> NameNodes are coordinated via the journal. It is natural to consider 
> StandbyNode as a read-only replica. As with any replicated distributed system 
> the problem of stale reads should be resolved. Our main goal is to provide 
> reads from standby in a consistent way in order to enable a wide range of 
> existing applications running on top of HDFS.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (HDFS-12943) Consistent Reads from Standby Node

Reply via email to