[ 
https://issues.apache.org/jira/browse/HDFS-12943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16297979#comment-16297979
 ] 

Chao Sun commented on HDFS-12943:
---------------------------------

[~chris.douglas] I did some experiment with the POC patch, on 2.8.3. It uses 
5000 containers to issue read/write requests that mimic production workloads 
(~95% reads, ~5% write).
With stale reads enabled, I observed around 60-80K throughput on the SBN, and 
around 20K on the ANN for write throughput. Without stale reads, the total 
throughput on the ANN was around 35-40K.
Also, with stale reads, the write throughput on ANN was 2-2.5X higher, while 
the GC time dropped from around 6s/min to 2s/min.

Hope this helps, and let me know if you need more data.


> Consistent Reads from Standby Node
> ----------------------------------
>
>                 Key: HDFS-12943
>                 URL: https://issues.apache.org/jira/browse/HDFS-12943
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>          Components: hdfs
>            Reporter: Konstantin Shvachko
>         Attachments: ConsistentReadsFromStandbyNode.pdf
>
>
> StandbyNode in HDFS is a replica of the active NameNode. The states of the 
> NameNodes are coordinated via the journal. It is natural to consider 
> StandbyNode as a read-only replica. As with any replicated distributed system 
> the problem of stale reads should be resolved. Our main goal is to provide 
> reads from standby in a consistent way in order to enable a wide range of 
> existing applications running on top of HDFS.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to