[ 
https://issues.apache.org/jira/browse/HDFS-6581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14138165#comment-14138165
 ] 

Colin Patrick McCabe commented on HDFS-6581:
--------------------------------------------

It's exciting to see this making progress.

One thing that I would really like to see is an eviction policy other than 
least recently used (LRU).  LRU has been well-documented to be a very poor 
policy for scan workloads, which most of HDFS' workloads are.  It would be nice 
to see something like 2Q (See 
http://www.tedunangst.com/flak/post/2Q-buffer-cache-algorithm), or at least a 
pluggable interface that would allow this to be implemented.  I think that very 
few users, if any, will want LRU.

Do we have performance benchmarks for this?  It would be nice to see that would 
could get a substantial fraction of memory bandwidth when writing to a single 
replica in-memory.  That usually means somewhere between 5 gigs/s and 10 
gigs/s.  Perhaps it's too early to ask for these, since we haven't implemented 
HDFS-6933.  But it would be good to start thinking about this.  Also, some of 
the recent work to use native checksums has substantially reduced the checksum 
overhead in the write path so maybe we can get an estimate without HDFS-6933.

> Write to single replica in memory
> ---------------------------------
>
>                 Key: HDFS-6581
>                 URL: https://issues.apache.org/jira/browse/HDFS-6581
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: datanode
>            Reporter: Arpit Agarwal
>            Assignee: Arpit Agarwal
>         Attachments: HDFS-6581.merge.01.patch, HDFS-6581.merge.02.patch, 
> HDFS-6581.merge.03.patch, HDFSWriteableReplicasInMemory.pdf, 
> Test-Plan-for-HDFS-6581-Memory-Storage.pdf
>
>
> Per discussion with the community on HDFS-5851, we will implement writing to 
> a single replica in DN memory via DataTransferProtocol.
> This avoids some of the issues with short-circuit writes, which we can 
> revisit at a later time.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to