[ 
https://issues.apache.org/jira/browse/HDFS-6581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14138383#comment-14138383
 ] 

Colin Patrick McCabe commented on HDFS-6581:
--------------------------------------------

bq. I don't recall any performance numbers were published before merging to 
trunk. Looking through the Jira even now I don't see any numbers. Maybe it's 
there - feel free to give a pointer.

There are a bunch of performance numbers on HDFS-4953 (a subtask of HDFS-4949). 
 Enabling zero-copy reads was an important goal of HDFS-4949.  On HDFS-4953, 
Todd and I discussed how to get close to the optimal 9GB/s per core.  More 
generally, we knew the HDFS read path could push a few gigs a second even 
without zero-copy because we'd done the testing beforehand (some of the results 
are on HDFS-347, some of that discussion is scattered other places).  So we 
were confident that HDFS-4949 was a performance improvement at the time of the 
merge.

bq. The current direction is based on broad discussion with the community here 
and on HDFS-5851, and the overall consensus was to keep DataTransferProtocol in 
the picture for now.

I'm fine with keeping {{DataTransferProtocol}} in the picture for now.  But you 
should still be able to show a performance improvement when reading (not 
writing) the data which you have previously written to the single replica in 
memory.  If there are flaws in this (for example, if short-circuit doesn't yet 
work for these blocks, causing slowness) then we should address this before 
merging.

We should also quantify exactly what the write performance is, even if it's not 
as good as it could be.  This will tell us what we need to work on in the 
future and what the priorities should be.

> Write to single replica in memory
> ---------------------------------
>
>                 Key: HDFS-6581
>                 URL: https://issues.apache.org/jira/browse/HDFS-6581
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: datanode
>            Reporter: Arpit Agarwal
>            Assignee: Arpit Agarwal
>         Attachments: HDFS-6581.merge.01.patch, HDFS-6581.merge.02.patch, 
> HDFS-6581.merge.03.patch, HDFSWriteableReplicasInMemory.pdf, 
> Test-Plan-for-HDFS-6581-Memory-Storage.pdf
>
>
> Per discussion with the community on HDFS-5851, we will implement writing to 
> a single replica in DN memory via DataTransferProtocol.
> This avoids some of the issues with short-circuit writes, which we can 
> revisit at a later time.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to