[ 
https://issues.apache.org/jira/browse/HDFS-755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon updated HDFS-755:
-----------------------------

    Attachment: hdfs-755.txt

Here's an updated patch which fixes some behavior when running against an 
unpatched Common. If Common includes HADOOP-3205, it will be faster, and if it 
doesn't include HADOOP-3205, it should still work at the old speed.

I also ran some more benchmarks over lunch, running "fs -cat bigfile bigfile 
bigfile ...20 times..." repeatedly with and without the patch. This differs 
from my previous benchmark in that each JVM runs for a good 40-50 seconds - 
enough time to fully JIT the code, etc. The patch is about a 3.4% speedup 
compared to trunk for these long reads as well (at 95% significance level).

> Read multiple checksum chunks at once in DFSInputStream
> -------------------------------------------------------
>
>                 Key: HDFS-755
>                 URL: https://issues.apache.org/jira/browse/HDFS-755
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: hdfs client
>    Affects Versions: 0.22.0
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>         Attachments: hdfs-755.txt, hdfs-755.txt
>
>
> HADOOP-3205 adds the ability for FSInputChecker subclasses to read multiple 
> checksum chunks in a single call to readChunk. This is the HDFS-side use of 
> that new feature.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to