interesting.

FWIW the work on formal specification (in the Computer Science sense of
"formally") is in HADOOP-9361; the HCFS work being driven by Red Hat is more
about testing.

Some extra ideas on benchmarking:
# something to assess the performance of cross-FS operations
# it'd be nice to have something that would let you experiment with
different hardware options in the NN
# the gridmix3 MapReduce benchmarks can collect job usage stats to generate
synthetic workloads. Maybe we could derive something similar from NN
metrics, so that we could build up a better pool of operations on different
workloads (e.g. HBase, Hive + Tez) and apply them.
# there's work needed on scalability tests across filesystems; for the
'9361 tests I'm making them per-FS programmable for options like max # of
files in a directory test, max file size etc. Any additions there would be
welcome.
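
To make the NN-metrics idea above a bit more concrete, here is a minimal
Python sketch, assuming per-operation counts have already been collected
from the NN over some window. The operation names, the counts and the
function names are all made up for illustration; none of this is an
existing Hadoop tool or API.

```python
import random

# Hypothetical per-operation counts, e.g. gathered from NN metrics over
# some time window. Names and numbers are illustrative assumptions only.
observed_ops = {
    "create": 1200,
    "open": 8500,
    "rename": 300,
    "delete": 400,
    "listStatus": 2600,
}


def operation_mix(counts):
    """Turn raw operation counts into a probability per operation."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no operations observed")
    return {op: n / total for op, n in counts.items()}


def synthetic_workload(counts, length, seed=None):
    """Sample a sequence of operation names that follows the observed mix."""
    rng = random.Random(seed)  # seeded so a run can be reproduced
    ops = list(counts)
    weights = [counts[op] for op in ops]
    return rng.choices(ops, weights=weights, k=length)


mix = operation_mix(observed_ops)
workload = synthetic_workload(observed_ops, length=1000, seed=42)
```

A replay tool would then issue each sampled operation against the
filesystem under test. The sketch ignores ordering, timing and path
locality, which a real workload model would probably need to capture.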




On 4 September 2013 22:27, Erik Paulson <epaul...@unit1127.com> wrote:

> Hello all -
>
> As part of a side project, I've been interested in HDFS benchmarking,
> particularly of the Namenode. To get started, I tried to track down a
> number of different benchmarks and collect a few observations about each.
> I've put together a list here:
>
> http://epaulson.github.io/HadoopInternals/benchmarks.html
>
> The benchmarks I included were:
> DFSIO
> DFSIO-e
> NNBench and NNBenchWithoutMR
> S-Live
> LoadGenerator
> NNThroughputBenchmark
> TestEditLog
> MStress, from Quantcast
> Ohio State Microbenchmarks
> SWIM
>
> (I also wrote a bit about what else I'd like to see in a NN benchmark)
>
> I'd appreciate any corrections, feedback, and pointers to code that I
> missed!
>
> Thanks!
>
> -Erik
>
