Bench-marking Hadoop Performance

2014-07-22 Thread Charley Newtonne
This is a new cluster I'm putting up and I need to get an idea on what to expect from a performance standpoint. Older docs point to gridmix and TestDFSIO . However, most of this doc is obsolete and no longer applies on 2.4. Where can I find benchmarking docs for 2.4? What are my options? Also, I

Re: Bench-marking Hadoop Performance

2014-07-22 Thread jay vyas
There are alot of tests out there and it can be tough to determine what is a standard. - TeraGen/TearSort and testdfsio are starting points. - Various other non apache projects (such as ycsb or hibench) will have good benchmarks for certain type sof cases. -If looking for a more comprehensive