On Mar 29, 2007, at 12:07 PM, Doug Cutting wrote:
> Nigel Daley wrote:
>> So shouldn't fixing this test to conform to the new model in
>> HADOOP-1134 be the concern of the patch for HADOOP-1134?
> Yes, but, as it stands, this patch would silently stop working
> correctly once HADOOP-1134 is committed. It should instead be
> written in a more robust way that can survive expected changes.
> Relying on HDFS using ChecksumFileSystem isn't as reliable as an
> explicit constructor that says "I want an unchecksummed FileSystem."
Ya, that's fine. I have no problem changing the way the patch is
implemented.
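
Something along these lines is what I'd picture (a sketch only: the
RawFileSystemFactory class and its getRaw() method are made up for
illustration, though ChecksumFileSystem.getRawFileSystem() is real
API). The caller states its intent once, and the call keeps working
even if the default FileSystem stops being a ChecksumFileSystem:

  import java.io.IOException;

  import org.apache.hadoop.conf.Configuration;
  import org.apache.hadoop.fs.ChecksumFileSystem;
  import org.apache.hadoop.fs.FileSystem;

  // Hypothetical factory: an explicit "I want an unchecksummed
  // FileSystem" entry point that survives HADOOP-1134.
  public class RawFileSystemFactory {
    public static FileSystem getRaw(Configuration conf) throws IOException {
      FileSystem fs = FileSystem.get(conf);
      if (fs instanceof ChecksumFileSystem) {
        // Strip the client-side checksum layer when one is present.
        return ((ChecksumFileSystem) fs).getRawFileSystem();
      }
      // No checksum layer (e.g. after HADOOP-1134): use fs as-is.
      return fs;
    }
  }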
>> As it stands, I can't run NNBench at scale without using a raw file
>> system, which is what this patch is intended to allow.
> It seems strange to disable things in an undocumented and
> unsupported way in order to get a benchmark to complete. How does
> that prove scalability? Rather, leaving NNBench alone seems like a
> strong argument for implementing HADOOP-1134 sooner.
As you realized below, the test was using raw methods before
HADOOP-928. I don't understand your reference to "undocumented" and
"unsupported", but I'm not sure it matters.
> Still, if you want to be able to disable checksums, for benchmarks
> or whatever, we can permit that, but should do so explicitly.
>> HADOOP-928 caused this test to use a ChecksumFileSystem and
>> subsequently we saw our "read" TPS metric plummet from 20,000 to a
>> couple hundred.
> Ah, NNBench used the 'raw' methods before, which was kind of sneaky
> on its part, since it didn't benchmark the typical user experience.
> Although the namenode performance should only halve at worst with
> checksums as currently implemented, no?
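
(For reference, the "halve at worst" presumably follows from
ChecksumFileSystem keeping each file's checksums in a hidden ".crc"
side file: every open or create touches two namespace entries instead
of one, so namenode operations at most double and TPS at most halves.
A quick illustration of the side-file naming, using the real
getChecksumFile() method against the local filesystem:)

  import org.apache.hadoop.conf.Configuration;
  import org.apache.hadoop.fs.FileSystem;
  import org.apache.hadoop.fs.LocalFileSystem;
  import org.apache.hadoop.fs.Path;

  public class CrcSideFile {
    public static void main(String[] args) throws Exception {
      // LocalFileSystem is a ChecksumFileSystem, so it can report the
      // side file that shadows each data file.
      LocalFileSystem fs = FileSystem.getLocal(new Configuration());
      // Prints "/bench/.file0.crc": a second file, hence a second
      // namespace operation, for every data file accessed.
      System.out.println(fs.getChecksumFile(new Path("/bench/file0")));
    }
  }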
One of the design goals of the test is to remove the effects of
DataNodes as much as possible, since this is a NameNode benchmark.
That's why we used the raw methods (and therefore no CRCs). We run it
with 1-byte files, 1-byte blocks, and a replication factor of 1, all
designed to maximize the load on the NameNode and minimize the
effects of the DataNodes.
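
Concretely, the setup amounts to something like this (a sketch, not
NNBench's actual code; the property names are the current HDFS ones):

  import org.apache.hadoop.conf.Configuration;

  public class NNStressConf {
    public static Configuration create() {
      Configuration conf = new Configuration();
      // One replica per block: minimal DataNode traffic.
      conf.set("dfs.replication", "1");
      // 1-byte blocks: block bookkeeping, not data transfer, dominates.
      conf.set("dfs.block.size", "1");
      return conf;
    }
  }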
>> Let's get our current benchmark back on track before we commit
>> HADOOP-1134 (which will likely take a while before it is "Patch
>> Available").
> I'd argue that we should fix the benchmark to accurately reflect
> what users see, so that we see real improvement when HADOOP-1134 is
> committed. That would make it a more useful and realistic
> benchmark. However, if you believe that a checksum-free benchmark is
> still useful, I think it should be more future-proof.
I think this is the crux of the misunderstanding. This is a NameNode
benchmark, not a DataNode benchmark nor a system benchmark. It
attempts to measure the TPS the NameNode can sustain in the extreme
case. I think you want a different kind of benchmark, which is fair.
It's just not this benchmark.
Cheers,
Nige