Hairong Kuang wrote:
What Doug suggested makes sense. We should make the initial buffer size to be bytesPerChecksum and the user defined buffer size to be the size of the second buffer. This will also solve most of the problems that I described in HADOOP-1124.
But this will not fix the same problem with block-level checksums. Pretty soon, HDFS will not use ChecksumFileSystem at all. Ideally we should let the implementations decide how to buffer.
Raghu.
Hairong