> In our setup, we didn't change io.seqfile.compress.blocksize (1MB) and > it's still fairly good. > You are free to try 100MB for better compression ratio, but I would > recommend to keep the default setting to minimize the possibilities of > hitting unknown bugs.
Makes sense. Better compression brought down a count(1) query from 100+ sec down to 40sec. The ETL phase is now taking 510sec as opposed to 700sec earlier. Do you also compress all tables, not just the raw ones? Would you recommend it? Saurabh. -- http://nandz.blogspot.com http://foodieforlife.blogspot.com
