Skye Wanderman-Milne has posted comments on this change.

Change subject: IMPALA-3038: Add multistream gzip/bzip2 test coverage
......................................................................
Patch Set 7:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/2543/7/be/src/util/decompress-test.cc
File be/src/util/decompress-test.cc:

Line 255: // Repeatedly pick random-size input data(~1MB), compress it, then concatenate
> I try to simulate pbzip2, it split large input into smaller chunks then com

Ah ok, sorry, I keep getting confused about the sizes. Makes sense though. Can you include some of this reasoning in the comment? I think it's useful to know that this is an approximation of pbzip2's behavior, that you want at least 8MB of compressed data so that it is bigger than the decompressor buffer size (which I think is actually unrelated to the IO buffer size), and that you expect a ~2:1 compression ratio.

--
To view, visit http://gerrit.cloudera.org:8080/2543
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: comment
Gerrit-Change-Id: I9b0e1971145dd457e71fc9c00ce7c06fff8dea88
Gerrit-PatchSet: 7
Gerrit-Project: Impala
Gerrit-Branch: cdh5-trunk
Gerrit-Owner: Juan Yu <[email protected]>
Gerrit-Reviewer: Juan Yu <[email protected]>
Gerrit-Reviewer: Skye Wanderman-Milne <[email protected]>
Gerrit-HasComments: Yes
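
The sizing reasoning in the review comment can be made concrete with a little arithmetic. The following is a minimal sketch, not the code from the patch: it only plans raw chunk sizes and does no compression. The names `PlanChunkSizes` and `TotalBytes` are hypothetical, the 8MB target and ~2:1 ratio are the figures quoted in the comment, and the 0.5x to 1.5x spread around the ~1MB nominal chunk size is an assumption made for illustration.

```cpp
#include <cstdint>
#include <numeric>
#include <random>
#include <vector>

// Sums the planned raw chunk sizes.
int64_t TotalBytes(const std::vector<int64_t>& sizes) {
  return std::accumulate(sizes.begin(), sizes.end(), int64_t{0});
}

// Hypothetical helper: picks random raw chunk sizes around `nominal_chunk`
// until the total raw input is large enough that, at the assumed compression
// ratio, the concatenated compressed streams should reach `min_compressed`.
// Each chunk would be compressed as its own stream and the streams
// concatenated, approximating what pbzip2 produces.
std::vector<int64_t> PlanChunkSizes(int64_t min_compressed, double assumed_ratio,
                                    int64_t nominal_chunk, std::mt19937& rng) {
  // With a ~2:1 ratio, 8MB of compressed output needs ~16MB of raw input.
  const int64_t raw_target = static_cast<int64_t>(min_compressed * assumed_ratio);
  // Assumed spread: 0.5x to 1.5x of the nominal chunk size.
  std::uniform_int_distribution<int64_t> dist(nominal_chunk / 2,
                                              nominal_chunk * 3 / 2);
  std::vector<int64_t> sizes;
  int64_t total = 0;
  while (total < raw_target) {
    sizes.push_back(dist(rng));
    total += sizes.back();
  }
  return sizes;
}
```

Calling `PlanChunkSizes(8LL * 1024 * 1024, 2.0, 1024 * 1024, rng)` with a seeded `std::mt19937` yields a list of chunk sizes whose total is at least 16MB. Whether the compressed result actually exceeds the decompressor buffer size depends on how compressible the generated input is, so a real test would still need to check the compressed length.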
