Marcel Kornacker has posted comments on this change. Change subject: IMPALA-5378: Disk IO manager needs to understand ADLS ......................................................................
Patch Set 1: (1 comment) http://gerrit.cloudera.org:8080/#/c/7033/1/be/src/runtime/disk-io-mgr-scan-range.cc File be/src/runtime/disk-io-mgr-scan-range.cc: Line 402: // ADLS uses buffer sizes of 4k. Given that, and the above JNI array allocation overhead > Yes, it would cut a buffer at 4MB or a flush, whatever comes first. We'd wa the data pages themselves are small, but the contiguous section occupied by a single column's data pages within a row group is in many cases >4mb (depends on the file size = block size and the number of columns). -- To view, visit http://gerrit.cloudera.org:8080/7033 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-MessageType: comment Gerrit-Change-Id: I067f053fec941e3631610c5cc89a384f257ba906 Gerrit-PatchSet: 1 Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-Owner: Sailesh Mukil <[email protected]> Gerrit-Reviewer: Marcel Kornacker <[email protected]> Gerrit-Reviewer: Sailesh Mukil <[email protected]> Gerrit-HasComments: Yes
