Dan Hecht has posted comments on this change.

Change subject: IMPALA-3453: S3: Uneven split sizes are generated for Parquet causing execution skew
......................................................................

Patch Set 1: Code-Review+2

(1 comment)

Why didn't the multiple row group test find this issue? Is that test being skipped on S3? I think we should figure out a way to enable that test (or a variant) to test this. Did you manually test it, or will Mostafa?

http://gerrit.cloudera.org:8080/#/c/2968/1//COMMIT_MSG
Commit Message:

Line 23: is governed by "fs.s3a.block.size". Its default value is 32MB.
We should probably explain this in the Impala+S3 documentation. Can you add a doc note to the JIRA?

--
To view, visit http://gerrit.cloudera.org:8080/2968
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: comment
Gerrit-Change-Id: Ib1518ad0c89ef35a3b0567c3902e85a41e34bc3d
Gerrit-PatchSet: 1
Gerrit-Project: Impala
Gerrit-Branch: cdh5-trunk
Gerrit-Owner: Sailesh Mukil <[email protected]>
Gerrit-Reviewer: Dan Hecht <[email protected]>
Gerrit-Reviewer: Sailesh Mukil <[email protected]>
Gerrit-HasComments: Yes
