Csaba Ringhofer has uploaded this change for review. ( http://gerrit.cloudera.org:8080/23958
Change subject: IMPALA-14734: Optimize sorting file descriptors during planning
......................................................................

IMPALA-14734: Optimize sorting file descriptors during planning

IcebergScanNode sorts the file descriptors (by path) if Iceberg's
planFiles() is not called (IMPALA-12765). This can dominate planning
time if there are many files.

This change makes this faster by avoiding extracting Java Strings
from flatbuffer, which involves utf8 decoding.

Also changes a few similar functions to avoid duplicate decoding.

For a table with ~1 million files:
explain select * from bigice limit 1;
before: ~12s
after: ~6.5s

Change-Id: Icb914eb4de7bdadeb876f7dd101e8737b9527b6f
---
M fe/src/main/java/org/apache/impala/catalog/FileDescriptor.java
M fe/src/main/java/org/apache/impala/planner/IcebergScanNode.java
2 files changed, 23 insertions(+), 8 deletions(-)

  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/58/23958/1
--
To view, visit http://gerrit.cloudera.org:8080/23958
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Icb914eb4de7bdadeb876f7dd101e8737b9527b6f
Gerrit-Change-Number: 23958
Gerrit-PatchSet: 1
Gerrit-Owner: Csaba Ringhofer <[email protected]>
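
Illustrative note (not the code in this patch): the idea of sorting paths
without decoding them to Java Strings can be sketched roughly as below. The
class Utf8PathSort and its methods are hypothetical names, and plain
ByteBuffers stand in for the flatbuffer-backed path bytes; the real
FileDescriptor API is not shown. The sketch compares the raw UTF-8 bytes as
unsigned values, which orders strings by Unicode code point. That can differ
from String.compareTo() only for paths containing supplementary characters.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

public class Utf8PathSort {
  // Compares the remaining bytes of two buffers as unsigned values.
  // No String is created and no UTF-8 decoding happens here.
  static int compareUtf8(ByteBuffer a, ByteBuffer b) {
    int aPos = a.position();
    int bPos = b.position();
    int n = Math.min(a.remaining(), b.remaining());
    for (int i = 0; i < n; i++) {
      // Absolute get() leaves the buffer positions untouched.
      int cmp = Integer.compare(a.get(aPos + i) & 0xFF, b.get(bPos + i) & 0xFF);
      if (cmp != 0) return cmp;
    }
    // Equal common prefix: the shorter path sorts first.
    return Integer.compare(a.remaining(), b.remaining());
  }

  // Helper for the example only: wraps a path as UTF-8 bytes.
  static ByteBuffer utf8(String s) {
    return ByteBuffer.wrap(s.getBytes(StandardCharsets.UTF_8));
  }

  // Sorts paths by their encoded bytes; decoding happens once per path at
  // the end, only to return readable results from this example.
  static List<String> sortedPaths(List<String> paths) {
    List<ByteBuffer> encoded = new ArrayList<>();
    for (String p : paths) encoded.add(utf8(p));
    encoded.sort(Utf8PathSort::compareUtf8);
    List<String> out = new ArrayList<>();
    for (ByteBuffer b : encoded) {
      out.add(StandardCharsets.UTF_8.decode(b.duplicate()).toString());
    }
    return out;
  }
}
```

With a String-based comparator every comparison may decode both paths again,
so a sort over N files pays for on the order of N log N decodes. A byte-level
comparator avoids that cost entirely.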
