Arnab Karmakar has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/23946


Change subject: IMPALA-14623: Optimize memory usage for Iceberg file path hashes
......................................................................

IMPALA-14623: Optimize memory usage for Iceberg file path hashes

Use THash128 Thrift struct (16 bytes) instead of String (64 bytes)
for storing 128-bit Murmur3 hashes of Iceberg file paths, achieving
4x memory reduction.

Key Changes:
- Added THash128 Thrift struct with two i64 fields (high/low)
- Updated TIcebergContentFileStore to use THash128 as map keys
- Created Hash128 Java class with Thrift serialization support
- Added C++ comparison operators and hash functions for THash128

Testing:
- Added comprehensive JUnit tests for Hash128 class

Change-Id: Ie0de793de2434dae3b60c3aa4f87dba203eee3c1
---
M be/src/catalog/catalog-server.cc
M be/src/catalog/catalog.cc
M be/src/runtime/descriptors.cc
M be/src/service/frontend.cc
M be/src/util/container-util.h
A be/src/util/thash128-util.h
M common/thrift/CatalogObjects.thrift
M fe/src/main/java/org/apache/impala/catalog/FeIcebergTable.java
M fe/src/main/java/org/apache/impala/catalog/IcebergContentFileStore.java
M fe/src/main/java/org/apache/impala/planner/IcebergScanPlanner.java
A fe/src/main/java/org/apache/impala/util/Hash128.java
M fe/src/main/java/org/apache/impala/util/IcebergUtil.java
M fe/src/test/java/org/apache/impala/catalog/local/LocalCatalogTest.java
A fe/src/test/java/org/apache/impala/util/Hash128Test.java
M fe/src/test/java/org/apache/impala/util/IcebergUtilTest.java
15 files changed, 357 insertions(+), 29 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/46/23946/1
--
To view, visit http://gerrit.cloudera.org:8080/23946
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ie0de793de2434dae3b60c3aa4f87dba203eee3c1
Gerrit-Change-Number: 23946
Gerrit-PatchSet: 1
Gerrit-Owner: Arnab Karmakar <[email protected]>

Reply via email to