lxy-9602 opened a new pull request, #50: URL: https://github.com/apache/paimon-cpp/pull/50
<!-- Please specify the module before the PR name: feat: ... or fix: ... -->

### Purpose

<!-- Linking this pull request to the issue -->
No Linked issue.

Migrate bloom filter and bit-sliced index (BSI) file index implementations:

**Bloom filter file index (`src/paimon/common/file_index/bloomfilter/`):**
- `BloomFilterFileIndex`: bloom filter based file index for membership queries (`bloom_filter_file_index.h/cpp`)
- `FastHash`: fast hash function used by bloom filter for hash computation (`fast_hash.h/cpp`)
- `BloomFilterFileIndexFactory`: factory for registering bloom filter index implementation (`bloom_filter_file_index_factory.h/cpp`)

**Bit-sliced index (`src/paimon/common/file_index/bsi/`):**
- `BitSliceIndexBitmapFileIndex`: BSI-based file index for range and equality queries on numeric columns (`bit_slice_index_bitmap_file_index.h/cpp`)
- `BitSliceIndexRoaringBitmap`: roaring bitmap backed bit-slice index supporting arithmetic comparisons (`bit_slice_index_roaring_bitmap.h/cpp`)
- `BitSliceIndexBitmapFileIndexFactory`: factory for registering BSI index implementation (`bit_slice_index_bitmap_file_index_factory.h/cpp`)

<!-- What is the purpose of the change -->

### Tests

- `bloom_filter_file_index_test.cpp`: bloom filter write/read round-trip, false positive rate
- `fast_hash_test.cpp`: hash function correctness and distribution
- `bit_slice_index_bitmap_file_index_test.cpp`: BSI index write/read, range/equality queries
- `bit_slice_index_roaring_bitmap_test.cpp`: roaring bitmap BSI arithmetic operations

<!-- List UT and IT cases to verify this change -->

### API and Format

<!-- Does this change affect API in include dir or storage format or protocol -->

### Documentation

<!-- Does this change introduce a new feature -->

### Generative AI tooling

Migrate-by: Aone Copilot (Claude)

<!-- If generative AI tooling has been used in the process of authoring this patch, please include the phrase: 'Generated-by: ' followed by the name of the tool and its version. If no, write 'No'.
Please refer to the [ASF Generative Tooling Guidance](https://www.apache.org/legal/generative-tooling.html) for details. -->

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
