lxy-9602 opened a new pull request, #37:
URL: https://github.com/apache/paimon-cpp/pull/37

   <!-- Please specify the module before the PR name: feat: ... or fix: ... -->
   
   ### Purpose
   
   <!-- Linking this pull request to the issue -->
   No Linked issue.
   
   Introduce hashing, checksum, and compression utilities:
   
   - `BloomFilter` / `BloomFilter64` — probabilistic membership filters with 
32-bit and 64-bit hashing (`bloom_filter.h/cpp`, `bloom_filter64.h/cpp`)
   - `CRC32C` — CRC32-Castagnoli checksum with SSE4.2 hardware acceleration 
fallback (`crc32c.h/cpp`)
   - `MurmurHashUtils` — MurmurHash3 implementation for byte arrays and 
MemorySegments (`murmurhash_utils.h`)
   - `VarLengthIntUtils` — variable-length integer encoding/decoding 
(`var_length_int_utils.h`)
   - `DeltaVarintCompressor` — delta + varint compression for integer sequences 
(`delta_varint_compressor.h/cpp`)
   
   Also adds `xxhash_test.cpp` and `data_define_test.cpp` test coverage and 
complete `testharness.h/.cpp`.
   
   
   <!-- What is the purpose of the change -->
   
   ### Tests
   
   - `bloom_filter_test.cpp`, `bloom_filter64_test.cpp` — filter add/contains, 
false-positive rate
   - `crc32c_test.cpp` — checksum correctness across input sizes
   - `murmurhash_utils_test.cpp` — hash consistency and known-vector checks
   - `var_length_int_utils_test.cpp` — varint round-trip encoding
   - `delta_varint_compressor_test.cpp` — compression/decompression round-trip
   - `xxhash_test.cpp`, `data_define_test.cpp`
   
   <!-- List UT and IT cases to verify this change -->
   
   ### API and Format
   
   <!-- Does this change affect API in include dir or storage format or 
protocol -->
   
   ### Documentation
   
   <!-- Does this change introduce a new feature -->
   
   ### Generative AI tooling
   Migrate-by: Aone Copilot (Claude)
   <!--
   If generative AI tooling has been used in the process of authoring this 
patch, please include the
   phrase: 'Generated-by: ' followed by the name of the tool and its version.
   If no, write 'No'.
   Please refer to the [ASF Generative Tooling 
Guidance](https://www.apache.org/legal/generative-tooling.html) for details.
   -->
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to