yugan95 commented on PR #7621: URL: https://github.com/apache/paimon/pull/7621#issuecomment-4645273715
@JingsongLi Good suggestion — I've split this into three independent PRs, one per root cause: 1. **HeapBytesVector `reserveBytes()` integer overflow** — promote to `long` arithmetic, extract `calculateNewBytesCapacity(long)` helper, cap at `MAX_ARRAY_SIZE`. #8158 2. **RowHelper / InternalRowSerializer buffer release** — add `resetIfTooLarge()` with hysteresis in `finally` block to handle EOFException exit path. #8159 3. **BinaryRowSerializer reuse buffer shrink** — add hysteresis shrink in `deserialize(reuse)` to reclaim oversized buffers when workload transitions to small records. #8160 The Parquet config pass-through is already merged in #7956. Each PR is self-contained with its own tests and no cross-dependencies. Will link them here once submitted. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
