[
https://issues.apache.org/jira/browse/ARROW-5221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rok Mihevc updated ARROW-5221:
------------------------------
External issue URL: https://github.com/apache/arrow/issues/21695
> Improvement the performance of class SegmentsUtil
> -------------------------------------------------
>
> Key: ARROW-5221
> URL: https://issues.apache.org/jira/browse/ARROW-5221
> Project: Apache Arrow
> Issue Type: Improvement
> Reporter: Liya Fan
> Assignee: Liya Fan
> Priority: Minor
>
> Improve the performance of class SegmentsUtil from two points:
> # In method allocateReuseBytes, the generated byte array should be cached
> for reuse, if the size does not exceed MAX_BYTES_LENGTH. However, the array
> is not cached if bytes.length < length, and this will lead to performance
> overhead:
>
> if (bytes == null) {
> if (length <= MAX_BYTES_LENGTH) {
> bytes = new byte[MAX_BYTES_LENGTH];
> BYTES_LOCAL.set(bytes);
> } else {
> bytes = new byte[length];
> }
> } else if (bytes.length < length) {
> bytes = new byte[length];
> }
>
> 2. To evaluate the offset, an integer is bitand with a mask to clear to low
> bits, and then shift right. The bitand is useless:
>
> ((index & BIT_BYTE_POSITION_MASK) >>> 3)
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)