[
https://issues.apache.org/jira/browse/ARROW-18309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matthew Topol updated ARROW-18309:
----------------------------------
Component/s: Parquet
> [Go] delta_bit_packing Decode may panic
> ---------------------------------------
>
> Key: ARROW-18309
> URL: https://issues.apache.org/jira/browse/ARROW-18309
> Project: Apache Arrow
> Issue Type: Bug
> Components: Go, Parquet
> Affects Versions: 9.0.0
> Environment: all release version
> Reporter: jun wang
> Assignee: Matthew Topol
> Priority: Major
> Labels: pull-request-available
> Fix For: 11.0.0
>
> Attachments: @timestamp.data
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> [https://github.com/apache/arrow/blob/master/go/parquet/internal/encoding/delta_bit_packing.go]
> The DeltaBitPackInt32 and DeltaBitPackInt64 Decode method did not use
> d.nvals subtract decoded number at end, which lead streaming decode panic.
> Also, when copy the decoded value to out, the end value should be
> shared_utils.MinInt(int(d.valsPerMini), start + len(out))
> When encode 68610 timestamp data, and decode 1024 value a batch, we encounter
> the panic
--
This message was sent by Atlassian Jira
(v8.20.10#820010)