mapleFU commented on issue #14923: URL: https://github.com/apache/arrow/issues/14923#issuecomment-1367033261
> @mapleFU I am worried about this part of [the spec](https://github.com/apache/parquet-format/blob/master/Encodings.md#delta-encoding-delta_binary_packed--5): > I'm not sure how other encoder authors would interpret this, but I expected that uninitialized `bit_width_data[i]` would produce a random `int` that wouldn't cause problems roundtripping. My interpretation is probably wrong. Maybe we should propose to make the spec clearer or stricter? Hi, I think `the arbitrary value` means the bit_width could be 1, 2... But once the bit width is not zero, the miniblock should have corresbond bits. And I met the exceptions because: * The `bit_width` is greater than 32 I didn't meet the case that it reaches EOF, because the implemention already handle this cases. @rok -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
