asfimport opened a new issue, #426: URL: https://github.com/apache/parquet-format/issues/426
I brought this issue up some time ago on the mailing list [1]; in short, I would like to add some clarification to the DELTA_BINARY_PACKED section of Encodings.md. The issue is that while the specification does not limit the number of bits that can be used to encode deltas, some readers expect a maximum of 32 bits for INT32 data and 64 bits for INT64 data [2]. I propose adding verbiage to the specification to the effect that using 33 bits to encode INT32 data (or 65 bits for INT64 data) is not recommended, and that readers _should_ be able to read such data but are not required to.

[1] <https://lists.apache.org/thread/2wj88oghc0t6qqj8ojp5p5tf8wg11840>
[2] https://github.com/apache/arrow/issues/20374

**Reporter**: [Edward Seidl](https://issues.apache.org/jira/secure/ViewProfile.jspa?name=etseidl) / @etseidl

#### PRs and other links:
- [GitHub Pull Request #231](https://github.com/apache/parquet-format/pull/231)

<sub>**Note**: *This issue was originally created as [PARQUET-2435](https://issues.apache.org/jira/browse/PARQUET-2435). Please see the [migration documentation](https://issues.apache.org/jira/browse/PARQUET-2502) for further details.*</sub>
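
To illustrate the bit-width concern raised in this issue, here is a minimal Python sketch of how a writer could end up needing 33 bits for INT32 data. It is not taken from any Parquet implementation: the helper names (`wrap_int32`, `miniblock_bit_width`, `decode_wrapped`) are hypothetical, and the sketch assumes a single miniblock whose stored values are `delta - min_delta`. It compares a writer that computes deltas with unbounded integers against one that wraps every subtraction to 32-bit two's complement.

```python
INT32_MIN, INT32_MAX = -2**31, 2**31 - 1


def wrap_int32(x: int) -> int:
    """Reduce x modulo 2**32 and reinterpret it as a signed 32-bit value."""
    return (x + 2**31) % 2**32 - 2**31


def miniblock_bit_width(values, wrap: bool) -> int:
    """Bits needed to store (delta - min_delta) for one run of values.

    Hypothetical helper: wrap=True emulates 32-bit wrapping arithmetic,
    wrap=False uses Python's unbounded integers.
    """
    fix = wrap_int32 if wrap else (lambda x: x)
    deltas = [fix(b - a) for a, b in zip(values, values[1:])]
    min_delta = min(deltas)
    adjusted = [fix(d - min_delta) for d in deltas]
    if wrap:
        # The stored values are bit-packed as unsigned 32-bit quantities.
        adjusted = [a % 2**32 for a in adjusted]
    return max(adjusted).bit_length()


def decode_wrapped(first, min_delta, adjusted):
    """Rebuild the values using wrapping 32-bit additions."""
    out = [first]
    for a in adjusted:
        out.append(wrap_int32(out[-1] + wrap_int32(min_delta + a)))
    return out


# Extreme swings between the smallest and largest INT32 values.
values = [INT32_MIN, INT32_MAX, INT32_MIN]

print(miniblock_bit_width(values, wrap=False))  # 33
print(miniblock_bit_width(values, wrap=True))   # 2

# With wrapping, the deltas are [-1, 1], so min_delta = -1 and the
# stored values are [0, 2]; decoding still restores the original data.
print(decode_wrapped(INT32_MIN, -1, [0, 2]) == values)  # True
```

In this sketch the unbounded computation yields a 33-bit miniblock, which is the kind of data some readers reject, while the wrapped computation stays within 32 bits and still round-trips.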
