jorisvandenbossche commented on PR #240:
URL: https://github.com/apache/parquet-format/pull/240#issuecomment-2122876371
Just to ensure my understanding is correct:
- This is proposing to add a new _logical type_ annotating the BYTE_ARRAY
physical type. For readers that expect just such a BYTE_ARRAY column (e.g.
existing GeoParquet implementations), that is compatible if the column would
start having a logical type as well? (although I assume this might depend on
how the specific parquet reader implementation deals with an unknown logical
type, i.e. error about that or automatically fall back to the physical type).
- For such "legacy" readers (just reading the WKB values from a binary
column), the only thing that _actually_ changes (apart from the logical type
annotation) are the values of the statistics? Now, I assume that right now no
GeoParquet reader is using the statistics of the binary column, because the
physical statistics for BYTE_ARRAY ("unsigned byte-wise comparison") are
essentially useless in the case those binary blobs represent WKB geometries. So
again that should probably not give any compatibility issues?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]