jorisvandenbossche commented on PR #240:
URL: https://github.com/apache/parquet-format/pull/240#issuecomment-2122876371

   Just to ensure my understanding is correct:
   
   - This is proposing to add a new _logical type_ annotating the BYTE_ARRAY 
physical type. For readers that expect just such a BYTE_ARRAY column (e.g. 
existing GeoParquet implementations), that is compatible if the column would 
start having a logical type as well? (although I assume this might depend on 
how the specific parquet reader implementation deals with an unknown logical 
type, i.e. error about that or automatically fall back to the physical type).
   - For such "legacy" readers (just reading the WKB values from a binary 
column), the only thing that _actually_ changes (apart from the logical type 
annotation) are the values of the statistics? Now, I assume that right now no 
GeoParquet reader is using the statistics of the binary column, because the 
physical statistics for BYTE_ARRAY ("unsigned byte-wise comparison") are 
essentially useless in the case those binary blobs represent WKB geometries. So 
again that should probably not give any compatibility issues?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to