[GitHub] [sedona] sebbegg commented on issue #1031: Errors reading data written with sedona 1.3.1

via GitHub Tue, 26 Sep 2023 23:58:37 -0700


sebbegg commented on issue #1031:
URL: https://github.com/apache/sedona/issues/1031#issuecomment-1736812643


   So we gave it a try - without success.
   
   Also, when using `spark.read.format('parquet')`, Spark still tries to read 
the geometry column as the UDT, since that information is encoded in the 
parquet file's metadata.
   So even with plain parquet, Spark tries to convert the parquet schema/types 
to the Spark schema when reading the file, leading to the errors in the stacktrace.
   
   One side observation, not really relevant here: `GeometryUDT` does not 
override the `sql` and `catalogString` properties, leading to 
hard-to-understand error messages like this:
   
   ```
   AnalysisException: [DATATYPE_MISMATCH.UNEXPECTED_INPUT_TYPE] Cannot resolve 
"st_geomfromwkb(gps_dr_position)" due to data type mismatch: parameter 1 
requires ("STRING" or "BINARY") type, however, "gps_dr_position" is of "BINARY" 
type.; line 1 pos 0
   ```
   The above happens because `gps_dr_position` is already of the geometry type, 
which the message reports as "BINARY".
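
   To illustrate why the message is confusing, here is a toy Python model. It 
is not Spark or Sedona code: every class and function name below is a 
hypothetical stand-in, and it assumes that a UDT which does not override its 
display name falls back to the name of its underlying storage type.

```python
# Toy model only: hypothetical stand-ins, not Spark or Sedona classes.

class BinaryType:
    """Stand-in for a storage type whose display name is BINARY."""
    sql = "BINARY"


class UserDefinedType:
    """Stand-in UDT base class that delegates its display name to the storage type."""
    storage_type = BinaryType

    @property
    def sql(self):
        # Assumed default: report the storage type's name.
        return self.storage_type.sql


class GeometryUDT(UserDefinedType):
    """Inherits the default, so it is displayed as BINARY."""


class NamedGeometryUDT(UserDefinedType):
    """Hypothetical variant that overrides the display name."""

    @property
    def sql(self):
        return "GEOMETRY"


def mismatch_message(column, udt, accepted=("STRING", "BINARY")):
    """Build a message shaped like the one quoted above."""
    wanted = " or ".join(f'"{name}"' for name in accepted)
    return (
        f"parameter 1 requires ({wanted}) type, "
        f'however, "{column}" is of "{udt.sql}" type.'
    )


print(mismatch_message("gps_dr_position", GeometryUDT()))
print(mismatch_message("gps_dr_position", NamedGeometryUDT()))
```

   In this sketch the first message claims the column is "BINARY" even though 
"BINARY" is listed as accepted, while the variant with an overridden name 
makes the actual mismatch visible.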


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
