sebbegg commented on issue #1031:
URL: https://github.com/apache/sedona/issues/1031#issuecomment-1736812643
So we gave it a try - without success.
Even when using `spark.read.format('parquet')`, Spark tries to read
the geometry column as the UDT, since that information is encoded in the
parquet file's metadata.
So with plain parquet as well, Spark tries to convert the parquet
schema/types to the Spark schema when reading the file, leading to the
errors in the stacktrace.
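To make the "encoded in the metadata" part more concrete, here is a minimal sketch of how one could spot UDT columns in the schema JSON that Spark stores in the parquet footer. It rests on assumptions I have not checked against this particular file: that the footer key is `org.apache.spark.sql.parquet.row.metadata`, that UDT fields are serialized as objects with `"type": "udt"`, and that the class name is the one shown. The helper `udt_columns_from_footer` and the sample footer are made up for illustration.

```python
import json

# Assumed footer key under which Spark stores its schema as JSON.
SPARK_ROW_METADATA_KEY = "org.apache.spark.sql.parquet.row.metadata"


def udt_columns_from_footer(footer_metadata):
    """Return {column name: UDT class name} for top-level UDT fields.

    `footer_metadata` is a plain dict of the parquet key/value metadata
    (hypothetical helper, not part of Spark or Sedona).
    """
    schema_json = footer_metadata.get(SPARK_ROW_METADATA_KEY)
    if schema_json is None:
        return {}  # file was not written by Spark, or the key differs
    schema = json.loads(schema_json)
    udt_columns = {}
    for field in schema.get("fields", []):
        field_type = field.get("type")
        # Primitive types are plain strings; UDTs are nested objects.
        if isinstance(field_type, dict) and field_type.get("type") == "udt":
            udt_columns[field["name"]] = field_type.get("class")
    return udt_columns


# Hand-written sample resembling such a footer; not taken from the real file.
sample_footer = {
    SPARK_ROW_METADATA_KEY: json.dumps({
        "type": "struct",
        "fields": [
            {"name": "id", "type": "long", "nullable": True, "metadata": {}},
            {
                "name": "gps_dr_position",
                "type": {
                    "type": "udt",
                    "class": "org.apache.spark.sql.sedona_sql.UDT.GeometryUDT",
                    "sqlType": "binary",
                },
                "nullable": True,
                "metadata": {},
            },
        ],
    })
}

udt_columns = udt_columns_from_footer(sample_footer)
print(udt_columns)
```

If a column shows up here, Spark will presumably try to instantiate that UDT class on read, regardless of which reader format was requested.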
One side observation, not really relevant to the issue: the `GeometryUDT`
does not override the `sql` and `catalogString` properties, leading to
hard-to-understand error messages like this:
```
AnalysisException: [DATATYPE_MISMATCH.UNEXPECTED_INPUT_TYPE] Cannot resolve
"st_geomfromwkb(gps_dr_position)" due to data type mismatch: parameter 1
requires ("STRING" or "BINARY") type, however, "gps_dr_position" is of "BINARY"
type.; line 1 pos 0
```
This happens because `gps_dr_position` is already of the geometry type, but the error message still reports it as "BINARY".
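A toy model of why the message ends up this confusing, as far as I understand it. The classes below are NOT Spark's real classes; they only mimic the assumed behaviour that a UDT without its own `sql` name falls back to the name of its underlying storage type. `NamedGeometryUDT` and the `"GEOMETRY"` label are hypothetical and only show what an override would change.

```python
class DataType:
    """Toy stand-in for a SQL data type (not Spark's real class)."""

    def sql(self):
        # BinaryType -> "BINARY", StringType -> "STRING"
        return type(self).__name__.replace("Type", "").upper()


class BinaryType(DataType):
    pass


class StringType(DataType):
    pass


class UserDefinedType(DataType):
    """Toy UDT: by default it reports the name of its storage type."""

    def sql_type(self):
        raise NotImplementedError

    def sql(self):
        return self.sql_type().sql()


class GeometryUDT(UserDefinedType):
    """Geometry stored as binary, without a name of its own."""

    def sql_type(self):
        return BinaryType()


class NamedGeometryUDT(GeometryUDT):
    """Hypothetical variant that overrides the displayed name."""

    def sql(self):
        return "GEOMETRY"


def mismatch_message(column, accepted, actual):
    """Build a message shaped like the one in the stacktrace."""
    wanted = " or ".join('"%s"' % t.sql() for t in accepted)
    return 'requires (%s) type, however, "%s" is of "%s" type.' % (
        wanted, column, actual.sql())


accepted_types = [StringType(), BinaryType()]
default_message = mismatch_message("gps_dr_position", accepted_types, GeometryUDT())
named_message = mismatch_message("gps_dr_position", accepted_types, NamedGeometryUDT())
print(default_message)
print(named_message)
```

With the fallback, the message says that "BINARY" is required and that "BINARY" was found, which is the contradiction seen above. With a distinct name, the mismatch would at least be readable.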