Kristin Cowalcijk created SEDONA-222:
----------------------------------------
Summary: GeoParquet reader does not work in non-local mode
Key: SEDONA-222
URL: https://issues.apache.org/jira/browse/SEDONA-222
Project: Apache Sedona
Issue Type: Bug
Affects Versions: 1.3.0, 1.3.1
Reporter: Kristin Cowalcijk
GeoParquet reader works fine when loading one of the [example parquet
files|https://github.com/apache/incubator-sedona/tree/sedona-1.3.1-incubating/core/src/test/resources/geoparquet]
in local mode:
{code:scala}
scala>
spark.read.format("geoparquet").load("/path/to/example1.parquet").printSchema
root
|-- pop_est: long (nullable = true)
|-- continent: string (nullable = true)
|-- name: string (nullable = true)
|-- iso_a3: string (nullable = true)
|-- gdp_md_est: double (nullable = true)
|-- geometry: geometry (nullable = true)
{code}
When running the same code in standalone cluster mode, the type of the geometry
column is {{binary}} instead of {{{}geometry{}}}:
{code:scala}
scala>
spark.read.format("geoparquet").load("/path/to/example1.parquet").printSchema
root
|-- pop_est: long (nullable = true)
|-- continent: string (nullable = true)
|-- name: string (nullable = true)
|-- iso_a3: string (nullable = true)
|-- gdp_md_est: double (nullable = true)
|-- geometry: binary (nullable = true)
{code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)