Kristin Cowalcijk created SEDONA-222:
----------------------------------------

             Summary: GeoParquet reader does not work in non-local mode
                 Key: SEDONA-222
                 URL: https://issues.apache.org/jira/browse/SEDONA-222
             Project: Apache Sedona
          Issue Type: Bug
    Affects Versions: 1.3.0, 1.3.1
            Reporter: Kristin Cowalcijk


GeoParquet reader works fine when loading one of the [example parquet 
files|https://github.com/apache/incubator-sedona/tree/sedona-1.3.1-incubating/core/src/test/resources/geoparquet]
 in local mode:
{code:scala}
scala> 
spark.read.format("geoparquet").load("/path/to/example1.parquet").printSchema
root                                                                            
 |-- pop_est: long (nullable = true)
 |-- continent: string (nullable = true)
 |-- name: string (nullable = true)
 |-- iso_a3: string (nullable = true)
 |-- gdp_md_est: double (nullable = true)
 |-- geometry: geometry (nullable = true)
{code}
When running the same code in standalone cluster mode, the type of the geometry 
column is {{binary}} instead of {{{}geometry{}}}:
{code:scala}
scala> 
spark.read.format("geoparquet").load("/path/to/example1.parquet").printSchema
root                                                                            
 |-- pop_est: long (nullable = true)
 |-- continent: string (nullable = true)
 |-- name: string (nullable = true)
 |-- iso_a3: string (nullable = true)
 |-- gdp_md_est: double (nullable = true)
 |-- geometry: binary (nullable = true)
{code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to