zhangfengcdt opened a new pull request, #2359:
URL: https://github.com/apache/sedona/pull/2359

   ## Did you read the Contributor Guide?
   
   - Yes, I have read the [Contributor 
Rules](https://sedona.apache.org/latest/community/rule/) and [Contributor 
Development Guide](https://sedona.apache.org/latest/community/develop/)
   
   ## Is this PR related to a ticket?
   
   - Yes, and the PR name follows the format `[GH-XXX] my subject`. Closes 
#<issue_number>
   
   ## What changes were proposed in this PR?
   
   This PR addresses SPARK-48942, a Spark bug that occurs when reading nested 
geometry arrays from Parquet files with Spark's vectorized reader. The fix 
adds compatibility checks for user-defined types (UDTs) in nested 
structures, plus workaround utilities for schema transformation.
   
   - Adds UDT compatibility checking in Parquet column vector operations to 
handle type mismatches 
   - Implements utility classes for transforming nested GeometryUDT schemas to 
avoid the bug 
   - Provides comprehensive test coverage for various nested geometry scenarios
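   
   To illustrate the schema-transformation idea in the second bullet, here is a 
minimal sketch, not the code from this PR. It assumes the workaround replaces 
each nested UDT with its underlying storage type before reading; the object 
name `UdtSchemaSketch` and the method `toStorageType` are hypothetical. It uses 
only the public `org.apache.spark.sql.types` classes and needs Spark SQL on the 
classpath.

```scala
import org.apache.spark.sql.types._

// Hypothetical helper, for illustration only (not part of this PR).
object UdtSchemaSketch {
  // Recursively replace every UserDefinedType with its underlying SQL type,
  // descending into arrays, maps, and structs. Other types are returned as-is.
  def toStorageType(dt: DataType): DataType = dt match {
    case udt: UserDefinedType[_] =>
      // The storage type may itself be nested, so recurse into it as well.
      toStorageType(udt.sqlType)
    case ArrayType(elementType, containsNull) =>
      ArrayType(toStorageType(elementType), containsNull)
    case MapType(keyType, valueType, valueContainsNull) =>
      MapType(toStorageType(keyType), toStorageType(valueType), valueContainsNull)
    case StructType(fields) =>
      StructType(fields.map(f => f.copy(dataType = toStorageType(f.dataType))))
    case other => other
  }
}
```

   A caller could read the file with the transformed schema and convert the 
storage columns back to geometries afterwards. Whether the utilities in this PR 
work that way is not stated in the description above.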
   
   ## How was this patch tested?
   
   - New tests were added to `geoparquetIOTests.scala`.
   
   ## Did this PR include necessary documentation updates?
   
   - No, this PR does not affect any public API so no need to change the 
documentation.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.