Yicong-Huang opened a new pull request, #53904:
URL: https://github.com/apache/spark/pull/53904

   ### What changes were proposed in this pull request?
   
   This PR removes unused `timezone` and `assign_cols_by_name` parameters from 
`ArrowStreamArrowUDFSerializer` and related classes.
   
   ### Why are the changes needed?
   
   As noted in the JIRA ticket, `ArrowStreamArrowUDFSerializer` stores 
`timezone` and `assign_cols_by_name` but never uses them. Arrow serializers 
operate directly on Arrow arrays without pandas conversion, so these parameters 
are unnecessary.
   
   This cleanup:
   1. Removes dead code and unnecessary parameters
   2. Simplifies the serializer API
   3. Reduces memory footprint by not storing unused instance variables
   
   ### Does this PR introduce _any_ user-facing change?
   
   No. These are internal implementation details.
   
   ### How was this patch tested?
   
   Existing unit tests should cover this change as the functionality remains 
the same.
   
   ### Was this patch authored or co-authored using generative AI tooling?
   
   No.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to