HyukjinKwon commented on PR #38624: URL: https://github.com/apache/spark/pull/38624#issuecomment-1687250743
> I don't see any reason to oppose this My main reason is that I would like to avoid adding a bunch of API to address a couple of corner cases. Ideally we should have a set of API as few as possible that handles 99% cases. That's why `mapInArrow` itself is marked as a developer API. - Once we add this, we should also think about adding Arrow version of other pandas friends, e.g., Arrow UDF - The benefit of doing this isn't very significant considering that most of the cases are already covered by `mapInArrow`, and by converting pandas to Arrow. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
