qlong commented on PR #53458: URL: https://github.com/apache/spark/pull/53458#issuecomment-3656226846
@rluvaton I was a bit surprised the issue was discovered only recently. If this PR is approved, the community can decide if it needs be backported to previous versions. With regard to the behavior of returning null in the presence of invalid utf-8 bytes, I think it is more aligned with what other engines do. Redshift has a separate function that allows users to specify the replacement byte. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
