uros-db commented on PR #46924:
URL: https://github.com/apache/spark/pull/46924#issuecomment-2157624619

   of course, I'll update the PR description with more details soon
   
   but shortly: as part of the collation effort in Spark, we've moved away from 
byte-by-byte logic towards code point per code point logic, so what we used to 
call UTF8_BINARY_LCASE is now UTF8_LCASE, as this describes more precisely what 
is going on
   
   here's a couple PRs regarding these changes:
   https://github.com/apache/spark/pull/46700
   https://github.com/apache/spark/pull/46761
   https://github.com/apache/spark/pull/46762
   https://github.com/apache/spark/pull/46682
   https://github.com/apache/spark/pull/46589
   https://github.com/apache/spark/pull/46682
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to