vitaliili-db commented on PR #37631:
URL: https://github.com/apache/spark/pull/37631#issuecomment-1228950512
@cloud-fan trailing empty string is not actually a bug, it has consistent
behavior in all systems, e.g. `split("aaAbbAccA", "A")` gives same result in
all systems => `['aa', 'bb', 'cc', '']`. So were are pretty consistent here.
The only difference in absence of `limit` parameter (e.g. for migration
purposes) is how empty delimiter/regex behaves. As mentioned we might choose
either ignore regex (return original string), split and drop trailing empty
string (this PR) or do nothing and return array with trailing empty string.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]