Josh Rosen created SPARK-27839:
----------------------------------
Summary: Improve UTF8String.replace() / StringReplace performance
Key: SPARK-27839
URL: https://issues.apache.org/jira/browse/SPARK-27839
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 2.4.0
Reporter: Josh Rosen
Assignee: Josh Rosen
The UTF8String.replace() function and the StringReplace expression are missing
a few common-case optimizations: avoiding a copy when the replacement does not
change the string, and avoiding redundant copying / decoding of the search and
replacement strings when they are constants.
I think there's room to significantly improve performance here, especially for
single-character replacements.
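As a rough illustration of the fast paths described above, here is a minimal sketch over raw UTF-8 byte arrays. It is not Spark's UTF8String code: the class name ReplaceSketch and its methods are hypothetical, and it assumes all inputs are valid UTF-8, so that byte-wise matching never splits a multi-byte character.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

/** Hypothetical sketch of replace() fast paths; not the Spark implementation. */
public final class ReplaceSketch {
    private ReplaceSketch() {}

    /** Index of the first occurrence of needle in haystack at or after from, or -1. */
    public static int indexOf(byte[] haystack, byte[] needle, int from) {
        outer:
        for (int i = from; i <= haystack.length - needle.length; i++) {
            for (int j = 0; j < needle.length; j++) {
                if (haystack[i + j] != needle[j]) {
                    continue outer;
                }
            }
            return i;
        }
        return -1;
    }

    /** Replaces every occurrence of search in input with replacement. */
    public static byte[] replace(byte[] input, byte[] search, byte[] replacement) {
        if (search.length == 0) {
            return input; // assumption: an empty search string is a no-op
        }
        int first = indexOf(input, search, 0);
        if (first < 0) {
            return input; // fast path 1: nothing to replace, so no copy at all
        }
        if (search.length == 1 && replacement.length == 1) {
            // Fast path 2: single-byte swap, output has the same length as input.
            byte[] out = Arrays.copyOf(input, input.length);
            for (int i = first; i < out.length; i++) {
                if (out[i] == search[0]) {
                    out[i] = replacement[0];
                }
            }
            return out;
        }
        // General path: copy the unmatched runs and splice in the replacement.
        ByteArrayOutputStream out = new ByteArrayOutputStream(input.length);
        int start = 0;
        int pos = first;
        while (pos >= 0) {
            out.write(input, start, pos - start);
            out.write(replacement, 0, replacement.length);
            start = pos + search.length;
            pos = indexOf(input, search, start);
        }
        out.write(input, start, input.length - start);
        return out.toByteArray();
    }

    public static void main(String[] args) {
        // Constant search / replacement strings are encoded once, outside any
        // per-row loop, instead of being decoded again for every input value.
        byte[] search = ".".getBytes(StandardCharsets.UTF_8);
        byte[] replacement = "-".getBytes(StandardCharsets.UTF_8);
        for (String row : new String[] {"a.b.c", "no dots here"}) {
            byte[] result = replace(row.getBytes(StandardCharsets.UTF_8), search, replacement);
            System.out.println(new String(result, StandardCharsets.UTF_8));
        }
    }
}
```

When the search string does not occur, the sketch returns the input array itself, which is only safe if callers treat the bytes as immutable.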