Josh Rosen created SPARK-27841:
----------------------------------

             Summary: Improve UTF8String fromString()/toString()/numChars() 
performance when strings are ASCII
                 Key: SPARK-27841
                 URL: https://issues.apache.org/jira/browse/SPARK-27841
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 2.4.0
            Reporter: Josh Rosen
            Assignee: Josh Rosen


UTF8String's fromString(), toString(), and numChars() methods are performance 
hotspots. For strings that consist entirely of ASCII characters, we can 
optimize these methods to significantly reduce memory allocation and copying, 
greatly improving performance for many common workloads.
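
As a rough illustration of why the ASCII case is cheaper, here is a minimal 
sketch in Java. It is not the change proposed in this issue and does not use 
Spark's UTF8String class; the class AsciiFastPathSketch and its methods 
isAscii(), numChars(), and decode() are hypothetical names operating on a 
plain byte[]. The idea is only that when every byte is below 0x80, the 
character count equals the byte count and decoding can skip multi-byte 
handling. Whether the decode fast path is actually faster depends on the JVM 
and would need to be benchmarked.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, for illustration only; not part of Spark.
final class AsciiFastPathSketch {

    // True if every byte has its high bit clear, i.e. the data is pure ASCII.
    static boolean isAscii(byte[] utf8) {
        for (byte b : utf8) {
            if (b < 0) {
                return false; // high bit set: part of a multi-byte sequence
            }
        }
        return true;
    }

    // Number of code points in a valid UTF-8 byte sequence.
    static int numChars(byte[] utf8) {
        if (isAscii(utf8)) {
            return utf8.length; // fast path: one byte per character
        }
        int count = 0;
        for (byte b : utf8) {
            if ((b & 0xC0) != 0x80) {
                count++; // count every byte that is not a continuation byte
            }
        }
        return count;
    }

    // Decodes UTF-8 bytes to a String, using a single-byte charset for ASCII.
    static String decode(byte[] utf8) {
        if (isAscii(utf8)) {
            // ASCII bytes map one-to-one to chars under ISO-8859-1.
            return new String(utf8, StandardCharsets.ISO_8859_1);
        }
        return new String(utf8, StandardCharsets.UTF_8);
    }
}
```

In this sketch isAscii() scans the bytes on every call. A real implementation 
would presumably want to avoid repeating that scan, but how to do so is a 
design question this example does not address.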



