zhishun luan created SPARK-34669:
------------------------------------

             Summary: Spark SQL uses the function[ length()] to return the 
length of the string rather than the length of the character
                 Key: SPARK-34669
                 URL: https://issues.apache.org/jira/browse/SPARK-34669
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 3.1.1
         Environment: spark 3.1.1,scala 2.12
            Reporter: zhishun luan


Such as the title.

For the function length (). MySQL and other relational databases, we get the 
byte length, but spark SQL gets the string length. For these two cases, please 
provide a new function to get the byte length, otherwise it is easy to mislead 
users

----------------------------------------------------------------------------------------
{code:java}
// code placeholder
SparkConf conf = new SparkConf();
conf.setMaster("local");
SparkSession session = SparkSession.builder().config(conf).getOrCreate();
session.sql("select length('我a')").show();

{code}
 

[result]

+-----------+
|length(测a)|
+-----------+
| 2|
+-----------+

in mysql 

+-----------+
|length(测a)|
+-----------+
| 4|
+-----------+

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to