zhishun luan created SPARK-34669:
------------------------------------
Summary: Spark SQL uses the function[ length()] to return the
length of the string rather than the length of the character
Key: SPARK-34669
URL: https://issues.apache.org/jira/browse/SPARK-34669
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 3.1.1
Environment: spark 3.1.1,scala 2.12
Reporter: zhishun luan
Such as the title.
For the function length (). MySQL and other relational databases, we get the
byte length, but spark SQL gets the string length. For these two cases, please
provide a new function to get the byte length, otherwise it is easy to mislead
users
----------------------------------------------------------------------------------------
{code:java}
// code placeholder
SparkConf conf = new SparkConf();
conf.setMaster("local");
SparkSession session = SparkSession.builder().config(conf).getOrCreate();
session.sql("select length('我a')").show();
{code}
[result]
+-----------+
|length(测a)|
+-----------+
| 2|
+-----------+
in mysql
+-----------+
|length(测a)|
+-----------+
| 4|
+-----------+
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]