Github user felixcheung commented on a diff in the pull request:
https://github.com/apache/spark/pull/20495#discussion_r165854982
--- Diff: python/pyspark/sql/functions.py ---
@@ -1705,10 +1705,12 @@ def unhex(col):
@ignore_unicode_prefix
@since(1.5)
def length(col):
- """Calculates the length of a string or binary expression.
+ """Computes the character length of a given string or number of bytes
or a binary string.
+ The length of character strings include the trailing spaces. The
length of binary strings
--- End diff --
as a side note, why is it calling out trailing spaces? what about leading
spaces? isn't all spaces factored into the character length?
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]