Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/20495#discussion_r165855729 --- Diff: python/pyspark/sql/functions.py --- @@ -1705,10 +1705,12 @@ def unhex(col): @ignore_unicode_prefix @since(1.5) def length(col): - """Calculates the length of a string or binary expression. + """Computes the character length of a given string or number of bytes or a binary string. + The length of character strings include the trailing spaces. The length of binary strings --- End diff -- The reason is `LEN` in MS SQL Server excludes trailing blanks. : ) Yeah. This PR also can updates it in R side too.
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org