[GitHub] [spark] mkaravel commented on a change in pull request #34154: [SPARK-37047][SQL] Add lpad and rpad functions for binary strings

GitBox Wed, 20 Oct 2021 08:47:23 -0700


mkaravel commented on a change in pull request #34154:
URL: https://github.com/apache/spark/pull/34154#discussion_r732916603




##########
File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala
##########
@@ -1328,14 +1328,31 @@ case class StringLocate(substr: Expression, str: Expression, start: Expression)
 
 }
 
+/**
+ * Helper class for implementing StringLPad and StringRPad.
+ * Returns the default expression to be used in StringLPad or StringRPad based on the type of
+ * the input expression.
+ * For character string expressions the default padding expression is the string literal ' '.
+ * For byte sequence expressions the default padding expression is the byte literal 0x00.
+ */
+object StringPadDefaultValue {
+  def get(str: Expression): Expression = {
+    str.dataType match {
+      case StringType => Literal(" ")
+      case BinaryType => Literal(Array[Byte](0x00))

Review comment:
       Yes, it is: `0x00` is the hex representation of a single byte whose value is `0`.
   I think both choices are equally good. I ended up opting for the even simpler `Array[Byte](0)`.
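
   To illustrate the equivalence, here is a minimal standalone sketch (not part of the PR; the object name `PadByteDemo` and its fields are hypothetical and have no Spark dependency). It assumes only standard Scala behavior: an integer literal that fits in a `Byte` is narrowed to `Byte` when a `Byte` is expected, regardless of whether it is written in hex or decimal.

```scala
// Hypothetical demo object, not part of Spark or the PR under review.
object PadByteDemo {
  // Hex spelling of the zero byte, as in the original diff line.
  val hexPad: Array[Byte] = Array[Byte](0x00)

  // Decimal spelling of the same zero byte, as chosen in the review reply.
  val decPad: Array[Byte] = Array[Byte](0)

  def main(args: Array[String]): Unit = {
    // Arrays compare by reference with ==, so compare contents explicitly.
    println(hexPad.sameElements(decPad)) // prints "true"
  }
}
```

   Both spellings produce a one-element array holding the byte `0`, so the choice between them is purely stylistic.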




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


