Tim Armstrong created SPARK-34909:
-------------------------------------
Summary: conv() does not convert negative inputs to unsigned
correctly
Key: SPARK-34909
URL: https://issues.apache.org/jira/browse/SPARK-34909
Project: Spark
Issue Type: Bug
Components: SQL
Affects Versions: 3.1.0
Reporter: Tim Armstrong
{noformat}
scala> spark.sql("select conv('-10', 11, 7)").show(20, 150)
+-----------------------+
| conv(-10, 11, 7)|
+-----------------------+
|4501202152252313413456|
+-----------------------+
scala> spark.sql("select hex(conv('-10', 11, 7))").show(20, 150)
+----------------------------------------------+
| hex(conv(-10, 11, 7))|
+----------------------------------------------+
|3435303132303231353232353233313334313334353600|
+----------------------------------------------+
{noformat}
The correct result is 45012021522523134134555. The above output has an
incorrect second-to-last digit (6 instead of 5) and the last digit is a
non-printing character the null byte.
I tracked the bug down to NumberConverter.unsignedLongDiv returning incorrect
results. I tried replacing with java.lang.Long.divideUnsigned and that fixed it.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]