Tim Armstrong created SPARK-34909:
-------------------------------------

             Summary: conv() does not convert negative inputs to unsigned 
correctly
                 Key: SPARK-34909
                 URL: https://issues.apache.org/jira/browse/SPARK-34909
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 3.1.0
            Reporter: Tim Armstrong


{noformat}
scala> spark.sql("select conv('-10', 11, 7)").show(20, 150)
+-----------------------+
|       conv(-10, 11, 7)|
+-----------------------+
|4501202152252313413456|
+-----------------------+
scala> spark.sql("select hex(conv('-10', 11, 7))").show(20, 150)
+----------------------------------------------+
|                         hex(conv(-10, 11, 7))|
+----------------------------------------------+
|3435303132303231353232353233313334313334353600|
+----------------------------------------------+
{noformat}

The correct result is 45012021522523134134555. The above output has an 
incorrect second-to-last digit (6 instead of 5) and the last digit is a 
non-printing character the null byte.

I tracked the bug down to NumberConverter.unsignedLongDiv returning incorrect 
results. I tried replacing with java.lang.Long.divideUnsigned and that fixed it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to