[ 
https://issues.apache.org/jira/browse/SPARK-22771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xiao Li resolved SPARK-22771.
-----------------------------
       Resolution: Fixed
         Assignee: Takeshi Yamamuro
    Fix Version/s: 2.3.0

> SQL concat for binary 
> ----------------------
>
>                 Key: SPARK-22771
>                 URL: https://issues.apache.org/jira/browse/SPARK-22771
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 2.2.1
>            Reporter: Fernando Pereira
>            Assignee: Takeshi Yamamuro
>            Priority: Minor
>             Fix For: 2.3.0
>
>
> spark.sql {{concat}}  function automatically casts arguments to StringType 
> and returns a String.
> This might be the behavior of traditional databases, however in Spark there's 
> Binary as a standard type, and concat'ing binary seems reasonable if it 
> returns another binary sequence.
> Taking the example of, e.g. Python where both {{bytes}} and {{unicode}} 
> represent text, by concat'ing both we end up with the same type as the 
> arguments, and in case they are intermixed (str + unicode) the most generic 
> type is returned (unicode).
> Following the same principle, I believe that when concat'ing binary it would 
> make sense to return a binary. 
> In terms of Spark behavior, it would affect only the case when all arguments 
> are binary. All other cases should remain unchanged.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to