[ 
https://issues.apache.org/jira/browse/SPARK-33317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17225141#comment-17225141
 ] 

Liu Neng commented on SPARK-33317:
----------------------------------

I run these sql on spark 3.0.0, condition 1 +(between ' 1000405134' and 
'1000772585')+ find 6012 records, condition 2 ++(between '1000405134' and 
'1000772585'++) find 2798 records.

I find that comparator in codegen is UTF8String

!image-2020-11-03-13-30-12-049.png!

" 1000405134"  is smaller than "1000405134" 

I think that it isn't an issue, because comparing value is String not Number. 

I tried to analyze the parse tree, "1000405134"  is a String literal.

> Spark Hive SQL returning empty dataframe
> ----------------------------------------
>
>                 Key: SPARK-33317
>                 URL: https://issues.apache.org/jira/browse/SPARK-33317
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core, Spark Shell
>    Affects Versions: 2.4.6
>            Reporter: Debadutta
>            Priority: Major
>         Attachments: farmers.csv, image-2020-11-03-13-30-12-049.png
>
>
> I am trying to run a sql query on a hive table using hive connector in spark 
> but I am getting an empty dataframe. The query I am trying to run:-
> {{sparkSession.sql("select fmid from farmers where fmid between ' 1000405134' 
> and '1000772585'")}}
> This is failing but if I remove the leading whitespaces it works.
> {{sparkSession.sql("select fmid from farmers where fmid between '1000405134' 
> and '1000772585'")}}
> Currently, I am removing leading and trailing whitespaces as a workaround. 
> But the same query with whitespaces works fine in hive console.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to