Re: [SparkSQL 1.4.0]The result of SUM(xxx) in SparkSQL is 0.0 but not null when the column xxx is all null

2015-07-06 Thread Michael Armbrust
This was a change that was made to match a wrong answer coming from older
versions of Hive.  Unfortunately I think its too late to fix this in the
1.4 branch (as I'd like to avoid changing answers at all in point
releases), but in Spark 1.5 we revert to the correct behavior.

https://issues.apache.org/jira/browse/SPARK-8828

On Thu, Jul 2, 2015 at 11:58 PM, StanZhai m...@zhaishidan.cn wrote:

 Hi all,

 I have a table named test like this:

 |  a  |  b  |
 |  1  | null |
 |  2  | null |

 After upgraded the cluster from spark 1.3.1 to 1.4.0, I found the Sum
 function in spark 1.4 and 1.3 are different.

 The SQL is: select sum(b) from test

 In Spark 1.4.0 the result is 0.0, in spark 1.3.1 the result is null. I
 think
 the result should be null, why the result is 0.0 in 1.4.0 but not null? Is
 this a bug?

 Any hint is appreciated.



 --
 View this message in context:
 http://apache-spark-developers-list.1001551.n3.nabble.com/SparkSQL-1-4-0-The-result-of-SUM-xxx-in-SparkSQL-is-0-0-but-not-null-when-the-column-xxx-is-all-null-tp13008.html
 Sent from the Apache Spark Developers List mailing list archive at
 Nabble.com.

 -
 To unsubscribe, e-mail: dev-unsubscr...@spark.apache.org
 For additional commands, e-mail: dev-h...@spark.apache.org




asf git merge currently not working

2015-07-06 Thread Reynold Xin
FYI there are some problems with ASF's git or ldap infra. As a result, we
cannot merge anything into Spark right now.

An infra ticket has been created:
https://issues.apache.org/jira/browse/INFRA-9932

Please watch/vote on that ticket for progress. Thanks.