Egor Pahomov created SPARK-17616: ------------------------------------ Summary: Getting "java.lang.RuntimeException: Distinct columns cannot exist in Aggregate " Key: SPARK-17616 URL: https://issues.apache.org/jira/browse/SPARK-17616 Project: Spark Issue Type: Bug Components: SQL Affects Versions: 2.0.0 Reporter: Egor Pahomov Priority: Minor
I execute: {code} select platform, collect_set(user_auth) as paid_types, count(distinct sessionid) as sessions from non_hss.session where event = 'stop' and platform != 'testplatform' and not (month = MONTH(current_date()) AND year = YEAR(current_date()) and day = day(current_date())) and ( (month >= MONTH(add_months(CURRENT_DATE(), -5)) AND year = YEAR(add_months(CURRENT_DATE(), -5))) OR (month <= MONTH(add_months(CURRENT_DATE(), -5)) AND year > YEAR(add_months(CURRENT_DATE(), -5))) ) group by platform {code} I get: {code} java.lang.RuntimeException: Distinct columns cannot exist in Aggregate operator containing aggregate functions which don't support partial aggregation. {code} IT WORKED IN 1.6.2. I've read error 5 times, and read code once. I still don't understand what I do incorrectly. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org