[ https://issues.apache.org/jira/browse/HIVE-647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12732775#action_12732775 ]
Namit Jain commented on HIVE-647:
---------------------------------

Are you sure - I just tried the same query and it works for me - are you using trunk?

Also, can you look at the plan file and search for numReducers (the plan file for the second job)
The plan file can be found by: hive.exec.plan from the tracker

> SORT BY with GROUP ignored without LIMIT
> ----------------------------------------
>
>                 Key: HIVE-647
>                 URL: https://issues.apache.org/jira/browse/HIVE-647
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Bill Graham
>
> For queries with GROUP BY and SORT BY, the sort is not handled properly when
> a LIMIT is not supplied. If I run the following two queries, the first
> returns properly sorted results. The second does not.
> SELECT user, SUM(numRequests) AS num FROM MyTable GROUP BY user SORT BY num
> DESC LIMIT 50;
> SELECT user, SUM(numRequests) AS num FROM MyTable GROUP BY user SORT BY num
> DESC;
> Explain is different for the two queries as well. The first uses 3 M/R jobs
> and the second only uses 2, which might be part of the problem.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.