Do you need to get all records in the order? In most of our use cases users are only interested in the top 100 or something. If you do limit 100 together with order by, it will be much faster.

Sent from my iPhone

On May 12, 2010, at 1:54 PM, [email protected] wrote:

Thanks, Ted.
If I have very big data to sort, only 1 reduce task will have performance issue.
Do hive have some skill to optimize it?
I have observe that the reduce task is very slow in my job.


你的1G网络U盘真好用!
查薪酬:对比同行工资!

Reply via email to