Hi all,
I have a question that has been bugging me for more than a week.
I'm doing some computation across 3 tables in HBase, where the 1st table has
around 25 million rows, the 2nd table 5 million rows, and the 3rd table 1 million rows.

My Java MR job takes hours to execute, but a Pig script does the same task
in under an hour. FYI, this is on a 6-node cluster.

Can anybody tell me why the Java MR application is slower than the Pig script?

-- 
Regards-
Pavan
