Hi all, I have a question which is bugging me for more than a week. I'm doing some computation across 3 tables in hbase where 1 table is around 25m rows, 2nd table 5m rows and 3rd table 1m rows..
My Java MR job takes a long time to execute.. (in terms of hours) but a pig script does the same task in under an hour.. A 6 node cluster FYI Can anybody tell me why Java MR application is slower than a pig script? -- Regards- Pavan
