These results are fairly old. Both Hive and Pig have changed significantly since then. Many of Hive's internal operators have been rewritten. Both can now run over Tez or Spark. So at this point it's hard to say which is better. The best option is to develop some tests based on your use case and try both to see which works best for you.

Alan.

Ritesh Kumar Singh <mailto:riteshoneinamill...@gmail.com>
June 2, 2015 at 21:06
Check this paper:
Pig-Hive benchmarks <http://www.ibm.com/developerworks/library/ba-pigvhive/pighivebenchmarking.pdf>

It discusses these things in detail.

Cheers,
Ritesh


sreejesh s <mailto:sreejesh...@yahoo.com>
June 1, 2015 at 23:02
Hi,

I have read a couple of articles that say Pig joins perform better than Hive joins. Is that true? If yes, could you please explain the reason?

Thanks

