These results are fairly old. Both Hive and Pig have changed
significantly since then. Many of Hive's internal operators have been
rewritten. Both can now run over Tez or Spark. So at this point it's
hard to say which is better. The best option is to develop some tests
based on your use case and try both to see which works best for you.
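As a rough sketch of what such a test could look like, the snippet below times the same join written once for each engine and reports the median wall-clock time. The script names join_test.pig and join_test.hql are hypothetical placeholders for your own workload, and it assumes the pig and hive command-line clients are on the PATH. Wall-clock time on a shared cluster is noisy, so treat the numbers as indicative only.

```python
import statistics
import subprocess
import time


def time_command(cmd, runs=3):
    """Run cmd `runs` times and return the median wall-clock time in seconds."""
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        # check=True raises CalledProcessError if the job fails,
        # so a failed run is never counted as a fast one.
        subprocess.run(
            cmd,
            check=True,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
        timings.append(time.perf_counter() - start)
    return statistics.median(timings)


if __name__ == "__main__":
    # Hypothetical script names: each file should contain the same join
    # over the same input data, written for the respective engine.
    candidates = {
        "pig": ["pig", "-f", "join_test.pig"],
        "hive": ["hive", "-f", "join_test.hql"],
    }
    for name, cmd in candidates.items():
        print(f"{name}: {time_command(cmd):.1f}s median")
```

Running each script several times and comparing medians reduces the effect of a single slow run; it is also worth repeating the comparison on each execution engine you plan to use.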
Alan.
Ritesh Kumar Singh <mailto:riteshoneinamill...@gmail.com>
June 2, 2015 at 21:06
Check this paper:
Pig-Hive benchmarks
<http://www.ibm.com/developerworks/library/ba-pigvhive/pighivebenchmarking.pdf>
It discusses these things in detail.
Cheers,
Ritesh
sreejesh s <mailto:sreejesh...@yahoo.com>
June 1, 2015 at 23:02
Hi,
I have read a couple of articles that say Pig joins perform better
than Hive joins. Is that true? If so, could you please explain the
reason?
Thanks