Using Distributed Cache in PIG

kapil bhosale Mon, 13 Aug 2012 10:47:56 -0700

Hello
Can we use Distributed Cache to store intermediate results after the Map
Phase so that these can be used in Reduce phase from cache.
So as to improve performance of Map-Reduce Job.


I found a Paper regarding usage of Cache in Map-Reduce,
http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=5395321&url=http%3A%2F%2Fieeexplore.ieee.org%2Fiel5%2F5394475%2F5394991%2F05395321.pdf%3Farnumber%3D5395321

if Hadoop Map-Reduce can be improved with Cache then ultimately Pig script
running in Map-Reduce can be improved.

Thanks
Kapil

Using Distributed Cache in PIG

Reply via email to