Hello
Can we use Distributed Cache to store intermediate results after the Map
Phase so that these can be used in Reduce phase from cache.
So as to improve performance of Map-Reduce Job.

I found a Paper regarding usage of Cache in Map-Reduce,
http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=5395321&url=http%3A%2F%2Fieeexplore.ieee.org%2Fiel5%2F5394475%2F5394991%2F05395321.pdf%3Farnumber%3D5395321

if Hadoop Map-Reduce can be improved with Cache then ultimately Pig script
running in Map-Reduce can be improved.

Thanks
Kapil

Reply via email to