Hello,

Can we use the Distributed Cache to store the intermediate results produced by the map phase, so that the reduce phase can read them from the cache and improve the performance of the Map-Reduce job?
I found a paper on the use of a cache in Map-Reduce: http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=5395321&url=http%3A%2F%2Fieeexplore.ieee.org%2Fiel5%2F5394475%2F5394991%2F05395321.pdf%3Farnumber%3D5395321

If Hadoop Map-Reduce can be improved with a cache, then Pig scripts, which run as Map-Reduce jobs, would ultimately benefit as well.

Thanks,
Kapil
