I have a series of mappers that I would like to be passed data using the distributed cache mechanism. At the moment, I am using HDFS to pass the data, but this seems wasteful to me, since they are all reading the same data.
Is there a piece of example code that shows how data files can be placed in the
cache and accessed by mappers?
Thanks,
Andy Doddington
