Hi,
I have a file on amazon aws under :
s3n://<Access Key>:<Secret Key>@<Bucket Name>/file.txt
I want this file to be accessible by the slave nodes via Distributed Cache.
I put the following after the job configuration statements in the Driver
program
:
DistributedCache.addCacheFile(new Path("s3n://<Access Key>:<Secret Key>@<Bucket
Name>/file.txt").toUri(), job.getConfiguration());
Also in my setup method in the mapper class, I have the below statement :
Path[] cacheFiles =
DistributedCache.getLocalCacheFiles(context.getConfiguration());
cacheFiles is gettng assigned null.
Could you please let me know what I am doing wrong here ? The file does exist
on
S3.
Thanks,
Neeral