
My map tasks need some configuration data, which I would like to distribute via the Distributed Cache.

The Hadoop MapReduce Tutorial shows the usage of the DistributedCache class, roughly as follows:

// In the driver
JobConf conf = new JobConf(getConf(), WordCount.class);
...
DistributedCache.addCacheFile(new Path(filename).toUri(), conf);

// In the mapper
Path[] myCacheFiles = DistributedCache.getLocalCacheFiles(job);
...

However, DistributedCache is marked as deprecated in Hadoop 2.2.0.

What is the new preferred way to achieve this? Is there an up-to-date example or tutorial covering this API? 

1 Answer


In Hadoop 2.x, the Distributed Cache APIs live on the Job class itself. In the driver, the code should look something like this:

Job job = Job.getInstance();  // new Job() is itself deprecated; use the getInstance() factory method
...
job.addCacheFile(new Path(filename).toUri());

In your mapper code:

Path[] localPaths = context.getLocalCacheFiles();
...

Note that getLocalCacheFiles() is also deprecated in Hadoop 2.x; the non-deprecated replacement is context.getCacheFiles(), which returns the cache files' URIs rather than their local Paths.
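To make the mapper side concrete, here is a minimal sketch on the non-deprecated API. The CacheMapper class name and the key<TAB>value file format are illustrative assumptions, and reading the file by its base name relies on YARN's default behavior of symlinking cache files into the task's working directory:

import java.io.BufferedReader;
import java.io.FileReader;
import java.io.IOException;
import java.net.URI;
import java.util.HashMap;
import java.util.Map;

import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

public class CacheMapper extends Mapper<LongWritable, Text, Text, Text> {

    private final Map<String, String> config = new HashMap<String, String>();

    @Override
    protected void setup(Context context) throws IOException {
        // getCacheFiles() returns the URIs registered with job.addCacheFile().
        URI[] cacheFiles = context.getCacheFiles();
        if (cacheFiles == null) {
            return;
        }
        for (URI uri : cacheFiles) {
            // On YARN each cache file is symlinked into the task's working
            // directory, so it can be opened by its base name.
            String name = new Path(uri.getPath()).getName();
            BufferedReader reader = new BufferedReader(new FileReader(name));
            try {
                String line;
                while ((line = reader.readLine()) != null) {
                    String[] parts = line.split("\t", 2);  // assumed key<TAB>value format
                    if (parts.length == 2) {
                        config.put(parts[0], parts[1]);
                    }
                }
            } finally {
                reader.close();
            }
        }
    }

    @Override
    protected void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        // Use the cached configuration data while processing records.
        String mapped = config.containsKey(value.toString())
                ? config.get(value.toString()) : "UNKNOWN";
        context.write(value, new Text(mapped));
    }
}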

Check the documentation here: http://hadoop.apache.org/docs/stable2/api/org/apache/hadoop/mapreduce/Job.html
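For completeness, here is a matching driver as a minimal sketch. The CacheExample class name and the use of args[2] for the HDFS path of the configuration file are assumptions for illustration:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class CacheExample {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf, "distributed cache example");
        job.setJarByClass(CacheExample.class);
        job.setMapperClass(CacheMapper.class);
        job.setNumReduceTasks(0);  // map-only job, for brevity
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(Text.class);

        // args[2]: HDFS path of the configuration file to distribute
        job.addCacheFile(new Path(args[2]).toUri());

        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}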
