I don't wish to use default partitioning parameters. Is there any way to specify the partitions that I want to create in the MR job?

To use a custom partitioner in a Hadoop job, follow these steps:

  • Create a class that extends the Partitioner class.
  • In your new class, override the 'getPartition' method.
  • In the wrapper that runs the MapReduce job, register the custom partitioner with the job's 'setPartitionerClass' method, or set it in the job configuration.

This way, the partitioning logic lives in your own code. The number of partitions equals the number of reduce tasks, which you can set on the job with 'setNumReduceTasks'.
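
The steps above can be sketched roughly as follows. This is only an illustration, not a definitive implementation. It assumes a hypothetical FirstLetterPartitioner that routes records by the first character of the key, and it declares a small stand-in Partitioner class with the same getPartition signature as Hadoop's org.apache.hadoop.mapreduce.Partitioner, so that the snippet compiles without Hadoop on the classpath. In a real job you would import Hadoop's class instead and would most likely use Writable types such as Text and IntWritable rather than String and Integer.

```java
// Stand-in (assumption) for org.apache.hadoop.mapreduce.Partitioner, declared
// here only so the sketch compiles on its own. In a real job, delete this
// class and import Hadoop's Partitioner instead.
abstract class Partitioner<K, V> {
    public abstract int getPartition(K key, V value, int numPartitions);
}

// Hypothetical custom partitioner: picks a partition from the first
// character of the key, ignoring case.
class FirstLetterPartitioner extends Partitioner<String, Integer> {
    @Override
    public int getPartition(String key, Integer value, int numPartitions) {
        if (key == null || key.isEmpty()) {
            return 0; // send empty keys to the first partition
        }
        // A char is never negative, so the result is always in
        // the range [0, numPartitions).
        return Character.toLowerCase(key.charAt(0)) % numPartitions;
    }
}

// In the driver of a real Hadoop job, the partitioner would be registered
// on the Job object, for example:
//   job.setPartitionerClass(FirstLetterPartitioner.class);
//   job.setNumReduceTasks(3); // becomes numPartitions in getPartition
```

Whatever logic you choose, getPartition must return a value from 0 to numPartitions - 1, and all records that should reach the same reducer must map to the same value.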

