What are SUCCESS and part-r-00000 files in hadoop

Question

1 Answer

Amit Rawat · Answer 1 · 2019-06-20T12:25:00+0000

In Hadoop, whenever there is a successful creation of any job, the MapReduce runtime creates a _SUCCESS file in the output directory. This may be useful for applications that need to see if a result set is complete just by inspecting HDFS.

And coming on the part-x-yyyyy, it is the default name given to the output files.

In such output files:

x is either written 'r' or 'm', depending on the map-only job or reduce-only job, respectively.
yyyyy is the Reducer, or Mapper task number (defined as, 00000)

So, if a job has 20 reducers, it will generate files that are named from part-r-00000 to part-r-00019, one for each reducer task.

If you want to change the default name of your output file. You just need to go to the Driver class to change the default name of the output file:

job.getConfiguration().set("mapreduce.output.basename", "intellipaat")

If you want to know more about Hadoop, refer to the following video tutorial:

What are SUCCESS and part-r-00000 files in hadoop

What are SUCCESS and part-r-00000 files in hadoop

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Related questions

Browse Categories

Popular Courses

Top Tutorials

Top Articles

Top Interview Questions