Apache Hadoop is an open-source framework designed to store and process large data sets across clusters of machines. It uses simple programming models to process data in parallel, offers massive storage for any type of data, provides enormous processing power, and can handle a virtually unlimited number of concurrent tasks. It consists of the following components:
- Hadoop Distributed File System (HDFS): the storage layer, which distributes data across the nodes of the cluster.
- YARN (Yet Another Resource Negotiator): handles job scheduling and cluster resource management.
- MapReduce: performs parallel processing of the data (see the sketch after this list).
- Hadoop Common: the shared libraries and utilities required by the other Hadoop modules.
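As a rough illustration of the MapReduce component, below is a minimal word-count sketch written against the `org.apache.hadoop.mapreduce` API. The mapper emits a `(word, 1)` pair for every word it sees, and the reducer sums those counts per word; the class name `WordCount` and the command-line input/output paths are assumptions for this example, not part of any fixed Hadoop contract.

```java
import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

  // Mapper: emits (word, 1) for every word in each input line
  public static class TokenizerMapper
      extends Mapper<Object, Text, Text, IntWritable> {

    private final static IntWritable one = new IntWritable(1);
    private Text word = new Text();

    public void map(Object key, Text value, Context context)
        throws IOException, InterruptedException {
      StringTokenizer itr = new StringTokenizer(value.toString());
      while (itr.hasMoreTokens()) {
        word.set(itr.nextToken());
        context.write(word, one);
      }
    }
  }

  // Reducer: sums the counts emitted for each word
  public static class IntSumReducer
      extends Reducer<Text, IntWritable, Text, IntWritable> {

    private IntWritable result = new IntWritable();

    public void reduce(Text key, Iterable<IntWritable> values, Context context)
        throws IOException, InterruptedException {
      int sum = 0;
      for (IntWritable val : values) {
        sum += val.get();
      }
      result.set(sum);
      context.write(key, result);
    }
  }

  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    Job job = Job.getInstance(conf, "word count");
    job.setJarByClass(WordCount.class);
    job.setMapperClass(TokenizerMapper.class);
    job.setCombinerClass(IntSumReducer.class); // local pre-aggregation on map side
    job.setReducerClass(IntSumReducer.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);
    // Assumed convention: input path is args[0], output path is args[1]
    FileInputFormat.addInputPath(job, new Path(args[0]));
    FileOutputFormat.setOutputPath(job, new Path(args[1]));
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}
```

In a typical setup, such a job would be packaged into a JAR and submitted with something like `hadoop jar wordcount.jar WordCount /input /output`, after which YARN schedules the map and reduce tasks across the cluster while the input and output data live on HDFS.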