What is “throughput” in Hadoop?

Question

1 Answer

namanbhargava · Answer 1 · 2020-03-16T08:56:31+0000

In general terms, 'throughput' is defined as the net amount of work done per unit time. And Hadoop is a higher throughput when it concerns large datasets due to the following reasons:

HDFS in Hadoop uses a Write Once Read Many model. The principle implies that once data is written, it can't be modified. This sort of implement simplifies any sort of data coherency issue and in turn, provides high throughput.
Secondly, Hadoop follows a Data Locality principle. Normally, we move data to a location where we process it. In Hadoop, the data is processed at its local storage place. This reduces the overhead of data transport/transfer and increases overall throughput.

Learn more about throughput in Hadoop by signing up for Hadoop Online Training.

What is “throughput” in Hadoop?

1 Answer

Related questions

Browse Categories

Browse By Domains

Popular Courses

Popular Tutorials

Popular Resources