map() vs flatMap() in Spark

Question

1 Answer

ashely · Answer 1 · 2019-08-02T05:50:29+0000

Spark map function expresses a one-to-one transformation. It modifies each element of a collection into one element of the resulting collection. While Spark flatMap function expresses a one-to-many transformation. It modifies each element to 0 or more elements. Both map() and flatMap() are used for transformations.

The map() transformation takes in a function and applies it to each element in the RDD and the result of the function is a new value of each element in the resulting RDD. The flatMap() is used to generate multiple output elements for each input element. When using map(), the function we present to flatMap() is called individually for each element in our input RDD. Instead of returning a single element, an iterator with the return values is returned.

map() vs flatMap() in Spark

1 Answer

Related questions

Browse By Domains

Popular Courses

Popular Tutorials

Popular Resources