Which operations preserve RDD order?

Question

1 Answer

Amit Rawat · Answer 1 · 2019-07-10T10:06:18+0000

Almost all operations preserve the order, except for the operations that explicitly do not intend to preserve the order such as sortBy, partitionBy, join. Ordering is always "meaningful",

Let’s say, if you read a file (sc.textFile) the lines of the RDD will be in the order that they were in the file.

map, filter, flatMap, and coalesce (with shuffle=false) do preserve the order like most of the RDD operations they work on Iterators inside the partitions. So, they just don’t have any choice of messing up the order.

Which operations preserve RDD order?

1 Answer

Related questions

Browse Categories