in Big Data Hadoop & Spark by (11.4k points)

I am starting to use Spark DataFrames and I need to be able to pivot the data to create multiple columns out of 1 column with multiple rows. There is built in functionality for that in Scalding and I believe in Pandas in Python, but I can't find anything for the new Spark Dataframe.

I assume I can write a custom function of some sort that will do this, but I'm not even sure how to start, especially since I am a novice with Spark. If anyone knows how to do this with built-in functionality, or has suggestions for how to write something in Scala, it would be greatly appreciated.

1 Answer

by (32.3k points)

Spark has provided a pivot function since version 1.6.

Let me give you an example using nycflights13 in CSV format.

nycflights13 is a package that contains information about all flights that departed from NYC airports (EWR, JFK and LGA) in 2013: 336,776 flights in total. To help understand what causes delays, it also includes a number of other useful datasets: airlines, airports, planes and weather.

import org.apache.spark.sql.functions.avg
import sqlContext.implicits._  // enables the $"column" syntax

// Read the flights data, letting Spark infer column types from the CSV.
val flights = sqlContext
  .read
  .format("csv")
  .options(Map("inferSchema" -> "true", "header" -> "true"))
  .load("flights.csv")

// For each (origin, dest, carrier) group, spread the "hour" values into
// columns, filling each cell with the average arrival delay.
flights
  .groupBy($"origin", $"dest", $"carrier")
  .pivot("hour")
  .agg(avg($"arr_delay"))
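One tip: when you call pivot without listing the values, Spark first runs an extra job to compute the distinct values of the pivot column. If you already know them, you can pass them explicitly and skip that pass. A sketch against the same flights DataFrame, assuming the hour column holds the values 0 through 23:

```scala
// Supplying the pivot values up front (hours 0-23 here, an assumption
// about this dataset) avoids the extra distinct-values job.
flights
  .groupBy($"origin", $"dest", $"carrier")
  .pivot("hour", (0 to 23).toSeq)
  .agg(avg($"arr_delay"))
```

Any hour not present in the data simply produces a column of nulls, and any value you leave out of the list is dropped from the result.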
