Back

Explore Courses Blog Tutorials Interview Questions
0 votes
2 views
in Big Data Hadoop & Spark by (12.9k points)

Can anyone tell me how to learn pyspark?

1 Answer

0 votes
by (108k points)

I recommend following these steps to learn PySpark:

  • Become familiar with Apache Spark, its applicability and Spark 2.0 architecture
  • Gain hands-on expertise with the various tools in the Spark ecosystem, including Spark MLlib, Spark SQL, Kafka, Flume, and Spark Streaming
  • Understand the architecture of RDD, lazy evaluation, etc.
  • Learn how to change the architecture of the DataFrame and how to interact with it using Spark SQL
  • Build various APIs that work with Spark DataFrame
  • Pick up the skills to aggregate, filter, sort and transform data using DataFrame

If you are looking for an online course to learn PySpark, check out this PySpark Certification course by Intellipaat.

Also, check out this video for more information:

Related questions

0 votes
1 answer
0 votes
1 answer
0 votes
1 answer
asked Nov 29, 2019 in Python by Sammy (47.6k points)

Browse Categories

...