in Big Data Hadoop & Spark by (12.9k points)

Can anyone tell me what where() does in a PySpark DataFrame?

1 Answer

by (108k points)

In PySpark, the where() function filters the rows of a DataFrame according to a given condition. It is an alias for filter(); RDDs have no where() method and use filter() instead.

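Here are the syntaxes usually listed for where(). They come from Spark's Scala Dataset API, where forms 3 and 4 are overloads of filter() rather than where():

1) where(condition: Column): Dataset[T]

2) where(conditionExpr: String): Dataset[T] //using SQL expression

3) filter(func: T => Boolean): Dataset[T]

4) filter(func: FilterFunction[T]): Dataset[T]

In PySpark only forms 1 and 2 apply: DataFrame.where(condition) accepts either a Column expression or a SQL expression string.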

