Remember

Register

All Courses Ask a Question

Questions
Unanswered
Ask a Question
Blog
Tutorials
Interview Questions

Back

Login

Explore Courses Blog Tutorials Interview Questions

community
Big Data Hadoop & Spark
PySpark: How to fillna values in dataframe for...

PySpark: How to fillna values in dataframe for specific columns?

PySpark: How to fillna values in dataframe for specific columns?

0 votes

2 views

asked Jul 19, 2019 in Big Data Hadoop & Spark by Aarav (11.4k points)

I have the following sample DataFrame:

a    | b    | c   |
1    | 2    | 4   |
0    | null | null|
null | 3    | 4   |

And I want to replace null values only in the first 2 columns - Column "a" and "b":

a    | b    | c   |
1    | 2    | 4   |
0    | 0    | null|
0    | 3    | 4   |

Here is the code to create sample dataframe:

rdd = sc.parallelize([(1,2,4), (0,None,None), (None,3,4)])
df2 = sqlContext.createDataFrame(rdd, ["a", "b", "c"])

I know how to replace all null values using:

df2 = df2.fillna(0)

And when I try this, I lose the third column:

df2 = df2.select(df2.columns[0:1]).fillna(0)

apache-spark

Please log in to add a comment.

Please log in to answer this question.

1 Answer

0 votes

answered Jul 19, 2019 by Amit Rawat (32.3k points)

Firstly, you will create your dataframe:

Now, in order to replace null values only in the first 2 columns - Column "a" and "b", and that too without losing the third column, you can use:

df.fillna( { 'a':0, 'b':0 } )

Learn Pyspark with the help of Pyspark Course by Intellipaat.

Please log in to add a comment.

Related questions

0 votes

1 answer

How to delete columns in pyspark dataframe

asked Jul 10, 2019 in Big Data Hadoop & Spark by Aarav (11.4k points)

apache-spark

0 votes

1 answer

Apply StringIndexer to several columns in a PySpark Dataframe

asked Jul 15, 2019 in Big Data Hadoop & Spark by Aarav (11.4k points)

apache-spark

0 votes

1 answer

Pyspark filter dataframe by columns of another dataframe

asked Jul 24, 2019 in Big Data Hadoop & Spark by Aarav (11.4k points)

apache-spark

+4 votes

5 answers

Removing duplicates from rows based on specific columns in an RDD/Spark DataFrame

asked Jul 10, 2019 in Big Data Hadoop & Spark by Aarav (11.4k points)

apache-spark

0 votes

1 answer

How to replace null values with a specific value in Dataframe using spark in Java?

asked Jul 29, 2019 in Big Data Hadoop & Spark by Aarav (11.4k points)

apache-spark

31k questions

32.8k answers

501 comments

693 users

Browse Categories

Master Program
Big Data
Data Science
Business Intelligence
Salesforce
Cloud Computing Courses
Digital Marketing
Database
Programming
Testing
Project Management
Web Development Courses

Browse By Domains

Data Science Courses Big Data Analytics Courses Business Intelligence Courses Salesforce Courses Cloud Computing Courses Digital Marketing Courses AI & Machine Learning Courses Programming Courses Database Courses Project Management Courses Cyber Security and Ethical Hacking Courses Web Development Courses Software Testing Courses Automation Courses Job Oriented Courses Degree Courses

Popular Courses

Data Science Course Artificial Intelligence Course Data Analytics Course Machine Learning Course Python Data Science Course Business Analytics Course Python Course Azure Course DevOps Course Cyber Security Course AWS Solutions Architect Salesforce Course Selenium Course AWS DevOps Course Ethical Hacking Course Power BI Course Digital Marketing Course Business Analyst Course Investment Banking Course Azure DevOps Course Azure Data Engineer Course Electric Vehicle Course UI UX Design Course SQL Course Full Stack Developer Course Data Engineering Course Supply Chain Management Course General Management Course Product Management Course

Popular Tutorials

Data Science Tutorial Machine Learning Tutorial Cyber Security Tutorial Salesforce Tutorial AWS Tutorial Azure Tutorial SQL Tutorial Selenium Tutorial Ethical Hacking Tutorial Artificial Intelligence Tutorial

Popular Resources

Data Science Machine Learning AWS Digital Marketing Cyber Security Python Interview Questions and Answers SQL Interview Questions and Answers Data Science Interview Questions and Answers PHP Interview Questions and Answers Azure DevOps Interview Questions and Answers

About Us
Media
Privacy Policy
Terms of Use
Contact Us
Blog
Interview Questions
Tutorials
Become an Instructor

© COPYRIGHT 2011-2024 INTELLIPAAT.COM. ALL RIGHTS RESERVED.

...