Remember

Register

All Courses Ask a Question

Questions
Unanswered
Ask a Question
Blog
Tutorials
Interview Questions

Back

Login

Explore Courses Blog Tutorials Interview Questions

community
Data Science
How to “select distinct” across multiple data...

How to “select distinct” across multiple data frame columns in pandas?

How to “select distinct” across multiple data frame columns in pandas?

0 votes

2 views

asked Jul 31, 2019 in Data Science by sourav (17.6k points)

I'm looking for a way to do the equivalent to the sql

"SELECT DISTINCT col1, col2 FROM dataframe_table"

The pandas sql comparison doesn't have anything about "distinct"

.unique() only works for a single column, so I suppose I could concat the columns, or put them in a list/tuple and compare that way, but this seems like something pandas should do in a more native way.

Am I missing something obvious, or is there no way to do this?

python
pandas

Please log in to add a comment.

Please log in to answer this question.

1 Answer

0 votes

answered Aug 1, 2019 by Shlok Pandey (41.4k points)

Use the drop_duplicates. This method is used to get the unique rows in a DataFrame:

In [29]: df = pd.DataFrame({'a':[1,2,1,2], 'b':[3,4,3,5]})
In [30]: df
Out[30]:
a b
0 1 3
1 2 4
2 1 3
3 2 5
In [32]: df.drop_duplicates()
Out[32]:
a b
0 1 3
1 2 4
3 2 5

Please log in to add a comment.

Related questions

0 votes

1 answer

Importing data from a MySQL database into a Pandas data frame including column names

asked Jul 31, 2019 in Data Science by sourav (17.6k points)

python
pandas
numpy
mysql

0 votes

1 answer

Creating a zero-filled pandas data frame

asked Jul 31, 2019 in Data Science by sourav (17.6k points)

python
pandas
dataframe

0 votes

1 answer

Python Pandas How to assign groupby operation results back to columns in parent dataframe?

asked Jul 30, 2019 in Data Science by sourav (17.6k points)

python
group-by
pandas
dataframe

0 votes

1 answer

Python Pandas add column for row-wise max value of selected columns

asked Jul 31, 2019 in Data Science by sourav (17.6k points)

python
pandas
max

0 votes

1 answer

How to set a cell to NaN in a pandas dataframe

asked Jul 31, 2019 in Data Science by sourav (17.6k points)

pandas
python
nan

1.2k questions

2.7k answers

501 comments

693 users

All categories
Python (132)
Java (165)
SQL (251)
Linux (14)
Big Data Hadoop & Spark (67)
Data Science (75)
R Programming (49)
C Programming (7)
DevOps and Agile (162)
AI and Deep Learning (32)
Machine Learning (9)
AWS (54)
Azure (26)
GCP (4)
RPA (2)
Selenium (12)
Blockchain (1)
Salesforce (24)
Others (12)
BI (30)
Web Technology (57)
Digital Marketing (3)
Technology Trends (6)

Browse Categories

Master Program
Big Data
Data Science
Business Intelligence
Salesforce
Cloud Computing Courses
Digital Marketing
Database
Programming
Testing
Project Management
Web Development Courses

© COPYRIGHT 2011-2024 INTELLIPAAT.COM. ALL RIGHTS RESERVED.

...