How can I remove all duplicates so that NONE are left in a data frame?

Question

1 Answer

anonymous · Answer 1 · 2019-07-23T07:07:15+0000

To remove all the duplicates from the data frame, you can use the following syntax:

df[!(duplicated(df) | duplicated(df, fromLast = TRUE)), ]

For example:

Date <- as.Date(c('2006-08-30','2006-08-23', '2006-09-06','2006-08-23', '2006-09-13','2006-08-23', '2006-09-20'))
ID <- c("x1","x1","X2","x1","X3","x1","x1")
TransNo<-c("123","124","125","124","126","124","127")
df<-data.frame(ID,Date,TransNo)
ID Date TransNo
1 x1 2006-08-30 123
2 x1 2006-08-23 124
3 X2 2006-09-06 125
4 x1 2006-08-23 124
5 X3 2006-09-13 126
6 x1 2006-08-23 124
7 x1 2006-09-20 127

To get rows that occur only once:

df[!(duplicated(df) | duplicated(df, fromLast = TRUE)), ]
ID Date TransNo
1 x1 2006-08-30 123
3 X2 2006-09-06 125
5 X3 2006-09-13 126
7 x1 2006-09-20 127

How can I remove all duplicates so that NONE are left in a data frame?

1 Answer

Related questions

Browse Categories

Browse By Domains

Popular Courses

Popular Tutorials

Popular Resources