How to remove duplicate rows and sort based on a numerical column an R data frame?

R Programming Server Side Programming Programming

If we have duplicate rows in an R data frame then we can remove them by using unique function with data frame object name. And if we want to order the data frame with duplicate rows based on a numerical column then firstly unique rows should be found then order function can be used for sorting as shown in the below examples.

Example

Consider the below data frame −

Live Demo

x1<-rep(c(2,7,1,5),5)
x2<-rep(LETTERS[1:4],5)
df1<-data.frame(x1,x2)
df1

Output

Finding unique rows of df1 −

Example

df1<-unique(df1)
df1

Output

Ordering df1 based on x1 −

Example

df1[order(df1$x1),]

Output

Example

Live Demo

y1<-rep(c(501,278,357,615),5)
y2<-rep(c("G1","G2","G3","G4"),5)
df2<-data.frame(y1,y2)
df2

Output

Finding unique rows of df2 −

Example

df2<-unique(df2)
df2

Output

Ordering df2 based on y1 −

Example

df2[order(df2$y1),]

Output

Nizamuddin Siddiqui

Updated on: 07-Dec-2020

570 Views

Kickstart Your Career

Get certified by completing the course

Get Started