- Trending Categories
Data Structure
Networking
RDBMS
Operating System
Java
iOS
HTML
CSS
Android
Python
C Programming
C++
C#
MongoDB
MySQL
Javascript
PHP
Physics
Chemistry
Biology
Mathematics
English
Economics
Psychology
Social Studies
Fashion Studies
Legal Studies
- Selected Reading
- UPSC IAS Exams Notes
- Developer's Best Practices
- Questions and Answers
- Effective Resume Writing
- HR Interview Questions
- Computer Glossary
- Who is Who
How to subset rows of an R data frame based on duplicate values in a particular column?
Duplication is also a problem that we face during data analysis. We can find the rows with duplicated values in a particular column of an R data frame by using duplicated function inside the subset function. This will return only the duplicate rows based on the column we choose that means the first unique value will not be in the output.
Example
Consider the below data frame: x1<-1:20 x2<-rpois(20,4) df1<-data.frame(x1,x2) df1
Output
x1 x2 1 1 7 2 2 6 3 3 2 4 4 6 5 5 1 6 6 7 7 7 5 8 8 2 9 9 2 10 10 2 11 11 3 12 12 2 13 13 1 14 14 3 15 15 3 16 16 3 17 17 5 18 18 5 19 19 7 20 20 3
Create rows of df1 based on duplicates in column x2 −
Example
subset(df1,duplicated(x2))
Output
x1 x2 4 4 6 6 6 7 8 8 2 9 9 2 10 10 2 12 12 2 13 13 1 14 14 3 15 15 3 16 16 3 17 17 5 18 18 5 19 19 7 20 20 3
Example
y1<-LETTERS[1:20] y2<-sample(0:5,20,replace=TRUE) df2<-data.frame(y1,y2) df2
Output
y1 y2 1 A 5 2 B 4 3 C 1 4 D 2 5 E 3 6 F 4 7 G 1 8 H 4 9 I 3 10 J 1 11 K 5 12 L 5 13 M 0 14 N 3 15 O 5 16 P 0 17 Q 1 18 R 4 19 S 2 20 T 3
Create rows of df2 based on duplicates in column y2 −
Example
subset(df2,duplicated(y2))
Output
y1 y2 6 F 4 7 G 1 8 H 4 9 I 3 10 J 1 11 K 5 12 L 5 14 N 3 15 O 5 16 P 0 17 Q 1 18 R 4 19 S 2 20 T 3
- Related Articles
- How to subset non-duplicate values from an R data frame column?
- How to remove duplicate rows and sort based on a numerical column an R data frame?
- How to remove rows containing missing value based on a particular column in an R data frame?
- How to select rows based on range of values of a column in an R data frame?
- How to remove rows from data frame in R based on grouping value of a particular column?
- How to subset a matrix based on values in a particular column in R?
- How to subset an R data frame based on numerical and categorical column?
- How to remove rows from an R data frame based on frequency of values in grouping column?
- How to subset a data frame based on a vector values in R?
- How to extract columns based on particular column values of an R data frame that match\na pattern?
- How to change row values based on column values in an R data frame?
- How to extract a particular value based on index from an R data frame column?
- How to subset an R data frame based on small letters?
- How to subset rows based on criterion of multiple numerical columns in R data frame?
- How to subset row values based on columns name in R data frame?

Advertisements