Find the frequency of unique values and missing values for each column in an R data frame.


To find the frequency of unique values and missing values for each column in an R data frame, we can use the apply function with the table function and the useNA argument set to "always". With useNA="always", table reports an NA count for every column, even when that count is zero.

For example, if we have a data frame called df, we can find the frequency of unique values and missing values for each column in df with the following command:

apply(df,2,table,useNA="always")
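One thing to be aware of is that apply first coerces the data frame to a matrix, so a data frame with mixed column types is converted to character before the counting happens. As a minimal sketch of an alternative, lapply visits each column as it is and always returns a list. The data frame df and its columns a and b below are hypothetical values chosen for illustration, not taken from the examples in this article.

```r
# Hypothetical data frame with fixed values so the counts are reproducible.
df <- data.frame(
  a = c(1, 2, NA, 2, 1, NA),
  b = c("x", NA, "y", "y", "y", "x")
)

# lapply() works column by column without converting df to a matrix
# and returns a list holding one frequency table per column.
counts <- lapply(df, table, useNA = "always")

counts$a  # frequencies of 1, 2 and NA in column a
counts$b  # frequencies of "x", "y" and NA in column b
```

Each element of counts is an ordinary table object, so a single count can be looked up by name, for example counts$b["y"].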

Example 1

The following snippet creates a sample data frame:

x1<-sample(c(NA,1,2),20,replace=TRUE)
x2<-sample(c(NA,1,2),20,replace=TRUE)
df1<-data.frame(x1,x2)
df1

This creates and prints a data frame with 20 rows. Because the values are sampled at random, the rows differ from run to run.

To find the frequency of unique values and missing values for each column in df1, add the following line to the above snippet (re-running the sample calls would generate a different data frame):

apply(df1,2,table,useNA="always")

Output

If you execute all the above snippets as a single program, it prints the count of each distinct value and of NA for x1 and x2. The exact counts depend on the random sample.
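The shape of the value returned by apply depends on the data: when every column yields a table of the same length, apply simplifies the result to a matrix, and otherwise it returns a list. The sketch below illustrates this with two small hypothetical data frames, same_levels and diff_levels, whose values are fixed rather than sampled so that the behaviour is reproducible.

```r
# Both columns contain the values 1, 2 and NA, so each table has length 3.
same_levels <- data.frame(x1 = c(1, 2, NA, 1), x2 = c(2, 2, 1, NA))
res_same <- apply(same_levels, 2, table, useNA = "always")
is.matrix(res_same)  # equal-length tables are simplified to a matrix

# y2 has one more distinct value than y1, so the tables differ in length.
diff_levels <- data.frame(y1 = c(5, 10, NA, 5), y2 = c(5, 10, 20, NA))
res_diff <- apply(diff_levels, 2, table, useNA = "always")
is.list(res_diff)    # unequal-length tables are returned as a list
```

Code that processes the result should therefore not assume a list; checking with is.list first, or using lapply instead of apply, avoids the ambiguity.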

Example 2

The following snippet creates a sample data frame:

y1<-sample(c(NA,5,10),20,replace=TRUE)
y2<-sample(c(NA,5,10,20),20,replace=TRUE)
df2<-data.frame(y1,y2)
df2

This creates and prints a data frame with 20 rows. Because the values are sampled at random, the rows differ from run to run.

To find the frequency of unique values and missing values for each column in df2, add the following line to the above snippet (re-running the sample calls would generate a different data frame):

apply(df2,2,table,useNA="always")

Output

If you execute all the above snippets as a single program, it generates output like the following (the exact counts depend on the random sample):

$y1
   5   10 <NA>
   7    6    7
$y2
   5   10   20 <NA>
   4    8    2    6

Example 3

The following snippet creates a sample data frame:

z1<-sample(c(NA,25,45),20,replace=TRUE)
z2<-sample(c(NA,25,45),20,replace=TRUE)
df3<-data.frame(z1,z2)
df3

This creates and prints a data frame with 20 rows. Because the values are sampled at random, the rows differ from run to run.

To find the frequency of unique values and missing values for each column in df3, add the following line to the above snippet (re-running the sample calls would generate a different data frame):

apply(df3,2,table,useNA="always")

Output

If you execute all the above snippets as a single program, it prints the count of each distinct value and of NA for z1 and z2. The exact counts depend on the random sample.
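Since the sampled data change on every run, one way to make the result repeatable and to sanity-check it is sketched below. The call to set.seed with the arbitrary value 42, and the names counts, totals and na_counts, are assumptions added here for illustration and are not part of the original example.

```r
set.seed(42)  # arbitrary seed (an assumption) so the sample is repeatable

z1 <- sample(c(NA, 25, 45), 20, replace = TRUE)
z2 <- sample(c(NA, 25, 45), 20, replace = TRUE)
df3 <- data.frame(z1, z2)

# One frequency table per column, always including an NA entry.
counts <- lapply(df3, table, useNA = "always")

# Each column's counts should add up to the number of rows in df3.
totals <- sapply(counts, sum)

# colSums(is.na(...)) gives the number of missing values per column directly.
na_counts <- colSums(is.na(df3))
```

If only the missing-value counts are needed, na_counts alone is enough; the tables in counts add the breakdown of the non-missing values.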

Nizamuddin Siddiqui

Updated on: 10-Nov-2021
