Subset Data Table Object Using a Range of Values in R

Nizamuddin Siddiqui
Updated on 06-Mar-2021 12:54:18

3K+ Views

To subset a data.table object using a range of values, we can use single square brackets and choose the range using %between%. For example, if we have a data.table object DT that contains a column x and the values in x ranges from 1 to 10 then we can subset DT for values between 3 to 8 by using the command DT[DT$x %between% c(3,8)].Example1Loading data.table package and creating a data.table object −library(data.table) x1

Convert a List to JSON in R

Nizamuddin Siddiqui
Updated on 06-Mar-2021 12:23:45

3K+ Views

To convert a list to JSON, we can use toJSON function of jsonlite package. For example, if we have a list called LIST then it can be converted to a JSON by using the command toJSON(LIST,pretty=TRUE,auto_unbox=TRUE). We need to make sure that the package jsonlite is loaded in R environment otherwise the command won’t work.Example Live DemoList

Randomly Sample Rows from an R Data Frame using sample()

Nizamuddin Siddiqui
Updated on 06-Mar-2021 12:23:14

530 Views

To randomly sample rows from an R data frame using sample_n, we can directly pass the sample size inside sample_n function of dplyr package. For example, if we have data frame called df then to create a random sample of 5 rows in df can be done by using the command −df%>%sample_n(5)Example1Consider the below data frame − Live Demox1

Add Variable Description in R

Nizamuddin Siddiqui
Updated on 06-Mar-2021 12:16:47

2K+ Views

To add a variable description in R, we can use comment function and if we want to have a look at the description then structure call of the data frame will be used. For example, if we have a data frame say df that contains a column x then we can describe x by using the command comment(df$x)

Format All Decimal Places in R Vector and Data Frame

Nizamuddin Siddiqui
Updated on 06-Mar-2021 12:16:25

4K+ Views

To format all decimal places in an R vector and data frame, we can use formattable function of formattable package where we can specify the number of digits after decimal places. For example, if we have a numerical vector say x then the values in x can be formatted to have only 2 decimal places by using the command formattable(x,format="f",digits=2).Example1Loading formattable package −library(formattable) Live Demox1

Create Multiple Bar Plots with Same Width Bars using ggplot2 in R

Nizamuddin Siddiqui
Updated on 06-Mar-2021 12:14:11

1K+ Views

To create multiple bar plots for varying categories with same width bars using ggplot2, we would need to play with width argument inside geom_bar function to match the width of the bars in each bar plot. The best way to do this would be setting the larger ones to 0.25 and the shorter ones to 0.50.ExampleConsider the below data frame − Live Demox1

Find High Leverage Values for a Regression Model in R

Nizamuddin Siddiqui
Updated on 06-Mar-2021 12:08:26

2K+ Views

To find the high leverage values for a regression model, we first need to find the predicted values or hat values that can be found by using hatvalues function and then define the condition for high leverage and extract them. For example if we have a regression model say M then the hat values can be found by using the command hatvalues(M), now to find the high leverage values that are greater than 0.05 can be found by using the below code −which(hatvalues(M)>0.05)Example1Consider the below data frame − Live Demox1

Apply Multiple AND Conditions to a Data Frame in R

Nizamuddin Siddiqui
Updated on 06-Mar-2021 12:04:04

325 Views

To apply multiple conditions to a data frame, we can use double and sign that is &&. For example, if we have a data frame called df that contains three columns say x, y, z and we want to add a value to all columns if first element in z equals to 5 then it can be done by using the command −if(df$x && df$y && df$y == 5){    df$x = df$x+10    df$y = df$y+10    df$z = df$z+10 }Example1Consider the below data frame − Live Demox1

Create Empty Bar Plot in Base R

Nizamuddin Siddiqui
Updated on 06-Mar-2021 11:59:47

397 Views

To create a bar plot in base R, we can use the function barplot and pass the vector or column of the data frame for which we want to create the bar plot but the bars created by using barplot by default has grey color. Therefore, if we want to create an empty bar plot then setting the color of bars to NA will make the plot an empty bar plot.Example1x

Check If a Time Series is Stationary in R

Nizamuddin Siddiqui
Updated on 06-Mar-2021 11:57:19

6K+ Views

To check if a time series is stationary, we can use Dickey-Fuller test using adf.test function of tseries package. For example, if we have a time series object say TimeData then to check whether this time series is stationary or not we can use the command adf.test(TimeData).Example1 Live Demox1

Advertisements