Found 33676 Articles for Programming

How to create gridlines that match the Y-axis values in a plot created by using the plot function in R?

Nizamuddin Siddiqui
Updated on 09-Sep-2020 08:09:05

362 Views

When we draw gridlines on a base R plot, their positions are determined by the values passed to the grid function, so they may or may not line up with the Y-axis labels. To align them with the axis tick marks, set the corresponding argument of grid to NULL.
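A minimal sketch of the idea, using made-up sample data (the vector x and the seed are assumptions, not the article's original example). Passing ny = NULL asks grid to place horizontal lines at the Y-axis tick marks, and nx = NA suppresses the vertical lines.

```r
# Assumed sample data for illustration only
set.seed(1)
x <- rnorm(20)

plot(x)

# Tick positions that R chose for the Y-axis of the current plot
y_ticks <- axTicks(2)

# nx = NA: no vertical gridlines; ny = NULL: horizontal gridlines at the Y-axis ticks
grid(nx = NA, ny = NULL)
```

Calling grid(nx = NULL, ny = NULL) instead would align gridlines with the tick marks on both axes.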

How to create a column with a binary variable based on a condition on another variable in an R data frame?

Nizamuddin Siddiqui
Updated on 09-Sep-2020 08:00:43

9K+ Views

Sometimes we need to create an extra variable that adds information to the existing data. This is common in feature engineering: if we learn about something that may affect the response, we prefer to include it as a variable, so we derive it from the data we already have. For example, we can create a binary variable for goodness that indicates whether the frequency meets a certain criterion.
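One way to sketch this uses ifelse. The data frame, the column names Group, Frequency and Good, and the threshold of 25 are all hypothetical choices made for illustration; the article's own example is not reproduced here.

```r
# Hypothetical data frame for illustration only
set.seed(100)
df <- data.frame(
  Group = sample(c("A", "B", "C"), 10, replace = TRUE),
  Frequency = sample(1:50, 10)
)

# Good is 1 when Frequency exceeds the assumed threshold of 25, otherwise 0
df$Good <- ifelse(df$Frequency > 25, 1, 0)

df
```

The same result could be obtained with as.integer(df$Frequency > 25), which converts the logical condition directly to 0/1.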

How to raise all values in an R vector to a power?

Nizamuddin Siddiqui
Updated on 09-Sep-2020 07:49:38

1K+ Views

Often we need to raise a value, or every value in an R vector, to a power, especially when working with polynomial models. This can be done with the ^ operator, as in Excel. For example, if we have a vector x, then the square of every value in x is x^2.
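A short illustration with an assumed vector x1 (the values are made up). The ^ operator is vectorised, so it applies to every element at once.

```r
# Assumed example vector
x1 <- c(1, 2, 3, 4, 5)

# Square of every element
squares <- x1^2
squares
# [1]  1  4  9 16 25

# Cube of every element
cubes <- x1^3
cubes
# [1]   1   8  27  64 125
```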

How to change the position of axes titles to top for X-variable and to right for Y-variable in R?

Nizamuddin Siddiqui
Updated on 09-Sep-2020 07:46:52

269 Views

Most plotting tools place axis titles at the bottom for the X-axis and on the left for the Y-axis by default, but we might want to move them to the top and right respectively. This can be done by using the scale_x_continuous(position = "top") and scale_y_continuous(position = "right") functions of the ggplot2 package.
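A minimal sketch, assuming the ggplot2 package is installed; the data frame df and its columns x and y are made up for illustration. Note that the position argument moves the whole axis (ticks, labels and title) to the chosen side.

```r
library(ggplot2)

# Assumed sample data for illustration only
set.seed(101)
df <- data.frame(x = rnorm(20), y = rnorm(20))

# Scatterplot with the X-axis drawn at the top and the Y-axis on the right
p <- ggplot(df, aes(x, y)) +
  geom_point() +
  scale_x_continuous(position = "top") +
  scale_y_continuous(position = "right")

print(p)
```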

How to find similar words in vector of strings in R?

Nizamuddin Siddiqui
Updated on 09-Sep-2020 07:43:16

1K+ Views

Sometimes a vector of strings contains spelling errors, and we want to extract similar words, because similar words are likely to be the correct and misspelled forms of the same word. This can be done by using agrep with the lapply function.
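A small sketch of the approach with a made-up vector x1. Each element is used in turn as the pattern for agrep, which performs approximate matching against the whole vector; the max.distance of 1 (at most one edit) is an assumption chosen for this example, not a value taken from the article.

```r
# Assumed example vector containing two misspellings
x1 <- c("apple", "aple", "banana", "bananna", "cherry")

# For each word, return all words in x1 that match it approximately.
# lapply passes each element as the pattern argument of agrep.
similar <- lapply(x1, agrep, x = x1, max.distance = 1, value = TRUE)

similar
```

Each list element holds the words judged similar to the corresponding entry of x1, so likely spelling variants end up grouped together.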

How to change the scale of Y-axis if the histogram bars are higher than the axis size in R?

Nizamuddin Siddiqui
Updated on 09-Sep-2020 07:31:33

2K+ Views

When we create a histogram using the hist function in R, the Y-axis often ends below the top of one or more bars. The histogram then looks unappealing, and it becomes difficult to read the bar heights off the Y-axis. To solve this problem, we can use the ylim argument of hist to supply the range that the Y-axis should cover.
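One possible way to pick the range, sketched with assumed sample data: compute the bin counts first with plot = FALSE, then set ylim slightly above the tallest bar. The 10% headroom is an arbitrary choice for illustration.

```r
# Assumed sample data for illustration only
set.seed(101)
x <- rnorm(500)

# Compute the histogram without drawing it, to obtain the bin counts
h <- hist(x, plot = FALSE)

# Draw the histogram with a Y-axis that extends past the tallest bar
hist(x, ylim = c(0, max(h$counts) * 1.1))
```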

How to find the number of values in a column of an R data frame that are not zero?

Nizamuddin Siddiqui
Updated on 08-Sep-2020 14:57:37

284 Views

If an R data frame has numerical columns, some or all of them may contain zeros, and we might want to find the number of non-zero values in a column. This helps us compare columns by their number of non-zero values, and it can be done by using colSums.
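A brief sketch with a made-up data frame. The comparison df != 0 yields a logical matrix, and colSums counts the TRUE values in each column.

```r
# Assumed example data frame
df <- data.frame(
  x1 = c(0, 2, 0, 5, 1),
  x2 = c(3, 0, 0, 0, 7)
)

# Number of non-zero values in every column
nonzero <- colSums(df != 0)
nonzero
# x1 x2
#  3  2

# Number of non-zero values in a single column
sum(df$x1 != 0)
# [1] 3
```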

How to avoid the warning “Cannot compute exact p-value with ties” while performing a correlation test for Spearman’s correlation in R?

Nizamuddin Siddiqui
Updated on 08-Sep-2020 14:45:45

22K+ Views

When the variables are not continuous but can be ranked, we do not use the Pearson correlation coefficient, which measures the linear relationship; instead, we use the Spearman correlation coefficient. Because the Spearman coefficient is based on the ranks of the values, tied ranks prevent the correlation test from computing an exact p-value, so R falls back to an approximation and issues the warning “Cannot compute exact p-value with ties”. This can be avoided by using exact = FALSE inside the cor.test function.
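A minimal sketch with two made-up vectors that contain ties. Setting exact = FALSE requests the approximate p-value explicitly, so the warning about ties is not raised.

```r
# Assumed example vectors with tied values
x1 <- c(1, 2, 2, 3, 4, 4, 5, 6)
y1 <- c(2, 1, 3, 3, 5, 4, 6, 6)

# Spearman correlation test using the approximate p-value
res <- cor.test(x1, y1, method = "spearman", exact = FALSE)

res$estimate  # Spearman's rho
res$p.value   # approximate p-value
```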

How to get row index based on a value of an R data frame column?

Nizamuddin Siddiqui
Updated on 08-Sep-2020 14:43:24

5K+ Views

A row of an R data frame holds one value per column, and these values can be numerical, logical, string, etc. It is easy to find values based on row numbers, but finding row numbers based on a value works differently. If we want the row number for a particular value in a specific column, a convenient way is to extract the whole row by subsetting with single square brackets.
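A short sketch with a made-up data frame. Subsetting with single square brackets returns the matching rows together with their row names; which is shown as well because it returns the row indices directly (it is an addition here, not something the excerpt mentions).

```r
# Assumed example data frame
df <- data.frame(
  x1 = c(5, 8, 3, 8),
  x2 = c("a", "b", "c", "d")
)

# Whole rows where x1 equals 8; the row names show the row numbers
matches <- df[df$x1 == 8, ]
matches

# Row indices only
idx <- which(df$x1 == 8)
idx
# [1] 2 4
```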

How to display mean inside boxplot created by using boxplot function in R?

Nizamuddin Siddiqui
Updated on 08-Sep-2020 14:30:54

721 Views

A boxplot shows the median as a measure of center along with other values, but we might want to compare the means as well. Showing the mean as a point is therefore useful when we compare many boxplots. This can be done by using points(mean(x)), where x is the name of the vector; if we are plotting the columns of an R data frame, we reference the columns instead of the vector name.
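A minimal sketch with assumed sample data. For a single vector, boxplot draws the box at horizontal position 1, and points(mean(x)) also plots at position 1, so the mean lands inside the box. The pch and col settings are cosmetic choices for this example.

```r
# Assumed sample data for illustration only
set.seed(1)
x <- rnorm(50, mean = 10)

boxplot(x)

# Mark the mean with a filled red point on top of the boxplot
points(mean(x), pch = 19, col = "red")

# For several columns of a data frame, colMeans gives one mean per box
df <- data.frame(a = rnorm(50, mean = 10), b = rnorm(50, mean = 12))
boxplot(df)
points(colMeans(df), pch = 19, col = "red")
```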
