Perform One-Way ANOVA with Unequal Sample Sizes in R

Nizamuddin Siddiqui
Updated on 07-Dec-2020 05:59:43

1K+ Views

To perform a one-way ANOVA when the groups have unequal sample sizes, we can use the aov function. Suppose we have a categorical column Group with four categories and a continuous variable Response, both stored in a data frame called df. The one-way ANOVA can then be performed as:
aov(Response~Group,data=df)
Example
Consider the below data frame:
Group ...
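A minimal sketch of the call above on made-up data. The group labels, group sizes (5, 8, 6, 11), and simulated responses are assumptions for illustration, not the article's data.

```r
# Hypothetical data: four groups with unequal sample sizes (5, 8, 6, 11).
set.seed(123)
df <- data.frame(
  Group = factor(rep(c("A", "B", "C", "D"), times = c(5, 8, 6, 11))),
  Response = rnorm(30, mean = 10, sd = 2)
)

# Confirm that the group sizes differ.
table(df$Group)

# aov() needs no extra arguments for unequal group sizes in a one-way design.
fit <- aov(Response ~ Group, data = df)
summary(fit)
```

With a single factor, the unbalanced design does not change how the call is written. The residual degrees of freedom are the total number of observations minus the number of groups.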

Generate Standard Normal Random Numbers in R

Nizamuddin Siddiqui
Updated on 07-Dec-2020 05:54:52

4K+ Views

A standard normal distribution is a normal distribution with mean 0 and standard deviation 1. To generate standard normal random numbers, we can use the rnorm function in R, passing mean = 0 and standard deviation = 1 to it.
Example
rnorm(10, 0, 1)
Output
[1] 0.6936607 -0.7967657 -2.7544428 0.2688767 0.5278463 -1.5387568
[7] 1.1716632 -1.5033895 0.8112929 -1.0101065
Example
rnorm(50, 0, 1)
Output
[1] 2.58246666 -0.53083341 -0.57343343 1.08172756 1.30341849 -0.07440422
[7] -0.41869305 -0.96227706 -0.46899119 1.55428279 0.09162738 -0.96027221
[13] -0.84735327 -1.74949782 0.58541758 0.23117630 0.47402479 -0.72453853
[19] 0.07171564 1.13088794 0.18735157 0.25091758 -1.34728315 -0.39768159
[25] -0.38109955 -0.34019286 -1.51778561 ...
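As a small supplement, the sketch below shows that mean = 0 and sd = 1 are already the defaults of rnorm, and that set.seed makes the draws reproducible. The seed value and sample sizes are arbitrary choices for illustration.

```r
# Fix the seed so the draws can be reproduced.
set.seed(42)
x <- rnorm(10, mean = 0, sd = 1)

# Same seed, default arguments: rnorm() defaults to mean = 0 and sd = 1.
set.seed(42)
y <- rnorm(10)

# Both calls produce the same numbers.
identical(x, y)

# With a large sample, the sample mean and sd should be close to 0 and 1.
z <- rnorm(10000)
mean(z)
sd(z)
```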

Remove List Elements by Their Name in R

Nizamuddin Siddiqui
Updated on 07-Dec-2020 05:53:06

2K+ Views

Data analysis involves more than mathematical operations; much of the time goes into data cleaning, data mining, feature engineering, and so on. One such operation is removing list elements by name when they do not help us achieve our objective. Elements can be removed by name with the help of the which function.
Example
Consider the below list:
List1 ...
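A minimal sketch of removal by name, assuming a small hypothetical list (the element names a to d and the vector drop are invented for illustration). It shows the which-based approach alongside a logical-index variant.

```r
# Hypothetical list with named elements.
List1 <- list(a = 1:3, b = letters[1:2], c = TRUE, d = 3.14)

# Names of the elements to remove (an assumption for this example).
drop <- c("b", "d")

# which() returns the positions of the matching names; the minus sign drops them.
List2 <- List1[-which(names(List1) %in% drop)]

# Logical-index variant: keeps every element whose name is not in drop.
List3 <- List1[!(names(List1) %in% drop)]

names(List2)
```

One caveat with the which form: if none of the names match, which returns an empty vector and the subset comes back as an empty list. The logical form keeps the list unchanged in that case.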

Create Line Chart Using ggplot2 with Larger Width in R

Nizamuddin Siddiqui
Updated on 07-Dec-2020 05:50:43

145 Views

The width of the line in a line chart can be increased by using the size argument of geom_line in ggplot2. Because the width is a constant rather than a mapped variable, it belongs outside aes(); ggplot2 3.4.0 and later name this argument linewidth. For example, if we have a data frame df that contains two numerical columns x and y, and we want to create a line chart of the two with a wider line, it can be done as:
ggplot(df)+geom_line(aes(x,y),size=2)
Example
Consider the below data frame:
x ...
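A minimal sketch of a wider line, assuming ggplot2 3.4.0 or later is installed (where the line width argument is called linewidth; on older versions size = 2 plays the same role). The data values are invented for illustration.

```r
library(ggplot2)

# Hypothetical data frame with two numerical columns.
df <- data.frame(x = 1:10, y = c(2, 4, 3, 6, 5, 8, 7, 9, 8, 10))

# A constant width is set outside aes(); inside aes() it would be treated
# as a mapped variable and would add a legend.
p <- ggplot(df, aes(x, y)) + geom_line(linewidth = 2)
```

Calling print(p) renders the chart. Larger values of linewidth give a thicker line.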

Save List with Equal Number of Values to Text File in R

Nizamuddin Siddiqui
Updated on 07-Dec-2020 05:49:00

228 Views

If we want to save a list to a text file, the first step is to convert the list to a data frame; the write.table function can then be used for saving. For example, if we have a list defined as LIST whose elements each contain 50 values, we can convert it to a data frame as:
LIST_df=as.data.frame(do.call(cbind,LIST))
Now we can save it as:
write.table(LIST_df,"LIST.txt")
Example
Consider the below list:
x1 ...
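The two steps above can be sketched end to end as follows. The list contents are simulated and the file is written to a temporary directory; both are assumptions for illustration.

```r
# Hypothetical list: three elements, each with 50 values.
set.seed(7)
LIST <- list(x1 = rnorm(50), x2 = runif(50), x3 = rpois(50, lambda = 5))

# cbind the elements column-wise, then convert to a data frame.
# This relies on every element having the same length.
LIST_df <- as.data.frame(do.call(cbind, LIST))

# Write to a temporary location instead of the working directory.
out_file <- file.path(tempdir(), "LIST.txt")
write.table(LIST_df, out_file)

# Read the file back to confirm the shape survived the round trip.
check <- read.table(out_file, header = TRUE)
dim(check)
```

If the elements had different lengths, cbind would recycle the shorter ones, so this approach fits only lists with an equal number of values per element.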

Set Comma as Decimal Separator in R

Nizamuddin Siddiqui
Updated on 07-Dec-2020 05:45:57

6K+ Views

In many European countries, a comma is used to separate the integer part of a number from the decimal part. Thus, we might want to display data or results with a comma as the decimal separator. In R, we can do this with options(OutDec=","). Once this is run in the R console, numerical values with decimals are printed with commas in place of dots.
Example
options(OutDec=",")
rnorm(10)
Output
[1] 0,14421957 -0,24152088 -0,05215867 -0,40577010 0,19806357 -1,49349808
[7] 0,91085263 0,43550033 2,64009603 1,17177332
Example
rnorm(50)
Output
[1] -0,56186368 -2,11404777 0,...
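A minimal sketch of switching the separator and then restoring it, using invented values. options returns the previous settings, which makes it easy to undo the change.

```r
# Switch the output decimal separator and remember the old setting.
old <- options(OutDec = ",")

x <- c(0.5, 1.25, -3.75)
print(x)

# format() also follows OutDec.
s <- format(1.5)

# Restore the previous setting so later output uses the dot again.
options(old)
t <- format(1.5)
```

OutDec affects only how numbers are printed and formatted. Numbers typed in code must still use a dot, and the option should hold a single character.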

Generate Probability Density Distribution from Observations in R

Nizamuddin Siddiqui
Updated on 07-Dec-2020 05:44:08

3K+ Views

Probability density distribution is another name for the probability density function. It is a function that defines the density of a continuous random variable. In R, we can use the density function to estimate a probability density from a set of observations.
Example
x1 ...
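A minimal sketch of the density function on simulated observations. The vector x1 here is an assumption for illustration, not the article's data.

```r
# Hypothetical observations.
set.seed(1)
x1 <- rnorm(200)

# density() computes a kernel density estimate on a grid of points.
d <- density(x1)

# d$x holds the grid and d$y the estimated density at each grid point.
length(d$x)

# A density should integrate to about 1; check with the trapezoid rule.
area <- sum(diff(d$x) * (head(d$y, -1) + tail(d$y, -1)) / 2)
area
```

Calling plot(d) draws the estimated curve. The bw argument of density controls the smoothing bandwidth.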

Divide Row Values of Numerical Column by Categorical Column in R Data Frame

Nizamuddin Siddiqui
Updated on 07-Dec-2020 05:38:51

428 Views

If we have a categorical column with two or more categories and a numerical column, we might want to divide the numerical values of one category by the numerical values of another. This can be done with the division operator /, but we need to subset the values properly.
Example
Consider the below data frame:
x1 ...
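A minimal sketch of the subsetting, assuming a hypothetical data frame in which x1 is the categorical column and y the numerical one. The column roles and values are invented for illustration.

```r
# Hypothetical data: two categories with four values each.
df <- data.frame(
  x1 = rep(c("A", "B"), each = 4),
  y  = c(10, 20, 30, 40, 2, 4, 5, 8)
)

# Subset y by category, then divide element by element.
ratio <- df$y[df$x1 == "A"] / df$y[df$x1 == "B"]
ratio
```

The division is element-wise, so it pairs the first A value with the first B value and so on. It is meaningful only when both categories have the same number of rows in a matching order.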

Find Correlation Matrix of Groups for Data Table Object in R

Nizamuddin Siddiqui
Updated on 07-Dec-2020 05:36:27

373 Views

To find the correlation between groups, we can use the cor function, but it cannot be applied directly. We first need to set the key on the group column of the data.table object. For example, if we have a data.table DT with one numerical column x and one group column Group having four groups a, b, c, and d, then the correlation of the numerical values for groups a and b can be found as:
setkey(DT, Group)
cor(DT["a"]$x, DT["b"]$x)
Loading data.table package:
library(data.table)
Example
Consider the below data.table object:
x ...
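A minimal sketch of the keyed lookup, assuming the data.table package is installed. The table contents are simulated for illustration.

```r
library(data.table)

# Hypothetical data.table: four groups with five values each.
set.seed(10)
DT <- data.table(x = rnorm(20), Group = rep(c("a", "b", "c", "d"), each = 5))

# Setting the key lets DT["a"] select the rows of group a.
setkey(DT, Group)

# Correlation between the x values of groups a and b.
r_ab <- cor(DT["a"]$x, DT["b"]$x)

# The same result without relying on the key.
r_check <- cor(DT[Group == "a"]$x, DT[Group == "b"]$x)
```

cor requires both vectors to have the same length, so the groups being compared need equal numbers of rows.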

Replace Upper Triangular Matrix with Lower Triangular Matrix in R

Nizamuddin Siddiqui
Updated on 07-Dec-2020 05:35:21

922 Views

The upper triangular part of a matrix can be replaced with its lower triangular part by transposing the whole matrix, extracting the upper triangular part of the transpose, and storing it in the original matrix. For example, if we have a matrix M1, then the upper triangular part of M1 can be replaced with the lower triangular part by using the below code:
M1[upper.tri(M1)] ...
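The code line in the excerpt is cut off, so the sketch below is one way to implement the described steps, not necessarily the article's exact code. The 4 x 4 matrix is invented for illustration.

```r
# Hypothetical square matrix.
M1 <- matrix(1:16, nrow = 4)

# Work on a copy so the original stays available for comparison.
M2 <- M1

# Transpose, take the upper triangle of the transpose (which holds the
# original lower triangle), and store it in the upper triangle.
M2[upper.tri(M2)] <- t(M2)[upper.tri(M2)]

# The result mirrors the lower triangle across the diagonal.
isSymmetric(M2)
```

The diagonal and the lower triangle are left untouched; only the entries above the diagonal change.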
