Nizamuddin Siddiqui has published 2303 articles

How to split a big data frame into smaller ones in R?

Nizamuddin Siddiqui

Updated on 10-Aug-2020 15:38:43

1K+ Views

Working with a big data frame is not easy, so we might want to split it into smaller data frames. These smaller data frames can be extracted from the big one based on some criteria, such as the levels of a factor variable or some other condition. ...

How to create a polynomial model in R?

Nizamuddin Siddiqui

Updated on 10-Aug-2020 15:25:29

312 Views

Often, the explanatory variables are not linearly related to the response variable, and we need to find the best model for our data. In such situations, we move on to polynomial models to check whether they will be helpful in determining the accuracy of the ...

How to add a column between columns or after last column in an R data frame?

Nizamuddin Siddiqui

Updated on 10-Aug-2020 15:17:08

249 Views

Since no one is perfect, people might forget to add all the columns that are necessary for the analysis, but this problem can be solved. If a column is missing from our data frame and we come to know about it later, it can be added easily with the help ...

How to delete a row from an R data frame?

Nizamuddin Siddiqui

Updated on 10-Aug-2020 15:06:44

568 Views

While doing an analysis, we might come across data that is not required and that we want to delete. This data can be a single row or multiple rows. For example, if a row contains values greater than, less than, or equal to a certain threshold, then it might ...

How to replace missing values recorded with blank spaces in R with NA or any other value?

Nizamuddin Siddiqui

Updated on 10-Aug-2020 14:49:40

1K+ Views

Sometimes when we read data in R, the missing values are recorded as blank spaces, and it is difficult to replace them with any value. The reason is that we need to know how many spaces were used in place of the missing values. If we know that, then ...

How to find the correlation matrix in R using all variables of a data frame?

Nizamuddin Siddiqui

Updated on 10-Aug-2020 14:42:15

825 Views

A correlation matrix helps us determine the direction and strength of the linear relationship among multiple variables at a time. Therefore, it becomes easier to decide which variables should be used in a linear model and which ones could be dropped. We can find the correlation matrix by simply using cor ...

How to change the order of columns in an R data frame?

Nizamuddin Siddiqui

Updated on 10-Aug-2020 14:32:20

757 Views

Reordering columns might be required when we want to manipulate the data. Manipulation can be needed for several reasons, such as cross-verification, visualisation, etc. We should also be careful when we change anything in the original data because that might affect our processing. To change the order of columns we can ...

How to create bar chart using ggplot2 with chart sub-title in R?

Nizamuddin Siddiqui

Updated on 10-Aug-2020 14:21:44

207 Views

There are different ways to present a chart. The more information we can provide in a chart, the better, because a picture is worth a thousand words. Since nobody likes to read long reports, we should report charts well. Therefore, we can add a chart title as well ...

How to create a data frame in R with repeated rows by a sequence of number of times or by a fixed number of times?

Nizamuddin Siddiqui

Updated on 10-Aug-2020 14:18:16

1K+ Views

There are times when duplicated rows in a data frame are required; mainly, they are used to extend the data size instead of collecting more raw data. This saves time but introduces some bias, which is why it is not recommended. Even though it is not recommended, sometimes ...

How to create a data frame of the maximum value for each group in an R data frame using dplyr?

Nizamuddin Siddiqui

Updated on 10-Aug-2020 14:06:37

493 Views

Sometimes we need to subset the group-wise maximum values during data analysis, and this subset of the data frame is used for comparative analysis. The main objective is to compare these maximums with each other or with a threshold value. In R, we can find the group-wise ...
