Articles on Trending Technologies

Technical articles with clear explanations and examples

How to extract values from an R data frame column that do not start and end with certain characters?

Nizamuddin Siddiqui
Nizamuddin Siddiqui
Updated on 07-Sep-2020 197 Views

Sometimes we just want to extract the values of a data column based on initial and ending values of a column that has strings or sometimes the values of a column that has strings are recorded with some extra characters and we want to extract those values. For this purpose, we can use negation of grepl with single square brackets.ExampleConsider the below data frame −> x2 df2 head(df2, 20)Outputx2 1 Alabama 2 Alaska 3 American Samoa 4 Arizona 5 Arkansas 6 California 7 Colorado 8 Connecticut 9 Delaware 10 District of Columbia 11 Florida 12 Georgia 13 Guam 14 Hawaii ...

Read More

How to extract strings based on first character from a vector of strings in R?

Nizamuddin Siddiqui
Nizamuddin Siddiqui
Updated on 04-Sep-2020 441 Views

Sometimes a vector strings have patterns and sometimes we need to make patterns from a vector of strings based on the characters. For example, we might want to extract the states name of United States of America from a vector that contains all the names. This can be done by using grepl function.ExampleConsider the below vector containing states name in USA −> US_states US_states[grepl("^A", US_states)] [1] "Alabama" "Alaska" "American Samoa" "Arizona" [5] "Arkansas" > US_states[grepl("^B", US_states)] character(0) > US_states[grepl("^C", US_states)] [1] "California" "Colorado" "Connecticut" > US_states[grepl("^D", US_states)] [1] "Delaware" "District of Columbia" > US_states[grepl("^E", US_states)] character(0) > US_states[grepl("^F", US_states)] [1] ...

Read More

How to find the n number of largest values in an R vector?

Nizamuddin Siddiqui
Nizamuddin Siddiqui
Updated on 04-Sep-2020 375 Views

A vector may have thousands of values and each of them could be different or same also. It is also possible that values can be grouped or randomly selected but having few similar values. Irrespective of the values in a vector, to find some largest values we need to sort the vector in ascending order then the largest values will be selected.Examples> x1 x1 [1] -1.4447473195 3.2906645299 -0.4680055849 0.1611487482 -0.7715094280 [6] 0.4442103640 0.3702444686 0.0783124252 1.3476432299 1.0140576107 [11] -0.0968917066 0.4628821017 0.3102594626 -0.2946001275 0.1498108166 [16] -0.6002154305 0.5905382364 1.3892651534 0.1008921325 -0.6486318692 [21] -0.0562831933 -0.6887431711 0.4907512082 -0.3994662410 0.7827897030 [26] 0.5294704584 -1.3802965730 -0.6159076490 -0.0009408529 1.6182294859 ...

Read More

How to convert a decimal value or a vector of decimal values to fractional form in R?

Nizamuddin Siddiqui
Nizamuddin Siddiqui
Updated on 04-Sep-2020 4K+ Views

A fraction form of a decimal value is the form of the value represented with division sign. For example, representing 0.5 as 1 / 2. In R, we can use fractions function of MASS package to convert a decimal value or a vector of decimal values to fractional form. To do so, we just need to pass the value in fractions function as fractions(“Decimal_value or Vector_Of_Decimal_Values”).Loading MASS package −Examples> library(MASS)Output> fractions(0.14) [1] 7/50 > fractions(1.14) [1] 57/50 > library(MASS) > fractions(0.5) [1] 1/2 > fractions(0.3) [1] 3/10 > fractions(0.31) [1] 31/100 > fractions(1.31) [1] 131/100 > fractions(2.01) [1] 201/100 > ...

Read More

How to create a dot plot using ggplot2 in R?

Nizamuddin Siddiqui
Nizamuddin Siddiqui
Updated on 04-Sep-2020 567 Views

A dot plot is a type of histogram that display dots instead of bars and it is created for small data sets. In ggplot2, we have geom_dotplot function to create the dot plot but we have to pass the correct binwidth which is an argument of the geom_dotplot, so that we don’t get the warning saying “Warning: Ignoring unknown parameters: bins `stat_bindot()` using `bins = 30`. Pick better value with `binwidth`.”ExampleConsider the below data frame −> x df1 library(ggplot2)Creating the dot plot of x −> ggplot(df1, aes(x))+geom_dotplot(binwidth=0.2)OutputLet’s have a look at one more example −> y df2 ggplot(df2, aes(y))+geom_dotplot(binwidth=0.2)Output

Read More

How to delete different rows and columns of a matrix using a single line code in R?

Nizamuddin Siddiqui
Nizamuddin Siddiqui
Updated on 04-Sep-2020 1K+ Views

Deletion or addition of rows and columns in a matrix of any size is mostly done by using single square brackets and it is also the easiest way. To delete rows and columns, we just need to use the column index or row index and if we want to delete more than one of them then we can separate them by commas by inserting them inside c as c(-1, -2). If we want to delete more than one rows or columns in a sequence then a colon can be used. Examples > M M Output ...

Read More

How to extract the frequencies from a histogram in R?

Nizamuddin Siddiqui
Nizamuddin Siddiqui
Updated on 04-Sep-2020 375 Views

When we create a histogram and save it in an object name then we can extract the frequencies as count for the mid values or breaks by calling that object. We can consider that mid values or breaks obtained by the object are the actual value against which the frequencies are plotted on the histogram.Examples> x1 Histogram1 Histogram1Output$breaks [1] 0 1 2 3 4 5 6 7 8 9 10 11 12 13 $counts [1] 45 82 150 156 172 142 113 62 43 20 9 5 1 $density [1] 0.045 0.082 0.150 0.156 0.172 0.142 0.113 0.062 0.043 0.020 ...

Read More

How to add a horizontal line to the plot created by ggplot2 in R?

Nizamuddin Siddiqui
Nizamuddin Siddiqui
Updated on 04-Sep-2020 417 Views

When we create a plot, it shows the values passed by the function for creating the plot but we might want to display some other values to provide some information through the plot and that information could be a threshold value as a horizontal line or we can also call it a cut off value. This can be done by using geom_hline function of ggplot2 package.ExampleConsider the below data frame −> x y df dfOutput      x       y 1 0.27810573 2.6545571 2 1.39185082 3.4845292 3 -0.19068920 1.7043852 4 1.00791317 1.4324814 5 -1.74964913 1.7996093 6 -0.13123079 2.5004350 ...

Read More

How to split a string vector that contain strings of equal sizes and have spaces between values then extract only few values in R?

Nizamuddin Siddiqui
Nizamuddin Siddiqui
Updated on 04-Sep-2020 239 Views

A string vector can contain any value including spaces. Sometimes a string vector is read with some spaces as well and we want to split the vector then extract few values. For example, if a string has “ABC 123” then we might want to extract the number 123 so that we can use it in analysis. If the string vector has strings of equal sizes then it can be easily done with the help of substr function.Examples> x1 x1 [1] "1 00" "1 01" "1 02" "1 03" "1 03" "1 04" > Numeric_Last_two_x1 Numeric_Last_two_x1 [1] "00" "01" "02" "03" ...

Read More

How to convert unit of measurements of a value or a vector in R?

Nizamuddin Siddiqui
Nizamuddin Siddiqui
Updated on 04-Sep-2020 848 Views

There are many units of measurements for a single object or item. For example, weight can be measured in milligrams, grams, kilograms, tons, oz, lbs, etc. Now suppose we have two variables that belong to the same unit of measurement as weight of a coca cola cans and weight of apple juice, if the weights given for both of these variables have different units like one having grams and the other having oz then we might want to convert one of them. This will help us to compare both the variables easily without conflicting the scale of measurements. Therefore, we ...

Read More
Showing 51401–51410 of 61,297 articles
Advertisements