Server Side Programming Articles - Page 1267 of 2646

How to create a subset using character column with multiple matches in R?

R Programming Server Side Programming Programming

Updated on 11-Feb-2021 12:02:55

740 Views

Subsetting is one of the most important aspects of data analysis. One such situation could be subsetting the character column based on multiple values. For example, if a character column of an R data frame has 5 categories then we might want to extract only 2 or 3 or 4 values then it can be done by using the filter function of dplyr package with str_detect function of stringr package.Consider the below data frame −Example Live DemoGroup

How to find the frequency vector elements that exists in another vector in R?

R Programming Server Side Programming Programming

Nizamuddin Siddiqui

Updated on 11-Feb-2021 11:59:10

230 Views

If a vector value exists in another vector then we might want to find the frequency/count for such values in the other vector. For example, if we have two vectors say x and y, and some of the values in y exists in x as well. Therefore, we can find the frequency of values in x for y values can be found by using the command colSums(outer(x,y,"==")).Example Live Demox1

How to plot time series data with labels in R?

R Programming Server Side Programming Programming

Nizamuddin Siddiqui

Updated on 11-Feb-2021 11:54:39

599 Views

If we have time series data stored in a data frame then plotting the same as a time series cannot be done directly, also the labels for the series might not be possible directly. Therefore, we first need to convert the data frame to a time series object by using the function ts as shown in the below example and then using the plot function to create the plot, this will display the labels for the series as well.Consider the below data frame −Example Live DemoTime

How to find the subtotal in R?

R Programming Server Side Programming Programming

Nizamuddin Siddiqui

Updated on 11-Feb-2021 11:50:16

740 Views

By subtotal we mean finding the sum of values based on grouping column. For example, if we have a data frame called df that contains three numerical columns as x, y, z and one categorical column say Group then the subtotal of x, y, z for each category in Group can be found by using the command aggregate(cbind(x,y,z)~Group,data=df,FUN=sum).Consider the below data frame −Example Live Demox1

How to create a random vector of integers with increasing values only in R?

R Programming Server Side Programming Programming

Nizamuddin Siddiqui

Updated on 11-Feb-2021 11:44:49

311 Views

To create a random vector of integers with increasing values, we can do random sampling with sample.int and for increasing values cummax function needs to be used. For example, to create a random vector of integers of size 5 up to values 5 starting from 1 can be done by using the command cummax(sample.int(5)).Example Live Demox1

How to get the list of available data frames in R environment?

R Programming Server Side Programming Programming

Nizamuddin Siddiqui

Updated on 11-Feb-2021 11:42:02

2K+ Views

When we perform any type of data analysis, there are many types of objects that are created in the R environment such as vector, data frame, matrix, lists, arrays, etc. If we want to get the list of available data frames in R environment then we can use the below command −names(which(unlist(eapply(.GlobalEnv,is.data.frame))))Example Live Demox1

How to convert numeric columns to factor using dplyr package in R?

R Programming Server Side Programming Programming

Nizamuddin Siddiqui

Updated on 11-Feb-2021 11:36:42

4K+ Views

If we have a numeric column in an R data frame and the unique number of values in the column is low that means the numerical column can be treated as a factor. Therefore, we can convert numeric columns to factor. To do this using dplyr package, we can use mutate_if function of dplyr package.Loading dplyr package and converting numerical columns in BOD data set (available in base R) to factor columns −Examplelibrary(dplyr) str(BOD) 'data.frame': 6 obs. of 2 variables: $ Time : num 1 2 3 4 5 7 $ demand: num 8.3 10.3 19 16 15.6 19.8 - ... Read More

Write a Python program to trim the minimum and maximum threshold value in a dataframe

Python Pandas Server Side Programming Programming

Vani Nalliappan

Updated on 25-Feb-2021 05:46:06

485 Views

Assume, you have a dataframe and the result for trim of minimum and the maximum threshold value, minimum threshold: Column1 Column2 0 30 30 1 34 30 2 56 30 3 78 50 4 30 90 maximum threshold: Column1 Column2 0 12 23 1 34 30 2 50 25 3 50 50 4 28 50 clipped dataframe is: Column1 Column2 0 30 30 1 34 30 2 50 30 3 ... Read More

Write a Python program to quantify the shape of a distribution in a dataframe

Python Pandas Server Side Programming Programming

Vani Nalliappan

Updated on 25-Feb-2021 05:44:50

358 Views

Assume, you have a dataframe and the result for quantify shape of a distribution is, kurtosis is: Column1 -1.526243 Column2 1.948382 dtype: float64 asymmetry distribution - skewness is: Column1 -0.280389 Column2 1.309355 dtype: float64SolutionTo solve this, we will follow the steps given below −Define a dataframeApply df.kurt(axis=0) to calculate the shape of distribution, df.kurt(axis=0)Apply df.skew(axis=0) to calculate unbiased skew over axis-0 to find asymmetry distribution, df.skew(axis=0)ExampleLet’s see the following code to get a better understanding −import pandas as pd data = {"Column1":[12, 34, 56, 78, 90], "Column2":[23, 30, 45, ... Read More

Write a Python program to find the mean absolute deviation of rows and columns in a dataframe

Python Pandas Server Side Programming Programming

Vani Nalliappan

Updated on 25-Feb-2021 05:42:20

551 Views

SolutionAssume you have a dataframe and mean absolute deviation of rows and column is, mad of columns: Column1 0.938776 Column2 0.600000 dtype: float64 mad of rows: 0 0.500 1 0.900 2 0.650 3 0.900 4 0.750 5 0.575 6 1.325 dtype: float64To solve this, we will follow the steps given below −Define a dataframeCalculate mean absolute deviation of row as, df.mad()Calculate mean absolute deviation of row as, df.mad(axis=1)ExampleLet’s see the following code to get a better understanding −import pandas as pd data = {"Column1":[6, 5.3, 5.9, 7.8, 7.6, 7.45, 7.75], ... Read More