Programming Articles - Page 1410 of 3366

Write a Python program to quantify the shape of a distribution in a dataframe

Python Pandas Server Side Programming Programming

Updated on 25-Feb-2021 05:44:50

315 Views

Assume you have a dataframe and the result of quantifying the shape of its distribution is:

kurtosis is:
Column1   -1.526243
Column2    1.948382
dtype: float64
asymmetry distribution - skewness is:
Column1   -0.280389
Column2    1.309355
dtype: float64

Solution

To solve this, we will follow the steps given below:

1. Define a dataframe.
2. Apply df.kurt(axis=0) to calculate the kurtosis of each column, which describes the shape (tail weight) of the distribution.
3. Apply df.skew(axis=0) to calculate the unbiased skew over axis 0, which measures the asymmetry of the distribution.

Example

Let's see the following code to get a better understanding:

import pandas as pd
data = {"Column1":[12, 34, 56, 78, 90], "Column2":[23, 30, 45, ...
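
The teaser cuts the example off, so here is a minimal sketch of the two steps. Column1 is taken from the text above; the Column2 values are hypothetical placeholders, since the original list is truncated, and will not reproduce the Column2 figures shown.

```python
import pandas as pd

data = {
    "Column1": [12, 34, 56, 78, 90],    # values from the article
    "Column2": [23, 30, 45, 50, 120],   # hypothetical: the original list is truncated
}
df = pd.DataFrame(data)

# Unbiased excess kurtosis of each column (axis=0 works column by column)
kurtosis = df.kurt(axis=0)

# Unbiased skewness of each column
skewness = df.skew(axis=0)

print("kurtosis is:")
print(kurtosis)
print("asymmetry distribution - skewness is:")
print(skewness)
```

For Column1 this should give about -1.526243 for kurtosis and -0.280389 for skewness, in line with the output quoted above.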

Write a Python program to find the mean absolute deviation of rows and columns in a dataframe

Python Pandas Server Side Programming Programming

Vani Nalliappan

Updated on 25-Feb-2021 05:42:20

523 Views

Assume you have a dataframe and the mean absolute deviation of its rows and columns is:

mad of columns:
Column1    0.938776
Column2    0.600000
dtype: float64
mad of rows:
0    0.500
1    0.900
2    0.650
3    0.900
4    0.750
5    0.575
6    1.325
dtype: float64

Solution

To solve this, we will follow the steps given below:

1. Define a dataframe.
2. Calculate the mean absolute deviation of each column with df.mad().
3. Calculate the mean absolute deviation of each row with df.mad(axis=1).

Example

Let's see the following code to get a better understanding:

import pandas as pd
data = {"Column1":[6, 5.3, 5.9, 7.8, 7.6, 7.45, 7.75], ...
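
DataFrame.mad() was deprecated in pandas 1.5 and removed in pandas 2.0, so the calls above only work on older versions. The sketch below computes the same quantity by hand and should run on both old and new releases. The helper name mean_abs_dev and the Column2 values are assumptions made for illustration; Column1 is taken from the text above.

```python
import pandas as pd

def mean_abs_dev(frame, axis=0):
    """Mean absolute deviation around the mean along `axis` (hypothetical helper)."""
    # Subtract each column mean (axis=0) or each row mean (axis=1);
    # the means must be aligned on the opposite axis when subtracting.
    align_on = 1 if axis == 0 else 0
    centered = frame.sub(frame.mean(axis=axis), axis=align_on)
    return centered.abs().mean(axis=axis)

data = {
    "Column1": [6, 5.3, 5.9, 7.8, 7.6, 7.45, 7.75],  # values from the article
    "Column2": [5.0, 7.1, 7.2, 6.0, 6.1, 6.3, 5.1],  # hypothetical: original is truncated
}
df = pd.DataFrame(data)

mad_columns = mean_abs_dev(df, axis=0)
mad_rows = mean_abs_dev(df, axis=1)

print("mad of columns:")
print(mad_columns)
print("mad of rows:")
print(mad_rows)
```

For Column1 the result should be about 0.938776, which agrees with the output quoted above.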

Write a Python program to find the average of first row in a Panel

Python Pandas Server Side Programming Programming

Vani Nalliappan

Updated on 25-Feb-2021 05:37:27

292 Views

Assume you have a Panel and the average of its first row is:

Average of first row is:
Column1    0.274124
dtype: float64

Solution

To solve this, we will follow the steps given below:

1. Set the data as a dictionary whose key is 'Column1' and whose value is pd.DataFrame(np.random.randn(5, 3)):
   data = {'Column1' : pd.DataFrame(np.random.randn(5, 3))}
2. Assign the data to a Panel and save it as p:
   p = pd.Panel(data)
3. Print the item using the dict key Column1:
   print(p['Column1'])
4. Calculate the average of the first row using major_xs(0):
   p.major_xs(0).mean()

Example

Let's see the following code to get a better understanding:

import pandas as pd
import numpy as np
data = {'Column1' : pd.DataFrame(np.random.randn(5, 3))}
p = pd.Panel(data)
print("Panel values:")
...
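
pd.Panel was removed in pandas 0.25, so the steps above fail on current releases. The sketch below is one possible stand-in, not the article's method: it keeps the dictionary of dataframes and rebuilds what p.major_xs(0) would return (row 0 of every item, with the items as columns). The fixed random seed is an assumption added so that runs are repeatable.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)  # fixed seed: an assumption, for repeatable output
data = {"Column1": pd.DataFrame(rng.standard_normal((5, 3)))}

print("Panel values:")
print(data["Column1"])

# Stand-in for p.major_xs(0): take row 0 of every item and place the items
# side by side as columns, indexed by the original column labels.
first_row = pd.DataFrame({name: frame.loc[0] for name, frame in data.items()})

average_first_row = first_row.mean()
print("Average of first row is:")
print(average_first_row)
```

Because the data is random, the printed average will differ from the 0.274124 quoted above.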

Write a program in Python to find the minimum rank of a particular column in a dataframe

Python Pandas Server Side Programming Programming

Vani Nalliappan

Updated on 25-Feb-2021 05:33:26

360 Views

Assume you have a dataframe and the minimum rank of a particular column is:

   Id     Name  Age  Rank
0   1     Adam   12   1.0
1   2    David   13   3.0
2   3  Michael   14   5.0
3   4    Peter   12   1.0
4   5  William   13   3.0

Solution

To solve this, we will follow the steps given below:

1. Define a dataframe.
2. Call the rank function on the df['Age'] column with method='min', so that tied values all receive the lowest rank in their group:
   df["Age"].rank(axis=0, method ='min', ascending=True)

Example

Let's see the following code to get a better understanding:

import pandas as pd
data = {'Id': [1, 2, 3, ...
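
The example is cut off, so here is a minimal sketch of the same idea. The input values are read back from the result table above, which is an assumption about what the original code contained.

```python
import pandas as pd

# Input reconstructed from the result table shown in the article
data = {
    "Id": [1, 2, 3, 4, 5],
    "Name": ["Adam", "David", "Michael", "Peter", "William"],
    "Age": [12, 13, 14, 12, 13],
}
df = pd.DataFrame(data)

# method="min": every member of a tie gets the smallest rank of that tie,
# and the next distinct value skips ahead by the size of the tie.
df["Rank"] = df["Age"].rank(method="min", ascending=True)

print(df)
```

The two 12-year-olds share rank 1.0, the two 13-year-olds share rank 3.0, and the 14-year-old gets rank 5.0.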

Write a program in Python to create a panel from a dictionary of dataframe and print the maximum value of the first column

Python Pandas Server Side Programming Programming

Vani Nalliappan

Updated on 25-Feb-2021 05:32:24

205 Views

The result for the maximum value of the first column in a panel is:

maximum value of first column is:
Column1    1.377292

Solution

To solve this, we will follow the approach below:

1. Set the data as a dictionary whose key is 'Column1' and whose value is pd.DataFrame(np.random.randn(5, 3)):
   data = {'Column1' : pd.DataFrame(np.random.randn(5, 3))}
2. Assign the data to a Panel and save it as p:
   p = pd.Panel(data)
3. Print the item using the dict key Column1:
   print(p['Column1'])
4. Calculate the maximum value of the first column using minor_xs(0):
   p.minor_xs(0).max()

Example

Let's see the following code to get a better understanding:

import pandas as pd
import numpy as np
data = {'Column1' : pd.DataFrame(np.random.randn(5, 3))}
p = pd.Panel(data)
print("Panel ...
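
Since pd.Panel no longer exists in pandas 0.25 and later, the sketch below shows one way to get the same result from a dictionary of dataframes. It is an alternative under stated assumptions, not the article's code: the items are concatenated into a dataframe with two-level columns, and xs plays the role of minor_xs(0). The fixed seed is also an assumption.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)  # fixed seed: an assumption, for repeatable output
data = {"Column1": pd.DataFrame(rng.standard_normal((5, 3)))}

# Concatenate the items side by side; the columns become a two-level
# MultiIndex of (item name, original column label).
wide = pd.concat(data, axis=1)
print(wide["Column1"])

# Stand-in for p.minor_xs(0): select original column 0 from every item.
first_column = wide.xs(0, axis=1, level=1)

max_first_column = first_column.max()
print("maximum value of first column is:")
print(max_first_column)
```

The data is random, so the printed maximum will not match the 1.377292 quoted above.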

Write a program in Python to shift a dataframe index by two periods in positive and negative direction

Python Pandas Server Side Programming Programming

Vani Nalliappan

Updated on 25-Feb-2021 05:29:56

173 Views

Assume you have a dataframe and the result of shifting it by two periods in the positive and negative directions is:

shift the index by two periods in positive direction
                      Id   Age
2020-01-01 00:00:00  NaN   NaN
2020-01-01 12:00:00  NaN   NaN
2020-01-02 00:00:00  1.0  10.0
2020-01-02 12:00:00  2.0  12.0
2020-01-03 00:00:00  3.0  14.0
shift the index by two periods in negative direction
                      Id   Age
2020-01-01 00:00:00  3.0  14.0
2020-01-01 12:00:00  4.0  11.0
2020-01-02 00:00:00  5.0  13.0
2020-01-02 12:00:00  NaN   NaN
2020-01-03 00:00:00  NaN   NaN

Solution

To ...
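
The solution text is truncated, so the following is only a sketch of one way to produce the tables above. The input dataframe is inferred from the output (five rows at 12-hour intervals) and DataFrame.shift is an assumed choice of method.

```python
import pandas as pd

# Five timestamps, 12 hours apart, inferred from the output tables
index = pd.date_range("2020-01-01", periods=5, freq=pd.Timedelta(hours=12))
df = pd.DataFrame(
    {"Id": [1, 2, 3, 4, 5], "Age": [10, 12, 14, 11, 13]},
    index=index,
)

# Positive periods push the values down; the first two rows become NaN.
shifted_forward = df.shift(periods=2)

# Negative periods pull the values up; the last two rows become NaN.
shifted_backward = df.shift(periods=-2)

print("shift by two periods in positive direction")
print(shifted_forward)
print("shift by two periods in negative direction")
print(shifted_backward)
```

Without a freq argument, shift moves the values and leaves the index labels where they are, which is the behaviour the tables show.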

Write a program in Python to remove first duplicate rows in a given dataframe

Python Pandas Server Side Programming Programming

Vani Nalliappan

Updated on 25-Feb-2021 05:28:07

306 Views

Assume you have a dataframe and the result of removing the first occurrence of each duplicate row is:

    Id  Age
0    1   12
3    4   13
4    5   14
5    6   12
6    2   13
7    7   16
8    3   14
9    9   15
10  10   14

Solution

To solve this, we will follow the steps given below:

1. Define a dataframe.
2. Apply the drop_duplicates function to the Id and Age columns with keep='last', so that the last occurrence of each duplicate is kept and the earlier ones are removed:
   df.drop_duplicates(subset=['Id', 'Age'], keep='last')
3. Store the result in the same dataframe and print it.

Example

Let's see the below implementation to get a better understanding:

import pandas ...
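
Here is a minimal sketch of those steps. The input values are an assumption: they are reconstructed so that rows 1 and 2 duplicate rows 6 and 8, which is the only way the index gaps in the result above make sense.

```python
import pandas as pd

# Assumed input: rows 1 and 2 are earlier copies of rows 6 and 8
df = pd.DataFrame({
    "Id":  [1, 2, 3, 4, 5, 6, 2, 7, 3, 9, 10],
    "Age": [12, 13, 14, 13, 14, 12, 13, 16, 14, 15, 14],
})

# keep="last" retains the final occurrence of each (Id, Age) pair,
# so the earlier duplicates are the rows that get dropped.
df = df.drop_duplicates(subset=["Id", "Age"], keep="last")

print(df)
```

The original index labels are preserved, which is why the result jumps from 0 straight to 3.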

Write a program in Python to compute grouped data covariance and calculate grouped data covariance between two columns in a given dataframe

Python Pandas Server Side Programming Programming

Vani Nalliappan

Updated on 25-Feb-2021 05:26:51

390 Views

Assume you have a dataframe and the result of calculating the covariance of the grouped data, and between two of its columns, is:

Grouped data covariance is:
                mark1       mark2
subjects
maths    mark1   25.0   12.500000
         mark2   12.5  108.333333
science  mark1   28.0   50.000000
         mark2   50.0  233.333333

Grouped data covariance between two columns:
subjects
maths      12.5
science    50.0
dtype: float64

Solution

To solve this, we will follow the steps given below:

1. Define a dataframe.
2. Apply groupby function inside dataframe subjects ...
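
The steps are cut off, so this is only a sketch of one way to get both results. The mark values are hypothetical: the article's data is not visible, and these numbers were picked so that the covariances come out as in the output above. Reading the pairwise value out of the grouped covariance matrix is likewise an assumed approach.

```python
import pandas as pd

# Hypothetical data chosen to reproduce the covariances shown in the article
df = pd.DataFrame({
    "subjects": ["maths", "maths", "maths", "science", "science", "science"],
    "mark1": [80, 90, 85, 70, 78, 80],
    "mark2": [70, 75, 90, 60, 90, 70],
})

# Covariance matrix of the mark columns within each subject
group_cov = df.groupby("subjects")[["mark1", "mark2"]].cov()
print("Grouped data covariance is:")
print(group_cov)

# Covariance between mark1 and mark2 per subject: take the mark1 row of
# every group's matrix and read its mark2 entry.
pair_cov = group_cov.xs("mark1", level=1)["mark2"]
print("Grouped data covariance between two columns:")
print(pair_cov)
```

Selecting the two numeric columns before calling cov() keeps the grouping column out of the calculation.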

Write a Python program to reshape a given dataframe in different ways

Python Pandas Server Side Programming Programming

Vani Nalliappan

Updated on 25-Feb-2021 05:24:37

472 Views

We can reshape a dataframe using the melt(), stack(), unstack() and pivot() functions.

Solution 1

1. Define a dataframe.
2. Apply the melt() function to convert the columns of a wide dataframe into rows:
   df.melt()

Example

Let's see the below code to get a better understanding:

import pandas as pd
df = pd.DataFrame({'Id':[1, 2, 3], 'Age':[13, 14, 13], 'Mark':[80, 90, 85]})
print("Dataframe is:", df)
print(df.melt())

Output

Dataframe is:    Id  Age  Mark
0   1   13    80
1   2   14    90
2   3   13    85
  variable  value
0       Id      1
1       Id      2
2       Id      3
3      Age     13
4 ...
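
The remaining solutions are not visible in the teaser. As a rough sketch, the other three functions named above could be applied to the same dataframe as follows; the choice of Id, Age and Mark as the pivot arguments is an assumption made for illustration.

```python
import pandas as pd

df = pd.DataFrame({"Id": [1, 2, 3], "Age": [13, 14, 13], "Mark": [80, 90, 85]})

# melt: wide to long, one (variable, value) row per original cell
melted = df.melt()

# stack: move the column labels into an inner index level (row, column)
stacked = df.stack()

# unstack: on a frame with a flat index this gives a Series keyed by (column, row)
unstacked = df.unstack()

# pivot: assumed arguments; Id becomes the index, Age values become columns
pivoted = df.pivot(index="Id", columns="Age", values="Mark")

print(melted)
print(stacked)
print(unstacked)
print(pivoted)
```

pivot leaves NaN wherever an Id and Age combination does not occur in the data.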

Write a program to truncate a dataframe time series data based on index value

Python Pandas Server Side Programming Programming

Vani Nalliappan

Updated on 24-Feb-2021 10:35:10

338 Views

Assume you have a dataframe with time series data and the result of truncating it is:

before truncate:
   Id time_series
0   1  2020-01-05
1   2  2020-01-12
2   3  2020-01-19
3   4  2020-01-26
4   5  2020-02-02
5   6  2020-02-09
6   7  2020-02-16
7   8  2020-02-23
8   9  2020-03-01
9  10  2020-03-08
after truncate:
   Id time_series
1   2  2020-01-12

Solution

To solve this, we will follow the steps given below:

1. Define a dataframe.
2. Call the date_range function with start='01/01/2020', periods=10 and freq='W'. It generates ten weekly dates, each falling on a Sunday on or after the given start date, which are stored as df['time_series']:
   df['time_series'] ...
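
The rest of the solution is truncated. The sketch below builds the dataframe as described and then keeps only index label 1; the before=1 and after=1 arguments are an assumption chosen to match the "after truncate" output, since the original call is not shown.

```python
import pandas as pd

df = pd.DataFrame({"Id": list(range(1, 11))})

# freq="W" is anchored on Sunday, so the first date is 2020-01-05
df["time_series"] = pd.date_range(start="01/01/2020", periods=10, freq="W")
print("before truncate:")
print(df)

# truncate works on index labels: drop everything before label 1 and after label 1
# (assumed arguments, picked to reproduce the single-row result)
truncated = df.truncate(before=1, after=1)
print("after truncate:")
print(truncated)
```

truncate expects a sorted index, which the default 0 to 9 range index already is.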