Assume, you have two dataframe, first dataframe is id country 0 1 India 1 2 UK 2 3 US 3 4 China second dataframe is id City 0 1 Chennai 1 11 Cambridge 2 22 Chicago 3 4 ChengduAnd the result for merging based on same column is, Merging data based on same column - id id country City 0 1 India Chennai 1 4 China ChengduSolutionTo solve this, we will follow the steps given below −Define a two dataframesMerge two dataframes based on the same column id is defined below, pd.merge(first_df, ... Read More
Assume, you have a dataframe, col1 col2 0 o e 1 e e 2 i u 3 e o 4 i i 5 u o 6 e a 7 u o 8 a u 9 e aThe result for matched index and count is, index is col1 col2 1 e e 4 i i count is 2SolutionTo solve this, we will follow the steps given below −Define a dataframeCompare first and second matching index values using the below method, df.iloc[np.where(df.col1==df.col2)])Find the total count of matched columns using the ... Read More
Assume, you have a dataframe0 1 20 10 20 30 1 40 50 60 2 70 80 90The result for replaced 1 by diagonal of a dataframe is −0 1 2 0 1 20 30 1 40 1 60 2 70 80 1SolutionTo solve this, we will follow the steps given below −Define a dataframeCreate nested for loop to access all rows and columns, for i in range(len(df)): for j in range(len(df)):Check if the condition to match the diagonals, if it is matched then replace the position by 1. It is defined below, if i == j: df.iloc[i ... Read More
Assume you have a dataframe, one two three 0 12 13 5 1 10 6 4 2 16 18 20 3 11 15 58The result for storing the minimum value in new row and column is −Add new column to store min value one two three min_value 0 12 13 5 5 1 10 6 4 4 2 16 18 20 16 3 11 15 58 11 Add new row to store min value one two three min_value 0 ... Read More
Assume you have a sqlite3 database with student records and the result for reading all the data is, Id Name 0 1 stud1 1 2 stud2 2 3 stud3 3 4 stud4 4 5 stud5SolutionTo solve this, we will follow the steps given below −Define a new connection. It is shown below, con = sqlite3.connect("db.sqlite3")Read sql data from the database using below function, pd.read_sql_query()Select all student data from table using read_sql_query with connection, pd.read_sql_query("SELECT * FROM student", con)ExampleLet us see the complete implementation to get a better understanding −import pandas as pd import sqlite3 con = sqlite3.connect("db.sqlite3") df = ... Read More
Assume you have a series and the result for Boolean operations, And operation is: 0 True 1 True 2 False dtype: bool Or operation is: 0 True 1 True 2 True dtype: bool Xor operation is: 0 False 1 False 2 True dtype: boolSolutionTo solve this, we will follow the below approach.Define a SeriesCreate a series with boolean and nan valuesPerform boolean True against bitwise & operation to each element in the series defined below, series_and = pd.Series([True, np.nan, False], dtype="bool") & TruePerform boolean True against bitwise | operation ... Read More
Input −Assume you have a DataFrame, and the result for transpose of index and columns are, Transposed DataFrame is 0 1 0 1 4 1 2 5 2 3 6Solution 1Define a DataFrameSet nested list comprehension to iterate each element in the two-dimensional list data and store it in result.result = [[data[i][j] for i in range(len(data))] for j in range(len(data[0]))Convert the result to DataFrame, df2 = pd.DataFrame(result)ExampleLet us see the complete implementation to get a better understanding −import pandas as pd data = [[1, 2, 3], [4, 5, 6]] df = pd.DataFrame(data) print("Original DataFrame is", df) result = [[data[i][j] ... Read More
Input −Assume you have a series and default float quantilevalue is 3.0SolutionTo solve this, we will follow the steps given below −Define a SeriesAssign quantile default value .5 to the series and calculate the result. It is defined below,data.quantile(.5) ExampleLet us see the complete implementation to get a better understanding −import pandas as pd l = [10,20,30,40,50] data = pd.Series(l) print(data.quantile(.5))Output30.0
Input −Assume, we have a DataFrame and group the records based on the designation is −Designation architect 1 programmer 2 scientist 2SolutionTo solve this, we will follow the below approaches.Define a DataFrameApply groupby method for Designation column and calculate the count as defined below,df.groupby(['Designation']).count()ExampleLet us see the following implementation to get a better understanding.import pandas as pd data = { 'Id':[1,2,3,4,5], 'Designation': ['architect','scientist','programmer','scientist','programmer']} df = pd.DataFrame(data) print("DataFrame is",df) print("groupby based on designation:") print(df.groupby(['Designation']).count())OutputDesignation architect 1 programmer 2 scientist 2
Input −Assume, we have DataFrame with City and State columns and find the city, state name startswith ‘k’ and store into another CSV file as shown below −City, State Kochi, KeralaSolutionTo solve this, we will follow the steps given below.Define a DataFrameCheck the city starts with ‘k’ as defined below, df[df['City'].str.startswith('K') & df['State'].str.startswith('K')] Finally, store the data in the ‘CSV’ file as below, df1.to_csv(‘test.csv’)ExampleLet us see the following implementation to get a better understanding.import pandas as pd import random as r data = { 'City': ['Chennai', 'Kochi', 'Kolkata'], 'State': ['Tamilnad', 'Kerala', 'WestBengal']} df = pd.DataFrame(data) print("DataFrame is", df) df1 = ... Read More