Python Data Science basics with Numpy, Pandas and Matplotlib
Created by Abhilash Nelson, Last Updated 23-Oct-2019, Language:English
Python Data Science basics with Numpy, Pandas and Matplotlib
Covers all Essential Python topics and Libraries for Data Science or Machine Learning Beginner.
Created by Abhilash Nelson, Last Updated 23-Oct-2019, Language:English
What Will I Get ?
- Essential Python data types and data structure basics with Libraries like NumPy and Pandas for Data Science or Machine Learning Beginner.
- Data science enthusiasts who want to begin their career
Requirements
- A decent configuration computer and the willingness to lay the corner stone for your big data journey.
Description
Welcome to my new course Python Essentials with Pandas and Numpy for Data Science
In this course, we will learn the basics of Python Data Structures and the most important Data Science libraries like NumPy and Pandas with step by step examples!
The first session will be a theory session in which, we will have an introduction to python, its applications and the libraries.
In the next session, we will proceed with installing python in your computer. We will install and configure anaconda which is a platform you can use for quick and easy installation of python and its libraries. We will get ourselves familiar with Jupiter notebook, which is the IDE that we are using throughout this course for python coding.
Then we will go ahead with the basic python data types like strings, numbers and its operations. We will deal with different types of ways to assign and access strings, string slicing, replacement, concatenation, formatting and f strings.
Dealing with numbers, we will discuss the assignment, accessing and different operations with integers and floats. The operations include basic ones and also advanced ones like exponents. Also we will check the order of operations, increments and decrements, rounding values and type casting.
Then we will proceed with basic data structures in python like Lists tuples and set. For lists, we will try different assignment, access and slicing options. Along with popular list methods, we will also see list extension, removal, reversing, sorting, min and max, existence check , list looping, slicing, and also inter-conversion of list and strings.
For Tuples also we will do the assignment and access options and the proceed with different options with set in python.
After that, we will deal with python dictionaries. Different assignment and access methods. Value update and delete methods and also looping through the values in the dictionary.
And after learning all of these basic data types and data structures, its time for us to proceed with the popular libraries for data-science in python. We will start with the NumPy library. We will check different ways to create a new NumPy array, reshaping , transforming list to arrays, zero arrays and one arrays, different array operations, array indexing, slicing, copying. we will also deal with creating and reshaping multi dimensional NumPy arrays, array transpose, and statistical operations like mean variance etc using NumPy
Later we will go ahead with the next popular python library called Pandas. At first we will deal with the one dimensional labelled array in pandas called as the series. We will create assign and access the series using different methods.
Then will go ahead with the Pandas Data frames, which is a 2-dimensional labelled data structure with columns of potentially different types. We will convert NumPy arrays and also pandas series to data frames. We will try column wise and row wise access options, dropping rows and columns, getting the summary of data frames with methods like min, max etc. Also we will convert a python dictionary into a pandas data frame. In large datasets, its common to have empty or missing data. We will see how we can manage missing data within dataframes. We will see sorting and indexing operations for data frames.
Most times, external data will be coming in either a CSV file or a JSON file. We will check how we can import CSV and JSON file data as a dataframe so that we can do the operations and later convert this data frame to either CSV and json objects and write it into the respective files.
Also we will see how we can concatenate, join and merge two pandas data frames. Then we will deal with data stacking and pivoting using the data frame and also to deal with duplicate values within the data-frame and to remove them selectively.
We can group data within a data-frame using group by methods for pandas data frame. We will check the steps we need to follow for grouping. Similarly we can do aggregation of data in the data-frame using different methods available and also using custom functions. We will also see other grouping techniques like Binning and bucketing based on data in the data-frame
At times we may need to use custom indexing for our dataframe. We will see methods to re-index rows and columns of a dataframe and also rename column indexes and rows. We will also check methods to do collective replacement of values in a dataframe and also to find the count of all or unique values in a dataframe.
Then we will proceed with implementing random permutation using both the NumPy and Pandas library and the steps to follow. Since an excelsheet and a dataframe are similar 2d arrays, we will see how we can load values in a dataframe from an excelsheet by parsing it. Then we will do condition based selection of values in a dataframe, also by using lambda functions and also finding rank based on columns.
Then we will go ahead with cross Tabulation of our dataframe using contingency tables. The steps we need to proceed with to create the cross tabulation contingency table.
After all these operations in the data we have, now its time to visualize the data. We will do exercises in which we can generate graphs and plots. We will be using another popular python library called Matplotlib to generate graphs and plots. We will do tweaking of the grpahs and plots by adjusting the plot types, its parameters, labels, titles etc.
Then we will use another visualization option called histogram which can be used to groups numbers into ranges. We will also be trying different options provided by matplotlib library for histogram
Overall this course is a perfect starter pack for your long journey ahead with big data and machine learning. You will also be getting an experience certificate after the completion of the course(only if your learning platform supports)
Course Content
-
Course Introduction and Table of Contents
1 Lectures 00:09:29-
Course Introduction and Table of Contents
Preview00:09:29
-
-
Introduction to Python, Pandas and Numpy
1 Lectures 00:06:53-
Introduction to Python, Pandas and Numpy
Preview00:06:53
-
-
System and Environment Setup
1 Lectures 00:08:25-
System and Environment Setup
Preview00:08:25
-
-
Python Strings
2 Lectures 00:19:36-
Python Strings - Part 1
Preview00:10:46 -
Python Strings - Part 2
00:08:50
-
-
Python Numbers and Operators
2 Lectures 00:13:28-
Python Numbers and Operators - Part 1
00:06:29 -
Python Numbers and Operators - Part 2
00:06:59
-
-
Python Lists
5 Lectures 00:31:03-
Python Lists - Part 1
00:05:08 -
Python Lists - Part 2
00:06:07 -
Python Lists - Part 3
00:05:28 -
Python Lists - Part 4
00:06:53 -
Python Lists - Part 5
00:07:27
-
-
Tuples in Python
1 Lectures 00:05:35-
Tuples in Python
00:05:35
-
-
Sets in Python
2 Lectures 00:08:58-
Sets in Python - Part 1
00:05:00 -
Sets in Python - Part 2
00:03:58
-
-
Python Dictionary
2 Lectures 00:13:29-
Python Dictionary - Part 1
00:06:35 -
Python Dictionary - Part 2
00:06:54
-
-
NumPy Library Introduction
3 Lectures 00:16:43-
NumPy Library Intro - Part 1
00:04:51 -
NumPy Library Intro - Part 2
00:05:25 -
NumPy Library Intro - Part 3
00:06:27
-
-
NumPy Array Operations and Indexing
2 Lectures 00:09:44-
NumPy Array Operations and Indexing - Part 1
00:04:10 -
NumPy Array Operations and Indexing - Part 2
00:05:34
-
-
NumPy Multi-Dimensional Arrayss
3 Lectures 00:18:06-
NumPy Multi-Dimensional Arrays - Part 1
00:07:15 -
NumPy Multi-Dimensional Arrays - Part 2
00:05:33 -
NumPy Multi-Dimensional Arrays - Part 3
00:05:18
-
-
Introduction to Pandas Series
1 Lectures 00:08:09-
Introduction to Pandas Series
00:08:09
-
-
Introduction to Pandas Dataframes
1 Lectures 00:07:07-
Introduction to Pandas Dataframes
00:07:07
-
-
Pandas Dataframe conversion and drop
3 Lectures 00:19:02-
Pandas Dataframe conversion and drop - Part 1
00:06:15 -
Pandas Dataframe conversion and drop - Part 2
00:05:34 -
Pandas Dataframe conversion and drop - Part 3
00:07:13
-
-
Pandas Dataframe summary and selection
3 Lectures 00:18:12-
Pandas Dataframe summary and selection - Part 1
00:05:45 -
Pandas Dataframe summary and selection - Part 2
00:05:30 -
Pandas Dataframe summary and selection - Part 3
00:06:57
-
-
Pandas Missing Data Management and Sorting
2 Lectures 00:13:18-
Pandas Missing Data Management and Sorting - Part 1
00:06:37 -
Pandas Missing Data Management and Sorting - Part 2
00:06:41
-
-
Pandas Hierarchical-Multi Indexing
1 Lectures 00:05:56-
Pandas Hierarchical-Multi Indexing
00:05:56
-
-
Pandas CSV File Read Write
2 Lectures 00:12:19-
Pandas CSV File Read Write - Part 1
00:05:28 -
Pandas CSV File Read Write - Part 2
00:06:51
-
-
Pandas JSON File Read Write
1 Lectures 00:06:41-
Pandas JSON File Read Write Operations
00:06:41
-
-
Pandas Concatenation Merging and Joining
3 Lectures 00:13:15-
Pandas Concatenation Merging and Joining - Part 1
00:04:38 -
Pandas Concatenation Merging and Joining - Part 2
00:04:15 -
Pandas Concatenation Merging and Joining - Part 3
00:04:22
-
-
Pandas Stacking and Pivoting
2 Lectures 00:11:34-
Pandas Stacking and Pivoting - Part 1
00:05:22 -
Pandas Stacking and Pivoting - Part 2
00:06:12
-
-
Pandas Duplicate Data Management
1 Lectures 00:07:19-
Pandas Duplicate Data Management
00:07:19
-
-
Pandas Mapping
1 Lectures 00:04:06-
Pandas Mapping
00:04:06
-
-
Pandas Groupby
1 Lectures 00:05:44-
Pandas Groupby
00:05:44
-
-
Pandas Aggregation
1 Lectures 00:08:33-
Pandas Aggregation
00:08:33
-
-
Pandas Binning or Bucketing
1 Lectures 00:07:35-
Pandas Binning or Bucketing
00:07:35
-
-
Pandas Re-index and Rename
2 Lectures 00:09:00-
Pandas Re-index and Rename - Part 1
00:04:04 -
Pandas Re-index and Rename - Part 2
00:04:56
-
-
Pandas Replace Values
1 Lectures 00:04:36-
Pandas Replace Values
00:04:36
-
-
Pandas Dataframe Metrics
1 Lectures 00:06:48-
Pandas Dataframe Metrics
00:06:48
-
-
Pandas Random Permutation
1 Lectures 00:08:15-
Pandas Random Permutation
00:08:15
-
-
Pandas Excel sheet Import
1 Lectures 00:07:13-
Pandas Excel sheet Import
00:07:13
-
-
Pandas Condition Selection and Lambda Function
2 Lectures 00:09:23-
Pandas Condition Selection and Lambda Function - Part 1
00:04:33 -
Pandas Condition Selection and Lambda Function - Part 2
00:04:50
-
-
Pandas Ranks Min Max
1 Lectures 00:06:02-
Pandas Ranks Min Max
00:06:02
-
-
Pandas Cross Tabulation
1 Lectures 00:06:31-
Pandas Cross Tabulation
00:06:31
-
-
Graphs and plots using Matplotlib
2 Lectures 00:08:41-
Graphs and plots using Matplotlib - Part 1
00:06:24 -
Graphs and plots using Matplotlib - Part 2
00:02:17
-
-
Matplotlib Histograms
1 Lectures 00:03:20-
Matplotlib Histograms
00:03:20
-
-
Source Code
1 Lectures-
Source Code Download Link
-

Abhilash Nelson
I am a pioneering, talented and security-oriented Android/iOS Mobile and PHP/Python Web Developer Application Developer offering more than eight years’ overall IT experience which involves designing, implementing, integrating, testing and supporting impact-full web and mobile applications.
I am a Post Graduate Masters Degree holder in Computer Science and Engineering.
My experience with PHP/Python Programming is an added advantage for server based Android and iOS Client Applications.