Tushar Sharma

Tushar Sharma

61 Articles Published

Articles by Tushar Sharma

61 articles

How to take integer input in Python?

Tushar Sharma
Tushar Sharma
Updated on 27-Mar-2026 2K+ Views

Taking integer input is a fundamental task in Python programming. The input() function returns strings by default, so we need to convert them to integers using int(). This article explores four common approaches to handle integer input effectively. Using input() and int() Conversion The input() function captures user input as a string. To work with integers, we convert the string using int() function ? Single Integer Input # Taking single integer input num = int(input("Enter an integer: ")) print("You entered:", num) print("Type:", type(num)) Enter an integer: 42 You entered: 42 Type: ...

Read More

How to suppress the use of scientific notations for small numbers using NumPy?

Tushar Sharma
Tushar Sharma
Updated on 27-Mar-2026 424 Views

When working with NumPy arrays, small numbers are often displayed in scientific notation (like 1e-10). While compact, this format can be difficult to read and compare. This guide covers four methods to suppress scientific notation for small numbers in NumPy arrays: using numpy.vectorize with string formatting, numpy.ndarray.round, string formatting with list comprehension, and numpy.set_printoptions. Method 1: Using numpy.vectorize with String Formatting The numpy.vectorize function combined with string formatting can suppress scientific notation by converting array elements to formatted strings ? Syntax formatted_array = numpy.vectorize('{:.Nf}'.format)(array) Here, N denotes the decimal places to retain, and ...

Read More

How to Subtract Two Columns in Pandas DataFrame?

Tushar Sharma
Tushar Sharma
Updated on 27-Mar-2026 5K+ Views

When working with Pandas DataFrames, you often need to perform arithmetic operations between columns. One common operation is subtracting two columns. This guide explores four different methods to subtract columns in a Pandas DataFrame, from simple arithmetic operators to specialized Pandas functions. Method 1: Using Simple Arithmetic Operator (-) The most straightforward approach is using the standard subtraction operator. This method is intuitive and commonly used for basic column operations. Syntax result = dataframe['column1'] - dataframe['column2'] Example Let's create a DataFrame with sales and costs data and calculate the profit by subtracting ...

Read More

How to store XML data into a MySQL database using Python?

Tushar Sharma
Tushar Sharma
Updated on 27-Mar-2026 1K+ Views

XML (eXtensible Markup Language) is a widely used format for storing structured data. When working with large XML datasets, storing this data in a MySQL database provides better performance and query capabilities. Python offers excellent libraries to parse XML and interact with MySQL databases. Prerequisites Before starting, ensure you have the required libraries installed ? pip install mysql-connector-python Step 1: Import Required Libraries We need two essential libraries for this task ? xml.etree.ElementTree: For parsing and manipulating XML documents mysql.connector: For connecting Python to MySQL ...

Read More

How to Standardize Data in a Pandas DataFrame?

Tushar Sharma
Tushar Sharma
Updated on 27-Mar-2026 4K+ Views

Data standardization, also known as feature scaling, is a crucial preprocessing step that transforms data to have a mean of 0 and standard deviation of 1. This ensures all features contribute equally to machine learning algorithms. Pandas provides several methods to standardize DataFrame columns efficiently. What is Data Standardization? Standardization transforms data using the formula: z = (x - μ) / σ, where μ is the mean and σ is the standard deviation. This creates a standard normal distribution with consistent scale across all features. Method 1: Using StandardScaler from sklearn The most common approach uses ...

Read More

How to Stack Multiple Pandas DataFrames?

Tushar Sharma
Tushar Sharma
Updated on 27-Mar-2026 461 Views

Pandas provides several methods to stack multiple DataFrames vertically or horizontally. When working with multiple datasets that need to be combined for analysis, functions like concat(), append() (deprecated), and numpy.vstack() offer different approaches for DataFrame stacking. This article explores the most effective methods for combining multiple DataFrames, focusing on practical examples and their use cases. Syntax pd.concat() pd.concat(objs, axis=0, join='outer', keys=None, ignore_index=False) Concatenates DataFrames along a specified axis with options for join types and index handling. numpy.vstack() numpy.vstack(tup) Stacks arrays vertically (row-wise), provided they have the same number ...

Read More

How to split the Dataset With scikit-learnís train_test_split() Function

Tushar Sharma
Tushar Sharma
Updated on 27-Mar-2026 462 Views

Machine learning models require proper data splitting to evaluate performance accurately. Scikit-learn's train_test_split() function provides a simple way to divide your dataset into training and testing portions, ensuring your model can be validated on unseen data. Syntax from sklearn.model_selection import train_test_split X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42) Parameters X, y: Feature matrix and target vector respectively test_size: Proportion of data for testing (typically 0.2 or 20%) random_state: Seed for reproducible random splitting stratify: Maintains class proportions in splits ...

Read More

How to Split Data into Training and Testing in Python without Sklearn

Tushar Sharma
Tushar Sharma
Updated on 27-Mar-2026 1K+ Views

Splitting data into training and testing sets is a fundamental step in machine learning. While scikit-learn's train_test_split() is commonly used, understanding how to split data manually helps you grasp the underlying concepts and provides flexibility when external libraries aren't available. Why Split Data? Machine learning models learn patterns from training data. To evaluate how well they generalize to new, unseen data, we need a separate testing set. Using the same data for both training and testing leads to overfitting — the model memorizes the training data but fails on new data. The typical split ratios are 80-20 ...

Read More

How to Split a File into a List in Python?

Tushar Sharma
Tushar Sharma
Updated on 27-Mar-2026 481 Views

Python provides multiple ways to read a file and split its contents into a list. This is a common operation in data processing, text analysis, and file manipulation tasks. Understanding Files and Lists in Python In Python, a file is a named location on disk where data is stored permanently. Python can work with various file formats including text files (.txt), CSV files (.csv), and others. A list is a built-in data structure that can store multiple items in a single variable. Lists are mutable, meaning you can modify them after creation. Method 1: Using read() ...

Read More

How to Speedup Pandas with One-Line change using Modin?

Tushar Sharma
Tushar Sharma
Updated on 27-Mar-2026 235 Views

Data is considered the new oil in this information era. Python, with its extensive libraries, is one of the leading programming languages for data analysis, and Pandas is its crown jewel. However, as datasets have grown larger, Pandas users have found their workflows hampered by slow execution on large datasets. Fortunately, there's a way to vastly improve Pandas performance using a single line of code with Modin. What is Modin? Pandas excels in delivering high-performance, user-friendly data structures and tools for data analysis. However, it has one significant limitation — it was built to leverage single-core processing, which ...

Read More
Showing 1–10 of 61 articles
« Prev 1 2 3 4 5 7 Next »
Advertisements