Server Side Programming Articles

How to select a range of rows from a dataframe in PySpark?

PySpark, Python, Server Side Programming, Programming

Tapas Kumar Ghosh

Updated on 27-Mar-2026 1K+ Views

A PySpark DataFrame is a distributed collection of data organized into rows and columns. Selecting a range of rows means filtering data based on specific conditions. PySpark provides several methods, such as filter(), where(), and collect(), to achieve this.

Setting Up PySpark

First, install PySpark and import the required modules:

pip install pyspark

from pyspark.sql import SparkSession

# Create SparkSession
spark = SparkSession.builder \
    .appName('DataFrame_Range_Selection') \
    .getOrCreate()

# Sample data
customer_data = [
    ("PREM KUMAR", 1281, "AC", 40000, 4000),
    ...

How to slice a PySpark dataframe in two row-wise dataframe?

PySpark, Python, Server Side Programming, Programming

Tapas Kumar Ghosh

Updated on 27-Mar-2026 1K+ Views

PySpark dataframes can be split into two row-wise dataframes using various built-in methods. This process, called slicing, is useful for data partitioning and parallel processing in distributed computing environments.

Syntax Overview

The key methods for slicing PySpark dataframes include:

limit(n) − Returns first n rows
subtract(df) − Returns rows not present in another dataframe
collect() − Retrieves all elements as a list
head(n) − Returns first n rows as Row objects
exceptAll(df) − Returns rows excluding another dataframe's rows
filter(condition) − Filters rows based on conditions

Installation

pip install pyspark ...

How to set axes labels & limits in a Seaborn plot?

Seaborn, Python, Server Side Programming, Programming

Tapas Kumar Ghosh

Updated on 27-Mar-2026 13K+ Views

Seaborn automatically adjusts labels and axes limits to make plots more understandable, but sometimes you need custom control. Setting appropriate axes labels helps viewers understand what the plot represents, while adjusting limits lets you focus on specific data ranges. We can use matplotlib functions like xlabel(), ylabel(), xlim(), and ylim() to customize Seaborn plots.

Core Functions for Axes Customization

Here are the main functions used to set labels and limits:

plt.xlabel() − Sets the x-axis label text
plt.ylabel() − Sets the y-axis label text
plt.xlim() − Sets the x-axis range limits
plt.ylim() − Sets the y-axis ...

How to set the tab size in Text widget in Tkinter?

Tkinter, Python, Server Side Programming, Programming

Tapas Kumar Ghosh

Updated on 27-Mar-2026 1K+ Views

The Python Tkinter module provides a powerful way to create graphical user interfaces (GUIs). The Text widget is particularly useful for multi-line text input, and you can customize its tab size using the tabs parameter to improve text formatting and readability.

Setting Tab Size in Text Widget

The tabs parameter in the Text widget accepts a tuple or list of tab stop positions measured in pixels or other units:

import tkinter as tk

# Create main window
root = tk.Tk()
root.title("Text Widget Tab Size")
root.geometry("600x400")

# Create Text widget with custom tab size ...

How to setup Conda environment with Jupyter Notebook?

Python, Server Side Programming, Programming

Tapas Kumar Ghosh

Updated on 27-Mar-2026 661 Views

Jupyter Notebook is an open-source web application that allows you to create and share documents containing live code, equations, visualizations, and narrative text. Conda is a powerful package manager that helps you manage different Python environments and packages. Setting up a Conda environment with Jupyter Notebook provides an isolated workspace for your data science and machine learning projects.

Benefits of Using Conda with Jupyter Notebook

Create isolated environments for different projects with specific package versions
Easy installation and management of data science packages like NumPy, Pandas, and Matplotlib
Avoid package conflicts between different projects
Simple environment sharing ...

Get Random Range Average using Python

Python, Programming, Server Side Programming

Rohan Singh

Updated on 27-Mar-2026 987 Views

Python provides several methods to generate random numbers within a specific range and calculate their average. This article explores four different approaches using the random module, NumPy library, random.choices() function, and statistics module.

Algorithm

The general algorithm to generate random numbers and find their average is:

Generate random numbers within a specified range
Store these numbers in a list or array
Calculate the average of the generated numbers
Display the result

Method 1: Using the Random Module

The random module provides a simple way to generate random numbers. We can use random.randint(a, b) ...
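The four algorithm steps listed above can be sketched with the standard library alone. This is a minimal illustration, not the article's own listing: the function name random_range_average, the range 1 to 100, the sample size, and the seed are all assumptions chosen for the example.

```python
import random
import statistics


def random_range_average(low, high, count, seed=None):
    """Return (numbers, average) for `count` random integers in [low, high]."""
    # A dedicated Random instance keeps the example reproducible when seeded
    rng = random.Random(seed)

    # Steps 1 and 2: generate the numbers and store them in a list
    numbers = [rng.randint(low, high) for _ in range(count)]

    # Step 3: calculate the average of the generated numbers
    average = statistics.mean(numbers)
    return numbers, average


# Step 4: display the result (the values depend on the seed)
numbers, average = random_range_average(1, 100, 10, seed=42)
print("Numbers:", numbers)
print("Average:", average)
```

Note that random.randint(a, b) includes both endpoints, so a value equal to the upper bound can appear in the list.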

How to set alignment of each dropdown widget in Jupyter?

Python, Server Side Programming, Programming

Tapas Kumar Ghosh

Updated on 27-Mar-2026 721 Views

Dropdown widgets in Jupyter notebooks can be aligned using CSS layout properties and the ipywidgets package. We can control alignment using the Layout() class to position dropdowns side by side, center them, or arrange them vertically for better visual presentation.

Installation Requirements

Install the required packages:

pip install ipywidgets ipyvuetify

Basic Syntax

The main components for creating aligned dropdown widgets:

# Create dropdown widget
widgets.Dropdown(options=[], description='', layout=widgets.Layout())

# Define layout alignment
widgets.Layout(width='70%', align_self='center')

# Alternative using ipyvuetify
v.Select(multiple=True, items=[], label='', style_='width:300px')

Key Parameters ...

Ledoit-Wolf vs OAS Estimation in Scikit Learn

Python, Scikit-learn, Server Side Programming, Programming

Siva Sai

Updated on 27-Mar-2026 595 Views

Understanding various techniques for estimating covariance matrices is essential in machine learning. Scikit-Learn provides two popular shrinkage-based covariance estimation methods: Ledoit-Wolf and Oracle Approximating Shrinkage (OAS). Both methods address the challenge of unreliable empirical covariance estimation in high-dimensional scenarios.

Introduction to Covariance Estimation

Covariance estimation quantifies relationships between multiple dimensions or features in datasets. In high-dimensional data where features outnumber samples, the standard empirical covariance matrix becomes unreliable. Shrinkage methods like Ledoit-Wolf and OAS provide more robust estimates by "shrinking" the empirical matrix toward a structured target.

Ledoit-Wolf Estimation

The Ledoit-Wolf method shrinks the empirical covariance ...
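The "shrinking toward a structured target" described above can be written as a convex combination. The form below follows the shrunk-covariance definition in the scikit-learn documentation; the symbols are chosen here for illustration, with \hat{\Sigma} the empirical covariance, p the number of features, I the identity matrix, and \alpha the shrinkage coefficient.

```latex
\Sigma_{\text{shrunk}} = (1 - \alpha)\,\hat{\Sigma} + \alpha\,\frac{\operatorname{tr}(\hat{\Sigma})}{p}\,I,
\qquad 0 \le \alpha \le 1
```

With \alpha = 0 the estimate is the empirical covariance itself, and with \alpha = 1 it is a scaled identity matrix. Ledoit-Wolf and OAS share this form and differ only in the formula used to choose \alpha from the data.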

How to search a value within a Pandas DataFrame row?

Pandas, Python, Server Side Programming, Programming

Tapas Kumar Ghosh

Updated on 27-Mar-2026 4K+ Views

A Pandas DataFrame is a two-dimensional data structure that represents data in tabular form with rows and columns. Pandas provides several built-in methods such as eq(), any(), loc[], and apply() to search for specific values within DataFrame rows.

Basic Value Search in a Column

The simplest approach is to search for a value in a specific column using boolean indexing:

import pandas as pd

# Create a DataFrame
data = {'Name': ['Bhavish', 'Abhinabh', 'Siddhu'], 'Age': [25, 32, 28]}
df = pd.DataFrame(data)

# Search for a value in ...
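Since the excerpt is cut off before the search itself, here is a minimal sketch of two of the techniques it names, reusing the same sample data. The search values 'Siddhu' and 32 and the variable names are assumptions picked for illustration, not taken from the article.

```python
import pandas as pd

# Same sample data as in the excerpt above
data = {'Name': ['Bhavish', 'Abhinabh', 'Siddhu'], 'Age': [25, 32, 28]}
df = pd.DataFrame(data)

# Boolean indexing: keep rows whose 'Name' column equals the search value
by_name = df[df['Name'] == 'Siddhu']
print(by_name)

# eq() compares every cell with the value; any(axis=1) flags rows with
# at least one matching cell, whichever column it is in
rows_with_32 = df[df.eq(32).any(axis=1)]
print(rows_with_32)
```

The first form is the better fit when the column is known in advance; the eq()/any() form searches across all columns of each row.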

Lazy import in Python

Python, Server Side Programming, Programming

Siva Sai

Updated on 27-Mar-2026 5K+ Views

Lazy import in Python is a technique where modules are imported only when they're actually needed, rather than at the start of the program. This approach can significantly improve startup times and reduce memory usage, especially for applications with heavy dependencies.

What is Lazy Import?

Traditionally, Python imports modules at the beginning of a script using import statements. However, importing large libraries can slow down startup times and consume unnecessary memory if those modules aren't immediately used. Lazy import delays the importing process until the module is actually required in your code. This technique is also known ...
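One way to get the delayed loading described above is the standard library's importlib.util.LazyLoader. The sketch below follows the recipe in the importlib documentation; the helper name lazy_import and the choice of json as the demo module are assumptions for illustration, and the article itself may use a different approach.

```python
import importlib.util
import sys


def lazy_import(name):
    """Return a module object whose code runs on first attribute access."""
    # Reuse the module if something has already imported it
    if name in sys.modules:
        return sys.modules[name]

    spec = importlib.util.find_spec(name)
    if spec is None:
        raise ModuleNotFoundError(f"No module named {name!r}")

    # Wrap the real loader so execution is postponed
    loader = importlib.util.LazyLoader(spec.loader)
    spec.loader = loader
    module = importlib.util.module_from_spec(spec)
    sys.modules[name] = module
    loader.exec_module(module)  # sets up the lazy module; does not run its code yet
    return module


json = lazy_import("json")          # cheap: nothing is executed here if json was not loaded before
print(json.dumps({"lazy": True}))   # first attribute access triggers the real import
```

A simpler alternative with the same effect for a single call site is to place the import statement inside the function that needs the module.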
