Write a Python function to split the string based on delimiter and convert to series

When working with strings in Python, you often need to split them based on a delimiter and convert the result into a Pandas Series for further data analysis. This is commonly done when processing CSV-like data or text files.

Understanding the Problem

Let's say we have a tab-separated string like 'apple\torange\tmango\tkiwi' and want to split it into individual elements, then convert to a Pandas Series ?

0    apple
1    orange
2    mango
3    kiwi
dtype: object

Method 1: Using a Function

We can create a reusable function that accepts a string and delimiter, then returns a Pandas Series ?

import pandas as pd

def split_to_series(s, delimiter):
    split_data = s.split(delimiter)
    return pd.Series(split_data)

# Test the function
result = split_to_series('apple\torange\tmango\tkiwi', '\t')
print(result)
0     apple
1    orange
2     mango
3      kiwi
dtype: object

Method 2: Using Lambda Function

For a more concise approach, we can use a lambda function to achieve the same result ?

import pandas as pd

data = 'apple\torange\tmango\tkiwi'
delimiter = '\t'

# Create lambda function for splitting
split_data = lambda x, y: x.split(y)
result = split_data(data, delimiter)

# Convert to Series
series_result = pd.Series(result)
print(series_result)
0     apple
1    orange
2     mango
3      kiwi
dtype: object

Method 3: Direct Approach

For simple cases, you can directly split and convert in one line ?

import pandas as pd

# Direct conversion
data = 'red,blue,green,yellow'
series_result = pd.Series(data.split(','))
print(series_result)
0       red
1      blue
2     green
3    yellow
dtype: object

Working with Different Delimiters

The same approach works with various delimiters like commas, semicolons, or custom characters ?

import pandas as pd

def split_to_series(text, delimiter):
    return pd.Series(text.split(delimiter))

# Different delimiter examples
comma_data = 'cat,dog,bird,fish'
semicolon_data = 'python;java;javascript;c++'

print("Comma-separated:")
print(split_to_series(comma_data, ','))
print("\nSemicolon-separated:")
print(split_to_series(semicolon_data, ';'))
Comma-separated:
0     cat
1     dog
2    bird
3    fish
dtype: object

Semicolon-separated:
0          python
1            java
2    javascript
3             c++
dtype: object

Comparison

Method Best For Reusability
Function Multiple uses, complex logic High
Lambda Functional programming style Medium
Direct Simple one-time operations Low

Conclusion

Use the function approach for reusable code, lambda for functional programming style, or direct conversion for simple one-time operations. All methods effectively split strings and convert them to Pandas Series for data analysis.

Updated on: 2026-03-25T16:29:59+05:30

822 Views

Kickstart Your Career

Get certified by completing the course

Get Started
Advertisements