- Trending Categories
- Data Structure
- Operating System
- MS Excel
- C Programming
- Social Studies
- Fashion Studies
- Legal Studies
- Selected Reading
- UPSC IAS Exams Notes
- Developer's Best Practices
- Questions and Answers
- Effective Resume Writing
- HR Interview Questions
- Computer Glossary
- Who is Who
How to extract file extension using Python?
An operating system like Microsoft Windows uses a file extension as a suffix to the name of a computer file. It falls under the category of metadata. An operating system's understanding of a file's attributes and, to some extent, its desired usage is supported by the file extension.
We could need to extract file extensions in Python. You can achieve this in a number of ways.
OS file path manipulation is made simple with the help of the Python module os.path. It covers receiving the data from file paths, opening, saving, and updating.To obtain the file extension in Python, we shall make use of this module.
The function splitext() in os.path allows you to separate the root and extension of a specified file path. A tuple made up of the root string and the extension string is the function's output.
Example Using the splitext() method
The function os.path.splitext() returns a tuple with two items: the file extension and the path with the name of the file, respectively. Following is an example to extract the file using os.path module −
# importing the module import os# Providing the path path = 'D:\Work TP.py'# declaring the variable to get the result result = os.path.splitext(path) print('Path:', result) print('Extension:', result)
The root file path has now successfully returned the extension. Following is an output of the above code−
Path: D:\Work TP Extension: .py
Pathlib is a Python module that defines useful functions and constants for classes that represent file paths.
Using a path string as a parameter, pathlib.Path() creates a new Path object.
The attribute suffix on the pathlib.Path object returns information about the file extension.
By simply calling the attributes parent and name within the Path object, we can obtain the parent file path and the actual file name of the provided file path in addition to the root.
Following is an example to extract the file using pathlib module:
import pathlib path = pathlib.Path('D:\Work TP.py') print('Parent:', path.parent) print('NameOfFile:', path.name) print('Extension:', path.suffix)
Following is an output of the above code−
Parent: D:\ NameOfFile: Work TP.py Extension: .py
Extracting just the extension suffix (without dot)
If you want to remove the dot and extract just the extension suffix such as py, txt, docx etc. You need to add “[1:]” after the result) while working with the splitext() method as –
Similarly while working with the pathlib.Path() method add “[1:]” after path.suffix as−
The following program demonstrates how to print just the suffixes using both the methods discussed above −
# importing the modules import os import pathlib path = 'D:/test.txt' result = os.path.splitext(path) print('Extension:', result[1:]) print('Extension:', pathlib.Path('D:/test.txt').suffix[1:])
Extension: txt Extension: txt
- Related Articles
- How to change file extension in Python?
- How to get the file extension using PowerShell?
- How to write Python regular expression to match with file extension?
- How to extract all the .txt files from a zip file using Python?
- Perl File Extension
- How to get file extension of file as a result of MySQL query?
- Golang program to get the file extension
- Extract csv file specific columns to list in Python
- How can I extract or uncompress gzip file using php?
- Get file extension name in Java
- How to extract a part of the file path (a directory) in Python?
- Python Program to extract email-id from URL text file
- How to extract numbers from a string using Python?
- How to rename a file using Python?