dobizz/filext

filext

Python library to identify file type based on its file signature

Installation

pip install filext

Usage

If you are not sure which category the file belongs to, use whatfile. It is a little slower than the category-specific functions.

from filext import whatfile

file_path = "./tests/files/document.pdf"

# pass file as path str
file_type = whatfile(file_path)

with open(file_path, "rb") as file:
    # pass file as bytes
    file_type = whatfile(file.read())
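
Because whatfile accepts a path string, it can be applied to a whole folder in one pass. The following is a minimal sketch, not part of filext: identify_all is a hypothetical helper, and the detector is passed in as a parameter so the sketch makes no assumption about what whatfile returns.

```python
from pathlib import Path
from typing import Any, Callable, Dict


def identify_all(directory: str, detect: Callable[[str], Any]) -> Dict[str, Any]:
    """Run `detect` on every regular file directly inside `directory`.

    `detect` is any callable that accepts a path string, such as
    filext's `whatfile`. Its return value is stored untouched, because
    this sketch does not assume what the detector returns.
    """
    results: Dict[str, Any] = {}
    for path in sorted(Path(directory).iterdir()):
        if path.is_file():  # skip sub-directories
            results[path.name] = detect(str(path))
    return results
```

With filext installed, this could be called as identify_all("./tests/files", whatfile) after from filext import whatfile.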

If the category of the file is known, you may use the function for that category instead.

from filext import whatdoc

file_path = "./tests/files/document.pdf"

# pass file as path str
file_type = whatdoc(file_path)

with open(file_path, "rb") as file:
    # pass file as bytes
    file_type = whatdoc(file.read())

Supported File Types

Documents

  • PDF
  • DOC
  • PPT
  • XLS
  • DOCX
  • PPTX
  • XLSX
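
To illustrate what signature-based detection involves for the document formats above, here is a rough sketch using only the Python standard library. It is not filext's implementation: sniff_document and the constants are hypothetical names. It relies on well-known facts: PDF files start with %PDF-, legacy DOC/PPT/XLS files share one OLE2 compound-file signature, and DOCX/PPTX/XLSX are ZIP archives that contain a word/, ppt/ or xl/ folder.

```python
import io
import zipfile
from typing import Optional

PDF_MAGIC = b"%PDF-"
OLE2_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")  # shared by legacy DOC/PPT/XLS
ZIP_MAGIC = b"PK\x03\x04"                        # DOCX/PPTX/XLSX are ZIP archives

# Top-level folder inside an Office Open XML archive -> format label
OOXML_FOLDERS = {"word/": "DOCX", "ppt/": "PPTX", "xl/": "XLSX"}


def sniff_document(data: bytes) -> Optional[str]:
    """Return a format label for `data`, or None if nothing matches."""
    if data.startswith(PDF_MAGIC):
        return "PDF"
    if data.startswith(OLE2_MAGIC):
        # DOC, PPT and XLS cannot be told apart by signature alone;
        # that would need parsing the compound file, which is skipped here.
        return "OLE2"
    if data.startswith(ZIP_MAGIC):
        try:
            with zipfile.ZipFile(io.BytesIO(data)) as archive:
                names = archive.namelist()
        except zipfile.BadZipFile:
            return None
        for folder, label in OOXML_FOLDERS.items():
            if any(name.startswith(folder) for name in names):
                return label
    return None
```

The sketch shows why the Office formats need more than a prefix check: the leading bytes only identify the container, so the archive contents have to be inspected to pick the exact type.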

Images

  • BMP
  • GIF
  • HEIC
  • JPG
  • PNG
  • TIF
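
The image formats above can mostly be recognised from their leading bytes. The sketch below is an illustration under stated assumptions, not filext's code: sniff_image and both constants are hypothetical names, and the HEIC brand list is a common subset rather than an exhaustive one.

```python
from typing import Optional

# (leading bytes, label) pairs for formats identified by a fixed prefix
PREFIX_SIGNATURES = [
    (b"BM", "BMP"),                 # only two bytes, so a weak signal
    (b"GIF87a", "GIF"),
    (b"GIF89a", "GIF"),
    (b"\xff\xd8\xff", "JPG"),
    (b"\x89PNG\r\n\x1a\n", "PNG"),
    (b"II*\x00", "TIF"),            # little-endian TIFF
    (b"MM\x00*", "TIF"),            # big-endian TIFF
]

# HEIC files are ISO base media files: a 4-byte box size, then "ftyp",
# then a 4-byte brand. This brand set is an assumption, not a full list.
HEIC_BRANDS = {b"heic", b"heix", b"hevc", b"hevx"}


def sniff_image(data: bytes) -> Optional[str]:
    """Return an image format label for `data`, or None if unknown."""
    for magic, label in PREFIX_SIGNATURES:
        if data.startswith(magic):
            return label
    if data[4:8] == b"ftyp" and data[8:12] in HEIC_BRANDS:
        return "HEIC"
    return None
```

Only the first 12 bytes are examined, so a caller could pass file.read(12) instead of the whole file when using this sketch.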
