Skip to content

Latest commit

 

History

History
51 lines (35 loc) · 2.01 KB

README.md

File metadata and controls

51 lines (35 loc) · 2.01 KB

Email Data Extractor 📧

The Email Data Extractor is a Python program designed to gather relevant information from email bodies and store it in an Excel spreadsheet.

Python Version License Openpyxl BeautifulSoup4

Features

  • IMAP Connection: Establish a secure connection to the specified IMAP server using provided credentials.
  • Email Retrieval: Search and retrieve all emails from the specified mailbox.
  • Information Extraction: Extract relevant information such as subject, sender, date, and content from each email.
  • Duplicate Fixing: Avoid duplicate entries by checking existing data in the Excel file.
  • Excel Saving: Save the extracted information into an Excel spreadsheet.

Dependencies

  • imaplib for IMAP communication.
  • email library for parsing email messages.
  • BeautifulSoup library for parsing HTML content.

Installation

pip install openpyxl bs4

Usage

  1. Input Configuration: Provide the IMAP server, username, and password in the program.

    imap_server = "imaps.udag.de"
    username = "[email protected]"
    password = "YourPassword123"
  2. Run the Program: Execute the program to connect to the email account, extract information, and save data into an Excel file.

  3. Avoiding Duplicates: The program checks for duplicate entries, ensuring only unique information is stored.

  4. Output Excel File: The Excel file serves as a structured dataset, containing non-duplicated information from email bodies.

Contact

For any questions or feedback, feel free to reach out: