File attachment and URL extractor for EML & MSG files using Python
-
Updated
Oct 13, 2017 - Python
File attachment and URL extractor for EML & MSG files using Python
Extact all URLs from anchor and image tags within a html/xhtml page and its children.
An Apache Drill UDF for working with Twitter tweet text via the twitter-text Java library (https://github.com/twitter/twitter-text/tree/master/java)
Bootcamp Laboratoria - Produto final do sprint 4. Biblioteca no npm para extracao de links em documento markdown.
A Minimal Yet Powerful Crawler for Extracting all The Internal/External/Fuzz-able Links from a website
A small tool for extracting all urls from a blob of binary data (ex. PDFs).
Extract URLs,endpoints,paths and word-lists form source files
Tika based link (URL) extractor for httpreserve
A python script to extract URL from the text or paragraph.
🍊🔗 Squeeze some juice from URLs: A URL crawler/extraction library.
Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with robust patterns.
Extract article title, description, images, keywords and authors from any URL
URL Extractor is a simple Python code designed to extract the domain name from a list of URLs stored in a text file. This application provides a convenient way to extract and process URLs efficiently.
LinkLifter is a Python script that searches for URLs in a given text file or recursively in a directory and its subdirectories. The found URLs, along with the file they are located in, are saved to a CSV file.
Extract urls from your a file or web address
Extract http/https URLs from any kind of text content.
Website URL Scanner is a simple command-line tool that allows you to scan a website and extract all URLs. It can be useful for various purposes, such as link analysis or checking for broken links.
Web scraping | Website cloner
URL Title Extractor is a Python program that extracts the titles of web pages from a file containing URLs. It uses the requests and BeautifulSoup libraries to extract the title and decode any HTML entities.
Add a description, image, and links to the url-extractor topic page so that developers can more easily learn about it.
To associate your repository with the url-extractor topic, visit your repo's landing page and select "manage topics."