Multithreaded pastebin scraper, scrapes to mysql database, then reads pastes for noteworthy information.
Use sniff-paste.py to go through the entire process of collection, logging, and harvest automatically. The scraper can be set to a paste limit of 0 to scrape indefinitely. If scraped indefinitely, press ctrl + c to stop scraping, any useful information will be in the database, along with a link back to the original paste it was found in.
sudo apt install libxslt-dev python3-lxml python3-nmap xsltproc mysql-server
pip3 install -r requirements.txt
- Create database named
sniff_paste
in mysql server - Fill in settings.ini
python3 sniff-paste.py
This will scrape pastebin for the latest number of pastes, then run analysis for ip addresses, emails, and phone numbers. It filters out duplicates and runs scans on some of the harvested data.
sniff_paste
-- root dbpastes
-- stores paste with full text, date, link, title, and languageemails
-- stores emails with extension to pastelinks
-- stores urls with extension to pasteip
-- stores ip with connectivity and extension to pastephones
-- stores phone numbers with extension to pastesecrets
-- stores secret type with extension to pasteports
-- stores port scan info (port, status, service, version, ip)cryptos
-- stores cryptocurrency findings with extension to paste
Crypto findings are not certain to be valid, consider them low probability findings
- Please contribute! If there's an error let me know -- even better if you can fix it :)
- Regex Contributions would be very helpful, and should be pretty easy to add!
- This tool is in the process of a bigger update, where the scraper can send all new pastes to my new project needmorecowbell/Funnel. I'm trying to consolidate all of my osint tools into one streamlined solution.