This Scrapy spider scrapes domain names from the website mailcheck.ai categorized alphabetically.
- Python 3.x
- Scrapy
-
Create a virtual environment to isolate the project dependencies. Open a terminal or command prompt and navigate to the project directory.
python3 -m venv myenv
-
Activate the virtual environment:
-
For Windows:
myenv\Scripts\activate
-
For Unix or Linux:
source myenv/bin/activate
-
-
Clone the repository to your local machine.
git clone <REPOSITORY URL>
-
Install the required Python packages using pip:
pip install scrapy
-
Run the scraper
cd /MailCheckScraper/mailchimp_scraping/mailchimp_scraping/spiders scrapy crawl <spider name> scrapy crawl mailcheck