Skip to content

Scrape product details for all categories on Booker wholesale website using Scrapy

Notifications You must be signed in to change notification settings

james-innes/booker-scraper

Repository files navigation

Booker Scraper

Scraping in action

Run pipenv install to install dependencies.
Run pipenv shell to enter virtual environment.

Process

1. Login

.env

BOOKER_ACCOUNT=
BOOKER_EMAIL=
BOOKER_PASSWORD=
ASP_NET_SESSION=
ASPXAUTH=

Run login.py every time spiders return non 200 response and copy printed values into .env.

2. Sitemap

Sitemap manually copied from side nav pane. Could be automated.

3. Product List

From list view scrap all product code's and other info available. scrapy crawl product_list

Load outputted CSV file into Database for the following step!

4. Product Detail

From the aforementioned step we have the product_list table which we now use to scrap each product page using the code. scrapy crawl product_detail

Load the data into database.

5. View the database

Run the barcode.py script to generate a CSV file of all the products in the database.

6. View the database

SQLite views collate data which can be exported to CSV.

About

Scrape product details for all categories on Booker wholesale website using Scrapy

Topics

Resources

Stars

Watchers

Forks

Languages