Scrape Goodreads User Data: Profile, Book Shelves, Books, Authors
Using pip:
pip install goodreads-user-scraper
goodreads-user-scraper --user_id <your id> --output_dir goodreads-data
Using pipx:
pipx run goodreads-user-scraper --user_id <your id> --output_dir goodreads-data
- Description: The user whose data should be scraped. Find your user id using these directions.
- Required: Yes
- Description: The directory where all scraped data will be output.
- Required: No
- Default:
goodreads-data
- Description: Whether the script should skip scraping user information.
- Required: No
- Default:
False
- Description: Whether the script should skip scraping shelves.
- Required: No
- Default:
False
- Description: Whether the script should skip scraping authors.
- Required: No
- Default:
False
Ensure that your profile is viewable by anyone:
- Navigate to the Goodreads Account Settings page
- Click on the
Settings
tab - In the
Privacy
section, under the Who Can View My Profile question, select "anyone"
-
Clone the GitHub repository
git clone https://github.com/YashTotale/goodreads-user-scraper.git
-
Run the install script
sh scripts/install.sh
-
Make changes
-
Run the test script
sh scripts/test.sh
-
Create
.env
TWINE_USERNAME=<foo> TWINE_PASSWORD=<bar>
-
Run the publish script
sh scripts/publish.sh <patch|minor|major>