Skip to content

A Web Scraper that retrieves anime, watchlist, and review data from https://myanimelist.net/ using BeautifulSoup and Selenium

License

Notifications You must be signed in to change notification settings

Matthew1906/MyAnimeListScraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

74 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AnimeListScraper

This is a web scraper made using BeautifulSoup and Selenium to scrape MyAnimeList website.

This project uses Chrome webdriver to automate the scraping process.

The resulting scraping data is saved in semicolon delimited CSV files and are still dirty.

I wouldn't recommend running the scraper since it takes quite a while to retrieve the data.

You can just get the data (in JSON format) by running the code inside this Google Colab. I've removed the duplicates, but there are some empty values that I didn't handle. Or you can also simply clone this repository and download the raw CSV anime and review data. As for watchlists, you can access the scraping result in this Google Drive folder.

Languages and Technologies:

Visual Studio Code Python Selenium Pandas Git GitHub

About

A Web Scraper that retrieves anime, watchlist, and review data from https://myanimelist.net/ using BeautifulSoup and Selenium

Topics

Resources

License

Stars

Watchers

Forks

Languages