AnimeListScraper

This is a web scraper made using BeautifulSoup and Selenium to scrape MyAnimeList website.

This project uses Chrome webdriver to automate the scraping process.

The resulting scraping data is saved in semicolon delimited CSV files and are still dirty.

I wouldn't recommend running the scraper since it takes quite a while to retrieve the data.

You can just get the data (in JSON format) by running the code inside this Google Colab. I've removed the duplicates, but there are some empty values that I didn't handle. Or you can also simply clone this repository and download the raw CSV anime and review data. As for watchlists, you can access the scraping result in this Google Drive folder.

Name		Name	Last commit message	Last commit date
Latest commit History 74 Commits
.vscode		.vscode
controllers		controllers
data		data
diagrams		diagrams
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AnimeListScraper

Languages and Technologies:

About

Languages

License

Matthew1906/MyAnimeListScraper

Folders and files

Latest commit

History

Repository files navigation

AnimeListScraper

Languages and Technologies:

About

Topics

Resources

License

Stars

Watchers

Forks

Languages