Skip to content

Get a quickstart code to scrape an HTML file from this project. Please read the README.md to understand how to do it.

Notifications You must be signed in to change notification settings

anildhage/Simple-Web-Scraper

Repository files navigation

A simple program that will scrape any website and save data in a csv file. This is just a quick start app, you may want to add more code as per your need

  1. Create an environment & Install dependencies found in the .py files using pip in the command prompt
  2. Run the file in the command prompt - python Scrape_IMDB_top_1000.py or the other files that is in the directory

Scraped top 1000 IMDB website to get the details of all the movies and corresponding information like year, movie-time, rating (Multiple pages).

Saved the data collected from the html websites in the .csv file using pandas library. You can apply these techniques to any websites with making changes to the above codes to scrape websites that allow you to do legally.

About

Get a quickstart code to scrape an HTML file from this project. Please read the README.md to understand how to do it.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages