GitHub - anildhage/Simple-Web-Scraper: Get a quickstart code to scrape an HTML file from this project. Please read the README.md to understand how to do it.

A simple program that will scrape any website and save data in a csv file. This is just a quick start app, you may want to add more code as per your need

Create an environment & Install dependencies found in the .py files using pip in the command prompt
Run the file in the command prompt - python Scrape_IMDB_top_1000.py or the other files that is in the directory

Scraped top 1000 IMDB website to get the details of all the movies and corresponding information like year, movie-time, rating (Multiple pages).

Saved the data collected from the html websites in the .csv file using pandas library. You can apply these techniques to any websites with making changes to the above codes to scrape websites that allow you to do legally.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.idea		.idea
.DS_Store		.DS_Store
README.md		README.md
Scrape_IMDB_top_1000.py		Scrape_IMDB_top_1000.py
Top1000.csv		Top1000.csv
f2-trend-latest.csv		f2-trend-latest.csv
laptops_indexed.csv		laptops_indexed.csv
scrape-f2-trend&latest-movies.py		scrape-f2-trend&latest-movies.py
scrape-flipkart-laptops.py		scrape-flipkart-laptops.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Releases

Packages

Languages

anildhage/Simple-Web-Scraper

Folders and files

Latest commit

History

Repository files navigation

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages