Welcome to the Web Scraping with Beautiful Soup and Selenium repository! This repository provides a comprehensive guide to web scraping, from basic concepts to practical implementations.
It is aimed at beginners who want to learn the basics of web scraping with Beautiful Soup and Selenium. You'll learn how to extract data from websites by navigating HTML structures, using XPath, and applying CSS selectors. The repository also includes a practical project, TrendyolPhoneData, that scrapes data from the Trendyol website.
Before you begin, make sure you have the following installed:
- Python 3.x
- pip (Python package installer)
To install the required libraries, use pip to install the following packages:
- beautifulsoup4
- selenium
- requests
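One way to confirm that the libraries listed above are available is to query their installed versions from Python. The following is a minimal sketch using the standard-library `importlib.metadata` module; the helper name `installed_versions` is hypothetical and is not part of this repository.

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names as listed in this README.
REQUIRED = ["beautifulsoup4", "selenium", "requests"]


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if missing."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The package is not installed in the current environment.
            found[name] = None
    return found


if __name__ == "__main__":
    for name, ver in installed_versions(REQUIRED).items():
        print(f"{name}: {ver or 'NOT INSTALLED'}")
```

Any package reported as not installed can then be installed with pip before running the scripts.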
Here's a brief overview of how to use the scripts in this repository:
- Clone the repository:
git clone https://github.com/Memirdogan/Web-Scraping-with-Beautiful-Soup-and-Selenium.git
- Navigate to the project directory:
cd Web-Scraping-with-Beautiful-Soup-and-Selenium
The repository is organized as follows:
Web-Scraping-with-Beautiful-Soup-and-Selenium/
├── .idea/
├── Beautiful Soup’a Giriş/
├── Bot Tespiti ve Proxy/
├── HTML Temelleri/
│   └── Uygulama
├── Selenium'a Giriş/
├── TrendyolPhoneData/
│   ├── TrendyolPhoneData.py
│   ├── main.py
│   ├── DataClear.py
│   ├── datasetanalyser.py
│   └── ...
└── XPath ve CSS Seçicilere Giriş/
The folders above cover the following topics:
- Introduction to Beautiful Soup
- Bot Detection and Proxy
- HTML Basics
- Introduction to Selenium
- TrendyolPhoneData Project
- Introduction to XPath and CSS Selectors
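To give a flavor of the Beautiful Soup and CSS selector topics, here is a minimal sketch that parses an inline HTML string rather than a live site. The HTML snippet, the class names (`product`, `name`, `price`), and the function `extract_products` are all assumptions made for illustration; they do not reflect Trendyol's actual markup or the code in this repository.

```python
from bs4 import BeautifulSoup

# Hypothetical HTML standing in for a product listing page.
HTML = """
<html><body>
  <div class="product"><h2 class="name">Phone A</h2><span class="price">1000</span></div>
  <div class="product"><h2 class="name">Phone B</h2><span class="price">1500</span></div>
</body></html>
"""


def extract_products(html):
    """Return a list of {"name", "price"} dicts, one per product card."""
    soup = BeautifulSoup(html, "html.parser")
    products = []
    # select() takes a CSS selector and returns every matching element.
    for card in soup.select("div.product"):
        name = card.select_one("h2.name").get_text(strip=True)
        price = int(card.select_one("span.price").get_text(strip=True))
        products.append({"name": name, "price": price})
    return products


if __name__ == "__main__":
    for product in extract_products(HTML):
        print(product["name"], product["price"])
```

On a real page, the HTML would typically come from `requests` or from Selenium's page source, and the selectors would need to match that site's structure.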
Contributions are welcome! Please feel free to submit a Pull Request.