Skip to content

Latest commit

 

History

History
23 lines (15 loc) · 859 Bytes

README.md

File metadata and controls

23 lines (15 loc) · 859 Bytes

Google Flights Web Scraping and Data Mining

This repository focuses on web scraping and data mining from the Google Flights website using R Selenium.

Prerequisites:

Before getting started, make sure to load the following R packages:

  • R Selenium
  • dplyr
  • rebus
  • stringr
  • readxl

Usage:

  • Modify the rutas.xlsx with the desired routes. Then, begin with the web-scraping.R file, adjusting the date as needed.
  • After running the script, execute the second R file (datamining.R) to generate a structured database that visually represents all flight information. Customize it based on your requirements.

Note:

  • Data processing may encounter specific errors during execution.
  • Six weeks' worth of obtained data are included for reference. Feel free to modify and adapt the code to suit your needs.