Skip to content

josephDev123/Nodejs-web-crawler-script

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Node.js Web Crawler Script

Description

This is a simple Node.js-based web crawler that extracts data(link) from websites. It can be customized to scrape specific content, such as text, images, or links, for various use cases like data collection or analysis.

Features

  • Crawl web pages
  • Extract specific content (e.g. links)

Installation

  1. Clone the repository:
    git clone https://github.com/josephDev123/Nodejs-web-crawler-script.git
  2. Navigate to the project directory:
    cd Nodejs-web-crawler-script
  3. Install dependencies:
    npm install

Usage

  1. Modify the target URL and scraping logic in the src/index.ts file.
  2. Run the script:
    npm run start baseUrl url
    note("the baseUrl and url must be the same")

Configuration

  • Customize the crawler by editing the src/index.ts to target different websites or specific elements.

License

This project is licensed under the ----.


About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published