This is a simple Node.js-based web crawler that extracts data(link) from websites. It can be customized to scrape specific content, such as text, images, or links, for various use cases like data collection or analysis.
- Crawl web pages
- Extract specific content (e.g. links)
- Clone the repository:
git clone https://github.com/josephDev123/Nodejs-web-crawler-script.git
- Navigate to the project directory:
cd Nodejs-web-crawler-script
- Install dependencies:
npm install
- Modify the target URL and scraping logic in the
src/index.ts
file. - Run the script:
npm run start baseUrl url note("the baseUrl and url must be the same")
- Customize the crawler by editing the
src/index.ts
to target different websites or specific elements.
This project is licensed under the ----.