# Install requirements (pip or pip3)
pip install -r requirements.txt
# Download all zips (python or python3)
python download.py
# Extract and merge the zips to create PDFs
python merge_pdfs.py
This script uses the data in data.json to scrape the books. If data.json is outdated, you can regenerate it from the latest source:
- Get the latest relevant source from view-source:https://ncert.nic.in/textbook.php and paste it into sourceScript.js
- Clean up the script (format it, remove comments, etc.)
- Run the parser to regenerate data.json:
python parse_script.py
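
The comment-removal part of the cleanup step can be automated before running the parser. The following is a minimal sketch and not part of this repository: the function name `strip_js_comments` is hypothetical, and it assumes the pasted source uses only ordinary single- or double-quoted strings (template literals and regex literals are not handled). Strings are matched first so that URLs such as `https://...` inside them are not mistaken for `//` comments.

```python
import re

# Match string literals first so comment markers inside them are preserved.
_TOKEN = re.compile(
    r"""
    (?P<string>"(?:\\.|[^"\\])*"|'(?:\\.|[^'\\])*')   # quoted strings (kept)
    |(?P<block>/\*.*?\*/)                              # block comments (dropped)
    |(?P<line>//[^\n]*)                                # line comments (dropped)
    """,
    re.VERBOSE | re.DOTALL,
)


def strip_js_comments(source: str) -> str:
    """Return `source` with JavaScript comments removed (hypothetical helper)."""

    def keep_strings(match: re.Match) -> str:
        # Keep string literals unchanged; replace comments with nothing.
        return match.group("string") or ""

    stripped = _TOKEN.sub(keep_strings, source)
    # Drop lines left empty by the removal and trim trailing whitespace.
    return "\n".join(
        line.rstrip() for line in stripped.splitlines() if line.strip()
    )
```

To use it, read the contents of sourceScript.js, pass them through `strip_js_comments`, and write the result back before running parse_script.py. Review the output by hand, since this sketch does not reformat the code.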