GitHub - olyazavr/ubuntu_scraper: Scrapes computer and hardware information into a database

#Ubuntu Scraper

Scrapes computer and hardware information into a database using Django and Scrapy. Retrieves computers and their parts' relationship, and in the case of Ubuntu, checks the certification information (whether or not it can run on Ubuntu and what version). Certified means Ubuntu will definitely run. Enabled signifies Ubuntu will run only if preinstalled by the manufacturer.

##Setup:

install pip
install Django with pip
install scrapy with pip
install south with pip
install Postgresql

##To run a spider: $ scrapy crawl <spider_name>

##Spiders:

intel_spider (must be run before toshiba_spider)
toshiba_spider
dell_spider
ubuntu_spider

##To modify the database:

modify scraper/models.py
If a new model is added, add it to scraper/scrapers/items.py
$ python manage.py syncdb
$ python manage.py schemamigration scraper --auto
$ python manage.py migrate scraper

##To drop/create database:

$ psql -U scraper scraper
# drop schema public cascase; create schema public;
syncdb/schemamigration/migrate as above

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
hardware		hardware
scraper		scraper
.gitignore		.gitignore
README.md		README.md
manage.py		manage.py
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

olyazavr/ubuntu_scraper

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages