every one whole use the program can collect data from websites fast, and also help other to collect data.
Software architecture description
mongodump -h 127.0.0.1 -d py_crowd_spider -o mongodb
mongorestore -h 127.0.0.1 -d mongodb/py_crowd_spider
mongoexport -h 127.0.0.1 -d py_crowd_spider -c proxies -o D:/proxies.json
mongoimport --db py_crowd_spider --collection proxies --file D:/proxies.json
pip freeze > requirements.txt
- pip install -r requirements.txt
- download Tesseract: https://github.com/tesseract-ocr/tesseract/wiki