Releases: bitdruid/python-wayback-machine-downloader
Releases · bitdruid/python-wayback-machine-downloader
1.3.0
1.2.0
- fixed errors if snapshots colide path<->file #3 #4
- fixed errors where a picture was stored as index.html #7
- added url-encoding #4
- prevent redirect loops #4
- fixed SIGINT KeyboardInterrupt prevents csv-file from generating #8
- added custom exception handler
- added
--debug
to log exceptions into an error-log and print out full traceback instead of shortened - replaced batch-lists with queue for workers #9
- added some cdx-queries from example.com to test
- added
--cdxbackup
and--cdxinject
to either store a cdx query for later use or use a backup - added
--skip
-> an existing csv-file will be used to check for already downloaded snapshots - changed user-agent to give archive.org the possibility to know who is scraping #11
v1.0.2
- fixed paths for win #2 #1:
- stripping ports from domain (:80 :443 ...) to prevent WinError
- stripping mailto-prefixes to prevent WinError
- changed url-parsing to prevent the case where subdir==filename caused WinError
- url-encoded spaces in filenames are now decoded #1
- clarified current-path structure in readme - changes may come in the future #1
- optimized the parsing of cdx-query to keep inside a requested path
- increased performance of collection-creation for very large requests
first release
Changes to beta:
--worker
changed to--workers
.--csv
appends requested url to filename to prevent overwriting- cleanup README
- cleanup HELP