Releases: bitdruid/python-wayback-machine-downloader

1.3.0

29 Jun 11:41

  • fixed requirements for win
  • fixed csv for win
  • added "auto" mode
  • fixed some minor bugs

1.2.0

08 Jun 14:57

  • fixed errors if snapshots collide path<->file #3 #4
  • fixed errors where a picture was stored as index.html #7
  • added url-encoding #4
  • prevent redirect loops #4
  • fixed SIGINT (KeyboardInterrupt) preventing the csv-file from being generated #8
  • added custom exception handler
  • added --debug to log exceptions to an error-log and print the full traceback instead of a shortened one
  • replaced batch-lists with queue for workers #9
  • added some cdx-queries from example.com to test
  • added --cdxbackup and --cdxinject to either store a cdx query for later use or use a stored backup
  • added --skip -> an existing csv-file will be used to check for already downloaded snapshots
  • changed user-agent so archive.org can identify who is scraping #11
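
The switch from batch-lists to a queue for workers (#9) can be pictured with the sketch below. It is only an illustration of the general pattern, not the project's actual code: `run_workers` and `handler` are hypothetical names, and the real downloader may organize its threads differently. Instead of splitting the snapshots into fixed batches up front, every worker pulls the next item from one shared queue, so a worker that finishes early keeps working rather than idling.

```python
import queue
import threading


def run_workers(items, handler, workers=4):
    """Process items from one shared queue (hypothetical helper, not the tool's API)."""
    tasks = queue.Queue()
    for item in items:
        tasks.put(item)

    results = []
    lock = threading.Lock()

    def worker():
        while True:
            try:
                # Take the next pending item; stop once the queue is drained.
                item = tasks.get_nowait()
            except queue.Empty:
                return
            try:
                outcome = handler(item)
            except Exception as exc:
                # Keep the exception as the outcome so one failure does not stop the worker.
                outcome = exc
            with lock:
                results.append((item, outcome))

    threads = [threading.Thread(target=worker) for _ in range(workers)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return results
```

The results arrive in completion order, not input order, so callers that need a stable order should sort them afterwards.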

v1.0.2

31 May 07:44

  • fixed paths for win #2 #1:
    • stripping ports from domain (:80 :443 ...) to prevent WinError
    • stripping mailto-prefixes to prevent WinError
    • changed url-parsing to prevent the case where subdir==filename caused WinError
  • url-encoded spaces in filenames are now decoded #1
  • clarified current-path structure in readme - changes may come in the future #1
  • optimized the parsing of cdx-query to keep inside a requested path
  • increased performance of collection-creation for very large requests
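
The path fixes above (stripping ports, dropping mailto-prefixes, decoding url-encoded spaces) can be sketched roughly as follows. This is a minimal illustration using only the standard library; `url_to_local_path` is a hypothetical helper, and the notes do not say how the tool itself handles each case, so the treatment of `mailto:` in particular is an assumption.

```python
from urllib.parse import urlsplit, unquote


def url_to_local_path(url):
    """Return (host, relative_path) without characters that upset Windows paths.

    Hypothetical helper for illustration; not the downloader's real function.
    """
    parts = urlsplit(url)
    # .hostname omits any ":80" / ":443" port suffix (a colon is invalid in Windows names).
    host = parts.hostname or ""

    segments = []
    for segment in parts.path.split("/"):
        if not segment:
            continue
        # Decode percent-escapes such as %20 back into plain characters.
        segment = unquote(segment)
        # Assumption: a leading "mailto:" is simply dropped from the segment.
        if segment.lower().startswith("mailto:"):
            segment = segment[len("mailto:"):]
        if segment:
            segments.append(segment)
    return host, "/".join(segments)
```

For example, `url_to_local_path("http://example.com:80/my%20dir/file%20name.txt")` yields the host `example.com` and the relative path `my dir/file name.txt`.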

first release

22 Apr 07:10

Changes to beta:

  • --worker changed to --workers
  • --csv appends requested url to filename to prevent overwriting
  • cleanup README
  • cleanup HELP