waybackpy
waybackpy is a Python package and CLI tool that interfaces with the Wayback Machine's APIs. It provides access to SavePageNow API, CDX Server API, and Availability API for archiving and retrieving web page snapshots.
Description
waybackpy enables users to interact with Internet Archive's Wayback Machine APIs through either Python scripting or command-line interface. The tool supports three key public APIs: SavePageNow (Save API) for archiving web pages, CDX Server API for querying archived content, and Availability API for checking snapshot availability.
This tool is particularly useful for researchers, security professionals, and web archivists who need programmatic access to historical web data. It simplifies tasks like bulk archiving, snapshot retrieval, and availability checks that would otherwise require manual interaction with the Wayback Machine website.
waybackpy is lightweight with an installed size of 97 KB and depends on common Python libraries. It's released under the MIT License and actively maintained with comprehensive CLI documentation available.
How It Works
waybackpy operates by making HTTP requests to the Wayback Machine's public REST APIs using Python's requests library. It supports the SavePageNow API to submit URLs for archiving, CDX Server API for indexed search queries across time ranges and parameters, and Availability API for checking if specific snapshots exist. The CLI wrapper uses Python's click library to parse command-line arguments and execute API calls with configurable user agents.
Installation
sudo apt install waybackpyFlags
Examples
waybackpy --helpwaybackpy --versionwaybackpy --show-licensewaybackpy -u https://example.comwaybackpy --url https://example.com --user-agent 'Custom UA'waybackpy -ua 'Mozilla/5.0' -u https://kali.org