WayBack Downloader

Download any website from the Internet Archive Wayback Machine.

Installation

Clone repo git clone https://github.com/pavelnovitsky/wayback-machine-download.git
Setup write permissions on the "websites" folder

Basic Usage

Run WayBack Downloader with the base url of the website you want to retrieve as a parameter (e.g., http://example.com):

php downloader.php -h http://example.com

Downloaded files are saved to the websites/{domain}/* directory. For this example it will be websites/example.com/

Options

-h, --host — mandatory parameter, base url of the downloaded website
-t, --timestamp — optional parameter to set the earliest date of the Web Archive snapshots. WayBack Downloader won't download files added before the specified date. Timestamp format: yyyyMMddhhmmss

Examples

http://web.archive.org/web/20060716231334/http://example.com

php downloader.php -h http://example.com

php downloader.php --host=http://example.com

php downloader.php -h http://example.com -t 20060716231334

php downloader.php --host=http://example.com --timestamp=20060716231334

TODO

Add full test coverage
Add separated timestamp options "from" and "to"
Add optional url filter (ex.: only directory, *.jpg, etc)
Add results limiting
Access Control support

Resources used

Wayback CDX Server API

Contributing

You are welcome to contribute with pull requests

Bug tracking

WayBack Downloader uses GitHub issues. If you have found bug, please create an issue.

License

This library is released under the terms of the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Test		Test
lib		lib
websites		websites
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
bootstrap.php		bootstrap.php
downloader.php		downloader.php
phpunit.xml		phpunit.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WayBack Downloader

Installation

Basic Usage

Options

Examples

TODO

Resources used

Contributing

Bug tracking

License

About

Releases

Packages

Languages

License

pavelnovitsky/wayback-machine-download

Folders and files

Latest commit

History

Repository files navigation

WayBack Downloader

Installation

Basic Usage

Options

Examples

TODO

Resources used

Contributing

Bug tracking

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages