web2pdf

A Python app that reads a bookmarks.html file and fetches a PDF version of every link in it.
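
As a rough illustration of the first step, here is a minimal sketch of pulling links out of a browser bookmarks export using only the standard library. It is not taken from this project's code: `BookmarkLinkParser` and `extract_links` are hypothetical names, and the sketch assumes the export stores each bookmark as an `<A HREF="...">` tag and that only http(s) links are worth converting.

```python
from html.parser import HTMLParser


class BookmarkLinkParser(HTMLParser):
    """Collect the href of every <a> tag in a bookmarks export (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names, so <A HREF=...> matches too.
        if tag == "a":
            href = dict(attrs).get("href")
            # Assumption: skip bookmarklets and other non-web schemes.
            if href and href.startswith(("http://", "https://")):
                self.links.append(href)


def extract_links(html_text):
    """Return all http(s) links found in the given bookmarks HTML, in document order."""
    parser = BookmarkLinkParser()
    parser.feed(html_text)
    parser.close()
    return parser.links


sample = """<DL><p>
<DT><A HREF="https://www.quantamagazine.org/20170207-bell-test-quantum-loophole/" ADD_DATE="1">Bell test</A>
<DT><A HREF="javascript:void(0)">bookmarklet</A>
</DL><p>"""

print(f"Found {len(extract_links(sample))} links in the bookmark file")
```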

Installation

# Clone the repo
git clone git@github.com:IndelibleStamp/web2pdf.git

# Install wkhtmltopdf. See https://wkhtmltopdf.org/ for other installation methods
sudo dnf install wkhtmltopdf  # For Fedora

# Install the Python dependencies
cd web2pdf
pip install -r requirements.txt

Usage

Review and edit web2pdf/web2pdf/conf.py before the first run. The main setting is the INPUT variable, which must be set to the path of your bookmarks.html file. Then run as follows:

(pdf) bash-4.3 ~/code/web2pdf/web2pdf$ ./web2pdf.py 
Found 2599 links in the bookmark file
Found 2599 rows in the bookmark db
..of which 81 links are already saved
..and 2506 are pending
Hit enter to start downloading pending PDFs
Downloading https://www.quantamagazine.org/20170207-bell-test-quantum-loophole/ | experiment-reaffirms-quantum-weirdness
<snipped>
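
The run above separates links that are already saved from those still pending, then converts the pending ones. A minimal sketch of that flow follows. It is an illustration under stated assumptions, not this project's implementation: the real tool tracks state in a bookmark db, whereas this sketch simply checks for files on disk, and `pdf_path`, `split_saved_pending`, `build_command`, and `download` are hypothetical names. The only external fact relied on is that `wkhtmltopdf` accepts a URL followed by an output file.

```python
import hashlib
import subprocess
from pathlib import Path


def pdf_path(url, output_dir):
    """Map a URL to an output file (hypothetical naming scheme: a hash of the URL)."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()[:16]
    return Path(output_dir) / f"{digest}.pdf"


def split_saved_pending(links, output_dir):
    """Split links into those whose PDF already exists and those still to fetch."""
    saved, pending = [], []
    for url in links:
        (saved if pdf_path(url, output_dir).exists() else pending).append(url)
    return saved, pending


def build_command(url, output_dir):
    """Build the wkhtmltopdf invocation: page URL first, output file second."""
    return ["wkhtmltopdf", url, str(pdf_path(url, output_dir))]


def download(url, output_dir):
    """Convert one page; return True when wkhtmltopdf exits with status 0."""
    result = subprocess.run(build_command(url, output_dir), capture_output=True)
    return result.returncode == 0
```

Keeping `build_command` separate from `download` makes the command easy to inspect or test without `wkhtmltopdf` installed.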

TODO

This is a weekend project. There is quite a bit more to do :)

Make it async. Too slow right now.
Log to file instead of stdout.
Add tests.
Support additional bookmark formats?
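
One possible direction for the speed item above, sketched with a thread pool rather than asyncio, since each conversion would mostly wait on an external process. This is only a sketch: `convert_all` is a hypothetical helper, `convert` stands in for whatever function turns one URL into a PDF, and the default of 4 workers is an arbitrary assumption.

```python
from concurrent.futures import ThreadPoolExecutor


def convert_all(urls, convert, max_workers=4):
    """Run convert(url) for every URL on a thread pool and return {url: succeeded}."""
    urls = list(urls)

    def safe(url):
        # A failure on one page should not abort the rest of the batch.
        try:
            return bool(convert(url))
        except Exception:
            return False

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields results in input order, so zip pairs them up correctly.
        return dict(zip(urls, pool.map(safe, urls)))
```

A caller would pass its own per-URL conversion function as `convert` and could retry the URLs that map to False.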
