Skip to content
This repository has been archived by the owner on Aug 3, 2021. It is now read-only.

Latest commit

 

History

History
113 lines (88 loc) · 4.58 KB

TODO.md

File metadata and controls

113 lines (88 loc) · 4.58 KB

evaluation of screenshot tools



http://stackoverflow.com/a/15699761 https://dzone.com/articles/python-testing-phantomjs http://toddhayton.com/2015/02/03/scraping-with-python-selenium-and-phantomjs/



https://www.caktusgroup.com/blog/2014/06/23/scheduling-tasks-celery/ https://github.com/aosabook/500lines/tree/master/crawler https://news.ycombinator.com/item?id=11887230

requirements:
  • headless
  • runs on virtualized linux (Vagrant, VMWare ESX, EC2, GCE)
  • supports html5 & css3 & javascript & flash
  • makes nice screenshots (fonts)
  • optionally: supports http proxy
articles:
alternatives:

S3 Bucket setup

idea: nicer select boxes

use django forms?

idea: mashup / integration with other projects

celery or dask

better apis

queue thoughts

async requests