auto-scrape

Auto-scrape is a platform for building, managing, and remotely deploying web scrapers. It provides the "essential infrastructure" for web scraping while letting developers focus on writing Selenium scraping scripts in a simple, familiar way.
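To give a feel for that style, here is a minimal, hypothetical sketch of a Selenium scraping script in plain Python; the target URL, the CSS selector, and the scrape() signature are illustrative assumptions, not auto-scrape's actual API.

# Hypothetical scraper sketch. The URL, selector, and scrape() signature
# are illustrative assumptions, not auto-scrape's actual API.
from selenium import webdriver
from selenium.webdriver.common.by import By

def scrape(driver: webdriver.Chrome) -> list[dict]:
    """Collect headline text from an example page."""
    driver.get("https://example.com/news")
    return [
        {"headline": element.text}
        for element in driver.find_elements(By.CSS_SELECTOR, "h2.headline")
    ]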

It is built with the Flask framework and uses SQLAlchemy to interface with the SQL database of your choice.
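As a rough sketch of what "the SQL database of your choice" means in practice, switching backends under SQLAlchemy is a one-line connection-string change; the config key below follows the standard Flask-SQLAlchemy convention, and whether auto-scrape reads exactly this key is an assumption.

# Sketch of switching database backends via the SQLAlchemy connection URI.
# The config key is the standard Flask-SQLAlchemy one; auto-scrape's actual
# config handling may differ.
from flask import Flask
from flask_sqlalchemy import SQLAlchemy

app = Flask(__name__)
app.config["SQLALCHEMY_DATABASE_URI"] = "sqlite:///autoscrape.db"
# ...or, for example: "postgresql://user:password@localhost/autoscrape"
db = SQLAlchemy(app)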

Demo

GIF screenshots demonstrating the user interface in action are available here.

Features:

  • live progress logging
  • database for saving scraped data - no database experience required!
  • CSV export (sketched below)
  • multiple simultaneous scrapers
  • basic resource management
  • basic user authentication for remote deployments (see the fea-simple-auth branch)
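For a sense of what the CSV export feature amounts to, here is a minimal sketch using Python's standard csv module; the rows-as-dicts shape is an assumption about how scraped records are held, not auto-scrape's internal format.

# Minimal CSV-export sketch using only the standard library. The
# rows-as-dicts shape is an assumption, not auto-scrape's internal format.
import csv

def export_csv(rows: list[dict], path: str) -> None:
    """Write scraped records to a CSV file with a header row."""
    if not rows:
        return
    with open(path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=rows[0].keys())
        writer.writeheader()
        writer.writerows(rows)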

Initial Project Setup

  1. Download chromedriver (matching your installed Chrome version) and place it in /autoscrape. Rename it to chromedriver.
  2. Install dependencies: pip install -r requirements.txt
  3. Set environment variables:

Windows:

$env:AUTOSCRAPE_ADMIN_USERNAME="your_admin_username"
$env:AUTOSCRAPE_ADMIN_PASSWORD="your_admin_password"

macOS / Linux:

export AUTOSCRAPE_ADMIN_USERNAME="your_admin_username"
export AUTOSCRAPE_ADMIN_PASSWORD="your_admin_password"

You can also store authentication details this way for scrapers that target sites behind a paywall (see the sketch after these steps).

  4. Start scraping:

    • Windows: ./dev.ps1
    • macOS / Linux: source ./dev.sh
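To make steps 1 and 3 concrete, here is a minimal sketch of how a scraper process might pick up the bundled chromedriver and the credentials stored in the environment; the paywall-credential variable name is hypothetical, while the AUTOSCRAPE_ADMIN_* names match the setup above.

# Sketch: read credentials from the environment and launch Chrome using the
# chromedriver placed in autoscrape/ during step 1. PAYWALL_USERNAME is a
# hypothetical name; the AUTOSCRAPE_ADMIN_* names match the setup above.
import os
from selenium import webdriver
from selenium.webdriver.chrome.service import Service

admin_user = os.environ["AUTOSCRAPE_ADMIN_USERNAME"]
admin_pass = os.environ["AUTOSCRAPE_ADMIN_PASSWORD"]
paywall_user = os.environ.get("PAYWALL_USERNAME", "")  # hypothetical name

driver = webdriver.Chrome(service=Service(executable_path="autoscrape/chromedriver"))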
