🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
-
Updated
Jan 18, 2025 - Python
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
🎭 Playwright integration for Scrapy
A package acting as a wrapper around the headless mode of existing web browsers to generate images from URLs and from HTML+CSS strings or files.
Run Selenium with Python via Github Actions using Headless or Non-Headless browsers!
Example of username and password proxy authentication for use in Selenium
Pyppeteer integration for Scrapy
Scrapfly Python SDK for headless browsers and proxy rotation
🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...
Web crawler and scraper based on Scrapy and Playwright's headless browser.
An embeddable headless browser package for Python that provides a simplified interface for interacting with web pages using Selenium and Selenium Hub.
Smart Scraper: An AI-powered web scraping framework that uses headless browsers, asynchronous programming, and adaptive parsing to extract data efficiently from diverse websites. Includes a user-friendly dashboard and supports cloud deployment.
Dare2024.com Solver is a Python automation script for seamlessly solving Dare2024.com quizzes. Impress your friends with correct answers effortlessly. Compatible with all dare2024.com versions and future updates.
Automates the feedback submission process for Amrita University Management System (AUMS) using Selenium WebDriver. This project handles web forms, dynamic elements, and iframe interactions, all while ensuring smooth, headless operation and robust error handling.
A Python script that checks whether a password has been compromised using the Have I Been Pwned service. The script automates the process of querying the website and retrieving the results for the given password, leveraging Selenium and a headless Firefox browser. It’s a simple tool for testing password security and checking for data breaches.
COVID-19 Apple Mobility Trends Reports
Automated Selenium-based scraper for extracting data from Myntra
This repository contains a Python script that simulates views on a GitHub profile by repeatedly reloading the profile page. The script uses the selenium and requests libraries to fetch the content of the profile page and then reloads the page in a headless Firefox browser.
Automated Selenium-based scraper for extracting and analyzing job listings from Glassdoor
Add a description, image, and links to the headless-browser topic page so that developers can more easily learn about it.
To associate your repository with the headless-browser topic, visit your repo's landing page and select "manage topics."