Web scraping tool designed to effortlessly navigate websites and automatically download all types of files.
WebWorm is a Python script that scrapes and downloads files from a specified website URL. You can configure the crawl depth and restrict scraping to particular file extensions, and an optional flag detects the technologies the site uses.
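At its core, a tool like this combines a depth-limited crawl with an extension filter. The sketch below shows one way to structure that loop using `requests` and `BeautifulSoup`; the function names and link-handling heuristics are illustrative assumptions, not WebWorm's actual internals:

```python
import os
from urllib.parse import urljoin, urlparse

import requests
from bs4 import BeautifulSoup

# Extensions treated as "pages" to recurse into rather than files to save.
PAGE_EXTS = {"", "html", "htm", "php", "asp", "aspx"}

def crawl(url, depth, extensions, seen=None):
    """Visit pages up to `depth` links away, downloading matching files."""
    seen = set() if seen is None else seen
    if url in seen:
        return
    seen.add(url)
    soup = BeautifulSoup(requests.get(url, timeout=10).text, "html.parser")
    for tag in soup.find_all(["a", "img", "script"]):
        link = tag.get("href") or tag.get("src")
        if not link:
            continue
        target = urljoin(url, link)
        ext = os.path.splitext(urlparse(target).path)[1].lstrip(".").lower()
        if ext not in PAGE_EXTS and (extensions is None or ext in extensions):
            download(target)                            # a file: save it
        elif ext in PAGE_EXTS and depth > 0:
            crawl(target, depth - 1, extensions, seen)  # a page: recurse

def download(url):
    """Save a file into the working directory under its original name."""
    name = os.path.basename(urlparse(url).path) or "index.html"
    with open(name, "wb") as fh:
        fh.write(requests.get(url, timeout=10).content)

# Example: crawl("https://example.com", depth=1, extensions={"pdf", "jpg"})
```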
Follow these steps to install and run the script:
- Ensure you have Python installed on your system.
- Clone the repository:

  ```
  git clone https://github.com/m0hs1ne/WebWorm.git
  ```

- Install the required packages:

  ```
  pip install -r requirements.txt
  ```

- Run the script:

  ```
  python3 WebWorm.py <url> -d <depth> -e <extensions> [-t]
  ```
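For example, to crawl two levels deep, download only PDF and DOCX files, and report detected technologies (the target URL here is a placeholder):

```
python3 WebWorm.py https://example.com -d 2 -e "pdf,docx" -t
```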
```
usage: WebWorm.py [-h] [-e EXTENSIONS] [-d DEPTH] [-t] url

positional arguments:
  url                   The URL of the website to scrape.

options:
  -h, --help            show this help message and exit
  -e EXTENSIONS, --extensions EXTENSIONS
                        Comma-separated list of file extensions to scrape
                        (e.g., "jpg,png,docx"). If not specified, all files
                        will be scraped.
  -d DEPTH, --depth DEPTH
                        The maximum depth to crawl the website. Default is 1.
  -t, --tech            Detect technologies used on the website.
```
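The help text above maps onto an `argparse` interface along the following lines. This is a reconstruction from the usage output, not the script's actual source:

```python
import argparse

# Reconstructed CLI definition; mirrors the help text shown above.
parser = argparse.ArgumentParser(prog="WebWorm.py")
parser.add_argument("url", help="The URL of the website to scrape.")
parser.add_argument("-e", "--extensions",
                    help='Comma-separated list of file extensions to scrape '
                         '(e.g., "jpg,png,docx"). If not specified, all '
                         'files will be scraped.')
parser.add_argument("-d", "--depth", type=int, default=1,
                    help="The maximum depth to crawl the website. Default is 1.")
parser.add_argument("-t", "--tech", action="store_true",
                    help="Detect technologies used on the website.")

args = parser.parse_args()
# Normalize the extension filter: None means "scrape everything".
extensions = set(args.extensions.split(",")) if args.extensions else None
```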
Using the `-t` flag detects and reports the technologies used by the website.
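One common way to implement such detection is to fingerprint response headers and HTML `meta` tags. The sketch below illustrates that approach; it is an assumed technique (dedicated detectors such as Wappalyzer use far larger rule sets), not necessarily what WebWorm does internally:

```python
import requests
from bs4 import BeautifulSoup

def detect_tech(url):
    """Collect a few common technology fingerprints from one response."""
    resp = requests.get(url, timeout=10)
    findings = {}
    # The server-side stack often leaks through these headers.
    for header in ("Server", "X-Powered-By"):
        if header in resp.headers:
            findings[header] = resp.headers[header]
    # CMSs like WordPress advertise themselves in a generator meta tag.
    meta = BeautifulSoup(resp.text, "html.parser").find(
        "meta", attrs={"name": "generator"})
    if meta and meta.get("content"):
        findings["Generator"] = meta["content"]
    return findings  # e.g. {"Server": "nginx", "Generator": "WordPress 6.4"}
```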
Planned features:

- Add support for scraping multiple websites.
- Send requests with session cookies.
- Enumerate directories.
- Check for possible keys and secrets in JS files (a rough sketch of the idea follows this list).
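For that last item, a simple first pass is to match downloaded JavaScript against known key formats. The patterns below are illustrative examples, not a complete ruleset:

```python
import re

# Toy secret scanner: two well-known key formats plus a generic assignment
# pattern. A production scanner would add many more rules and entropy checks.
SECRET_PATTERNS = {
    "AWS access key": re.compile(r"AKIA[0-9A-Z]{16}"),
    "Google API key": re.compile(r"AIza[0-9A-Za-z\-_]{35}"),
    "Generic api_key assignment": re.compile(
        r"""api[_-]?key['"]?\s*[:=]\s*['"][0-9A-Za-z\-_]{16,}['"]""", re.I),
}

def scan_js(text):
    """Return (label, match) pairs for every pattern hit in a JS source."""
    return [(label, m.group(0))
            for label, rx in SECRET_PATTERNS.items()
            for m in rx.finditer(text)]
```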
Your contributions are welcome! Whether you're fixing bugs, adding new features, or improving documentation, we appreciate your help in making WebWorm better.
- Fork the Project
- Create your Feature Branch (`git checkout -b feature/AmazingFeature`)
- Commit your Changes (`git commit -m 'Add some AmazingFeature'`)
- Push to the Branch (`git push origin feature/AmazingFeature`)
- Open a Pull Request
- m0hs1ne - Initial work