JCrawl

JCrawl - Java Websites Crawler

JCrawl is a basic web crawler implemented in Java, designed to scrape web pages starting from a given URL and extracting links from those pages. Web crawling is the process of navigating and extracting information from web pages, often used by search engines and web scrapers

Features

Web crawling from a starting URL.
Specify the number of links to scrape using a breakpoint.
Extract links from web pages.

Prerequisites

Java Development Kit (JDK) installed on your system.

Usage

Clone or download this repository to your local machine.
Compile the JCrawl.java file using javac: javac JCrawl.java

Run the porgram:

java JCrawl

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
JCrawl.java		JCrawl.java
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

JCrawl

JCrawl - Java Websites Crawler

Table of Contents

Features

Prerequisites

Usage

Run the porgram:

About

Releases

Packages

Contributors 2

Languages

License

Anzo52/JCrawl

Folders and files

Latest commit

History

Repository files navigation

JCrawl

JCrawl - Java Websites Crawler

Table of Contents

Features

Prerequisites

Usage

Run the porgram:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages