Skip to content

mytusshar/scrap

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 

Repository files navigation

Scraper

This is a simple web-scraper which downloads all URLS from the source website to a file. It is implemented in NodeJS, without using any external module.

Installation

Requirements

  • NodeJS V9

Running Scraper

  • Clone this repository.

  • Open 'scraper' directory.

  • Run following command:

    $ node scrap

Changing website URL

Go to scraper folder, and open constants.js.

    //change the website name
    exports.TARGET_WEBSITE = 'https://medium.com';
    //Maximum parallel connections allowed.
    exports.MAX_CONN = 5;
    //output file name.
    exports.FILE_NAME = 'output.txt';

About

NodeJS scraper.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published