Author: Tatjana Chernenko, 2024
This project aims to automate the process of extracting job postings from LinkedIn, storing them in a local MongoDB database, and notifying the user via email about relevant job opportunities. It offers two modes of operation: one-time scraping and continuous scraping.
- Web scraping of LinkedIn job postings.
- Saving job postings to a MongoDB database.
- Two modes of operation: one-time scraping or continuous scraping.
- Saving job postings to a CSV file.
- Email notification for job postings matching predefined keywords.
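The keyword-matching and CSV-export steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names and the posting fields (`title`, `company`, `link`) are assumptions made for the example.

```python
import csv

def matches_keywords(posting, keywords):
    """Return True if any keyword appears in the posting's title (case-insensitive)."""
    title = posting.get("title", "").lower()
    return any(kw.lower() in title for kw in keywords)

def save_to_csv(postings, path):
    """Write the postings to a CSV file with a header row."""
    fields = ["title", "company", "link"]
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=fields)
        writer.writeheader()
        writer.writerows(postings)

# Example: keep only postings whose title matches a predefined keyword.
postings = [
    {"title": "Senior Python Developer", "company": "Acme", "link": "https://example.com/1"},
    {"title": "Sales Manager", "company": "Acme", "link": "https://example.com/2"},
]
relevant = [p for p in postings if matches_keywords(p, ["python", "data"])]
save_to_csv(relevant, "jobs.csv")
print(len(relevant))  # → 1
```

In the real project, the matching postings would additionally be inserted into MongoDB and trigger the email notification.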
- Clone the repository: `git clone https://github.com/TatjanaChernenko/mongodb_webscrapping`
- Navigate to the project directory: `cd mongodb_webscrapping`
- Install the required dependencies using pip: `pip install -r requirements.txt`
- Set up a Google Cloud Platform project and obtain credentials for the Gmail API. Refer to the Gmail API documentation for detailed instructions.
- Rename the downloaded client secrets file to `client_secret.json` and place it in the project's root directory.
- Configure the parameters in the `config.ini` file, including your email addresses, LinkedIn search parameters, and other settings.
- Run the `main.py` file.
You can customize the project's behavior by editing the `config.ini` file. Here are the available parameters:

- `email_sender`: Your email address for sending notifications.
- `email_recipient`: Recipient email address for receiving notifications.
- `linkedin_pages`: Number of LinkedIn pages to scrape.
- `keywords`: Keywords to match for suitable job postings.
- `page_number`: Initial page number for scraping.
- `sleep_time`: Time interval between scraping requests.
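For orientation, a filled-in `config.ini` might look like the fragment below. The section name `[settings]` and the value formats are assumptions for illustration; check the sample file shipped with the repository for the actual layout.

```ini
[settings]
email_sender = you@example.com
email_recipient = you@example.com
linkedin_pages = 5
keywords = python, data engineer
page_number = 0
sleep_time = 60
```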
This project is licensed under the terms of the MIT License.