WebScraper
Web Scraper for Visitor Classification.
- General Info
- Technologies Used
- Screenshots
- Usage
- Project Status
- Room for Improvement
- Acknowledgements
- Contact
A Single Page Application that takes a website URL as input, scrapes its content, and classifies visitors based on their interests or industry. The goal is to dynamically generate questions and multiple-choice options that help categorize users visiting the site.
-
Project Presentation: Loom Video
-
The most Up-to-date and Deployed Frontend Repo can be viewed at: https://github.com/behi22/visitor-classifier-frontend
-
The most Up-to-date and Deployed Backend Repo can be viewed at: https://github.com/behi22/visitor-classifier
- npm - 8.15.0
- React.js - 18.3.1
- Redux - 9.1.2
- antd - 5.22.2
- HTML - version html5
- CSS
- babel
- Axios
- AJAX
- git version 2.38.1.windows.1
- github
- Linux
- WSL
- Python
- Flask
- PostgreSQL
- Vercel
- Redis
- Render
The app should have the following features:
- Frontend - Neat and User-Friendly component based Frontend, created with React and deployed using Vercel
- Backend API - Python-based API, Properly implementing web scraping, data extraction, and AI-based content generation, deployed using Render
- Storage - Utilize PostgreSQL database for storage, Hosted on Render
- Caching - Utilize Redis for caching, Hosted on Redis Cloud
- Effective integration of Frontend and Backend components
Project is: Semi-Complete (Demo)
-
As indicated in the comments in Home.js, currently the answers for each question aren't submitted anywhere, and the logic could be developed further.
-
The script for generating questions in App.py is still very primitive and could be developed further with more time and resources at hand, so that we could generate more meaningful questions.
-
There is an issue with the Missing Answers StyledParagraph inside Home.js where it is still visible after submitting partial answers and changing the URL, that needs further time in debugging in order to resolve.
- Many thanks to Brave Career for including me in their Software Engineer assessment project.
Created by Behbod Babai - feel free to contact me via email! my email: behibabai@gmail.com