A tool that takes a website URL as input, scrapes its content, and classifies visitors based on their interests or industry. Part of a data pipeline to dynamically generate questions and multiple-choice options that help categorize users visiting the site.
Tech stack must be Frontend: React, Redux | Backend & Cloud: Python, Flask, AWS
It is recommended to run this locally in a virtual env. Run . venv .venv
.
To run the Flask app, run flask --app frontend run --debug
in your command line. Then pnpm dev
for the backend.
To run the file that confirms we can connect to mongodb, first create an .env
file in the root directory:
SCRAPER_APP_CONNECTION_STRING=<your-connection-string>
Then simply run python3 app/db.py
.
- Scraper
- data-sorting pipeline & chatbot
- MongoDB
- SPA frontend