
scraper-challenge

A tool that takes a website URL as input, scrapes its content, and classifies visitors based on their interests or industry. Part of a data pipeline to dynamically generate questions and multiple-choice options that help categorize users visiting the site.

Requirements:

The tech stack must use React and Redux on the frontend, and Python, Flask, and AWS for the backend and cloud.

Quickstart

It is recommended to run this locally in a virtual environment, e.g. `python3 -m venv .venv` followed by `source .venv/bin/activate`.

To run the Flask app, run `flask --app frontend run --debug` in your command line. Then run `pnpm dev` for the React frontend.
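
For orientation, here is a minimal sketch of what the Flask entry point might look like. The module name, route, and response are assumptions for illustration, not taken from this repository.

```python
# Hypothetical Flask entry point (module and route names are illustrative).
from flask import Flask, jsonify

app = Flask(__name__)

@app.route("/health")
def health():
    # Simple liveness check for the backend.
    return jsonify(status="ok")

if __name__ == "__main__":
    # Equivalent to running `flask --app <module> run --debug` during development.
    app.run(debug=True)
```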

To run the script that confirms the app can connect to MongoDB, first create a `.env` file in the root directory:

```
SCRAPER_APP_CONNECTION_STRING=<your-connection-string>
```

Then run `python3 app/db.py`.
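
For reference, app/db.py likely amounts to a connection check along these lines. This is a sketch assuming pymongo and python-dotenv are installed; the actual script may differ.

```python
# Hypothetical sketch of a MongoDB connection check (the real app/db.py may differ).
import os

from dotenv import load_dotenv
from pymongo import MongoClient

load_dotenv()  # reads SCRAPER_APP_CONNECTION_STRING from the .env file


def get_client() -> MongoClient:
    # The connection string stays out of source control via the .env file.
    return MongoClient(os.environ["SCRAPER_APP_CONNECTION_STRING"])


if __name__ == "__main__":
    client = get_client()
    client.admin.command("ping")  # raises an exception if the server is unreachable
    print("Connected to MongoDB")
```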

Project Components:

  • Scraper (a minimal sketch follows this list)
  • Data-sorting pipeline & chatbot
  • MongoDB
  • SPA frontend
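
As a rough illustration of the scraper component, the sketch below fetches a page and extracts its visible text using requests and BeautifulSoup. These libraries and names are assumptions; the scraper in this repository may be implemented differently.

```python
# Hypothetical scraper sketch: fetch a URL and return its visible text.
import requests
from bs4 import BeautifulSoup


def scrape_text(url: str) -> str:
    response = requests.get(url, timeout=10)
    response.raise_for_status()
    soup = BeautifulSoup(response.text, "html.parser")
    # Drop script and style tags so only human-readable content remains.
    for tag in soup(["script", "style"]):
        tag.decompose()
    # Collapse whitespace into single spaces.
    return " ".join(soup.get_text(separator=" ").split())


if __name__ == "__main__":
    print(scrape_text("https://example.com")[:500])
```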
