Menu Scraper

This is a web scraper which scrapes the menus of any sv-group restaurant and uploads them to the menu-api. It is able to upload all menus accessible on the sv-group website for a specific restaurant (including those for the coming days of the week).

Usage

Every time the scraper is run, it will scrape the menus of the current week, upload them to the configured menu-api and exit.

Ideally the scraper is run once a day in something like a cron-job, so that the menus are always up to date.

Configuration

The configuration of the scraper is stored at [os-config-dir]/menu-scraper/menu-scraper.toml. Should this file not exist, it will be created with default values which then need to be replaced.

api_remote = '' # The url of the menu-api
website_remote = '' # The url of the sv-group website which should be scraped

The configuration values can be overwritten by the following environment variables:

API overwrites the api_remote value

WEBSITE overwrites the website_remote value

When an environment variable is set, it will be used instead of the configuration value.

Hosting with docker

The menu scraper is also available as a docker image published to the GitHub container registry. The image contains a cronjob which runs the scraper once a day (at 00:00 UTC). The image can be pulled with the following command:

docker pull ghcr.io/virtbad/menu-scraper:latest

Note Every image has its own tag, which is the same as the version of the scraper. You can find all available tags here. To get the latest version use the latest tag.

Important The container needs to be run with the --init flag (or init: true in docker-compose) to work properly. This is due to an issue in cron (dubiousjim/dcron#13 (comment)).

Configuration

Once pulled you need to run the container with the following environment variables to configure it properly. The scraper needs to be able to access the api and the website, so you need to provide the urls to them.

API="" # The url of the menu-api
WEBSITE="" # The url of the sv-group website which should be scraped
INITIAL_RUN="true" # Boolean whether the scraper should run immediately after startup (default: true)

Related Projects

License

Coming Soon.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.github/workflows		.github/workflows
src		src
.dockerignore		.dockerignore
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
README.md		README.md
menu-scraper-rs.iml		menu-scraper-rs.iml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Menu Scraper

Usage

Configuration

Hosting with docker

Configuration

Related Projects

License

About

Releases

Packages

Languages

virtbad/menu-scraper-rs

Folders and files

Latest commit

History

Repository files navigation

Menu Scraper

Usage

Configuration

Hosting with docker

Configuration

Related Projects

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages