GitHub - DanielGall500/Voss: An API built to securely and efficiently handle requests to Machine Translation models.

Voss

A web API built to securely and efficiently handle requests between users and machine translation models.
Report Bug · Request Feature

Table of Contents

About The Project
- Built With
Getting Started
- Installation
Usage
Improvements
Contributing
License
Contact
Acknowledgments

About The Project

Voss is an API built to securely and efficiently distribute translation requests among machine translation models, typically hosted on other servers. The API is built using Python 3.10 and implemented in FastAPI. This work was carried out at the ADAPT Research Centre in Dublin City University for EUComMeet, a completely open-source platform to facilitate discussions among EU citizens on important and current topics. It is currently in use for deliberations on the EUComMeet platform serving translation between six languages: English, German, French, Italian, Polish, and Irish. This API is complementary to earlier work carried out which has been recently published in MDPI Analytics (Lohar et al, 2023). The name stems from the influential German translator Johann Heinrich Voss, known for his translations of Homer's Odyssey and Iliad.

mindmap
  root((Voss API))
    MT Server C
        GPU 1
            DE to FR
            EN to FR
            FR to EN
            GA to EN
            IT to EN
            PL to EN
        GPU 2
            DE to GA
            EN to GA
            FR to GA
            GA to FR
            IT to FR
            PL to FR
    MT Server B
        GPU 1
           DE to IT
           EN to IT
           FR to IT
           GA to IT
           IT to GA
           PL to GA
        GPU 2
           DE to PL
           EN to PL
           FR to PL
           GA to PL
           IT to PL
           PL to IT
    MT Server A
        GPU 1
           DE to EN
           EN to DE
           FR to DE
           GA to DE
           IT to DE
           PL to DE

(back to top)

Built With

The project is built using Python 3.10 and FastAPI, an asynchronous framework for developing APIs. It relies on Gunicorn and Uvicorn to improve performance by creating multiple Uvicorn workers to handle any incoming requests.

Server Configuration (SQLite)
- The server configuration, that is, a table which provides the server name, model ID, and GPU to use for each MT model, is stored in a SQLite database and loaded using pysqlite3, which is then loaded into a Python dictionary upon server startup. Further work on this project would take greater advantage of the features that comes with using SQLite.
Preprocessing (Sacremoses)
- Each translation request includes text which must be translated, and this text requires some pre-processing. We decided to handle this on the API side to reduce the load for the translation servers. Tokenization was carried out using sacremoses, a popular tool for language tokenization.
User Verification (Argon2, JWT)
- Password creation and authentication uses Argon2 password hashing. Through the /login endpoint users can verify their authenticity and receive a JSON web token (handled by pyjwt) which then must included in the headers of any translation request made to the server.

Note that Voss does not host the MT models itself, but rather interacts with models that are already hosted. For self-hosting the translation models yourself, please see OpenNMT MT Server

(back to top)

Getting Started

To get a local copy up and running follow the following steps. On the initial setup, the dashboard should note all MT models as offline before you have properly configured the MT servers.

Installation

With only a few steps, you can run this API for yourself and connect it to the servers where your MT models are being hosted. For our use case, we were connecting to the built-in OpenNMT MT server.

Clone the repo

git clone https://github.com/DanielGall500/Voss.git

Set up environment variables
This API requires the user to set up a number of environment variables to be set up in app/.env which are loaded using the Python dotenv package. These variables will reference each machine translation server where the models are being hosted, as well as the secret key and algorithm for the creation and decoding of JSON web tokens (JWTs). The server names must match those contained within the server setup SQLite database. For instance, if our translation models are listed as using servers EUCOM_A, EUCOM_B, and EUCOM_C in the database, it must be listed like so in the environment variables.
```
EUCOM_A_IP="[IP OF EUCOM_A]"
EUCOM_A_PORT=60000

EUCOM_B_IP="[IP OF EUCOM_B]"
EUCOM_B_PORT=60000

EUCOM_C_IP="[IP OF EUCOM_C]"
EUCOM_C_PORT=60000

SECRET_KEY="[YOUR SECRET KEY]"
JWT_ALGORITHM=HS256
```

Create a Docker image and container
Once you have set up your environment variables and a SQLite database, you can run this project using Docker.

FROM python:3.10

WORKDIR /code

COPY ./requirements.txt /code/requirements.txt

RUN pip install --no-cache-dir --upgrade -r /code/requirements.txt

COPY ./gunicorn_start.sh /code/gunicorn_start.sh
COPY ./app /code/app

CMD ["sh", "gunicorn_start.sh"]

An image and container can be created and run using the following commands:

docker build -t api-image .
docker run -d --name api-container -p 8080:80 api-image

This will connect Docker's internal port 80 to our external port 8080. The latter port is the one which you will be concerned with, as the other is setup automatically in the gunicorn_start.sh file.

Additional Setup

It is good practice to setup a reverse proxy such as Nginx to directly handle any incoming requests to your server for increasing security and performance. This was already handled by the institution in which this project was created, thus it wasn't necessary to implement here.

(back to top)

Usage

We will now look at both how one can make translation calls to the API, as well as login with a username and password.

Translate

Submit a translation request to the API. A JSON web token must be provided in the header, which is given to you once you log in.

AUTHORIZATIONS: JWTBearer

REQUEST BODY SCHEMA: application/json

src (required): string (Src) - The source language.
tgt (required): string (Tgt) - The target language.
text (required): string (Text) - The text which you want to translate.

Responses:

201 Successful Response
422 Validation Error

POST /translate

Request samples:

{
  "src": "en",
  "tgt": "de",
  "text": "Hello, my name is Daniel"
}

Response samples:

{
    "status": "SUCCESS",
    "result": {
        "de": "Hallo, ich heiße Daniel",
    }
}

Login

Log in using your username and password in order to receive a JWT. These details are stored in the SQLite database with hashed passwords.

REQUEST BODY SCHEMA: application/json

username (required): string (Username)
password (required): string (Password)

Responses:

201 Successful Response
422 Validation Error

POST /login

Request samples:

Payload Content type: application/json

{
  "username": "string",
  "password": "string"
}

Response samples:

Content type: application/json

201
422

{
    "token": "eyJhbGciOi..."
}

(back to top)

Improvements

The first version of this API was used in a pilot test of the EUComMeet project that took place in December of 2022. This pilot allowed us to see ways in which this version was difficult to work with, less than optimally secure, and not as efficient as other potential implementations. Version 2 was released in April 2023 and has made some major revisions. An overview of these revisions you can find below. There were three primary focuses when developing Version 2:

Reliability: Working towards eradicating bugs / server down-time. Rebuilding the web server from the ground up allowed for a focus on stronger design patterns. A dashboard has been created so that the current model status for each language pair can be checked at any time, and documentation allows developers to better understand each API endpoint for quicker debugging.
Speed: Improving the speed of the API so that many more users could theoretically be served without noticeable latency. This involves both hardware and software improvements.
Security: Steps taken to minimise chance of server security breach and Argon2 encryption of user login details has been implemented.

Asynchronous Web Server

The API previously relied on Flask and Waitress which provided a WSGI server that could handle requests one at a time. This was changed to FastAPI, an open-source python framework specifically designed for building asynchronous RESTful APIs. This allows for multiple requests to be handled simultaneously and therefore better response times.

Model Dashboard & Documentation

A dashboard GUI (view here) has been developed which displays the current status of each of the 30 language models, as well as the current translation latency for each model. From this dashboard one can also navigate to the documentation (view here) for the API, which contains the information needed in order to send requests and the types of responses one should expect. Note that without a username and password provided and stored by ADAPT, one cannot carry out any translations.

(back to top)

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!

Fork the Project
Create your Feature Branch (git checkout -b feature/AmazingFeature)
Commit your Changes (git commit -m 'Add some AmazingFeature')
Push to the Branch (git push origin feature/AmazingFeature)
Open a Pull Request

(back to top)

License

Distributed under the GPL-3.0 License. See LICENSE.txt for more information.

(back to top)

Contact

Daniel Gallagher - LinkedIn - [email protected]

Project Link: https://github.com/DanielGall500/Voss

(back to top)

Acknowledgments

(back to top)

Name		Name	Last commit message	Last commit date
Latest commit History 113 Commits
app		app
resources		resources
tests		tests
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
gunicorn_start.sh		gunicorn_start.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voss

About The Project

Built With

Getting Started

Installation

Additional Setup

Usage

Translate

Login

Improvements

Asynchronous Web Server

Model Dashboard & Documentation

Contributing

License

Contact

Acknowledgments

About

Releases

Packages

Languages

License

DanielGall500/Voss

Folders and files

Latest commit

History

Repository files navigation

Voss

About The Project

Built With

Getting Started

Installation

Additional Setup

Usage

Translate

Login

Improvements

Asynchronous Web Server

Model Dashboard & Documentation

Contributing

License

Contact

Acknowledgments

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages