Lesson 239: Use multiprocess.MultiProcessCollector from prometheus_client #412

daxartio · 2025-01-20T10:38:44Z

Since the app runs with four workers ("gunicorn", "-w", "4"), meaning four separate processes, I recommend using multiprocess.MultiProcessCollector from prometheus_client. Otherwise, each scrape returns metrics only from whichever worker process happens to handle that request.

tutorials/lessons/239/fastapi-app/main.py

Line 21 in feab6d6

app.mount("/metrics", metrics_app)

https://prometheus.github.io/client_python/exporting/http/fastapi-gunicorn/

from fastapi import FastAPI
from prometheus_client import CollectorRegistry, make_asgi_app, multiprocess

app = FastAPI(debug=False)


# Using multiprocess collector for registry.
# Requires the PROMETHEUS_MULTIPROC_DIR environment variable to point at a
# writable directory shared by all workers.
def make_metrics_app():
    registry = CollectorRegistry()
    multiprocess.MultiProcessCollector(registry)
    return make_asgi_app(registry=registry)


metrics_app = make_metrics_app()
app.mount("/metrics", metrics_app)
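
To illustrate why a single worker's view is incomplete, here is a toy model using only the standard library. It does not use prometheus_client; the function names (simulate, scrape_single_worker, scrape_aggregated) are made up for this sketch, and the per-worker lists stand in for each process's private registry.

```python
import random

# Toy model: each "worker" keeps its own in-memory counter, just as each
# gunicorn worker process keeps its own default registry.
NUM_WORKERS = 4  # mirrors "gunicorn -w 4" from the issue


def simulate(requests: int, seed: int = 0) -> list[int]:
    """Distribute requests across workers and return per-worker request counts."""
    rng = random.Random(seed)
    counters = [0] * NUM_WORKERS
    for _ in range(requests):
        counters[rng.randrange(NUM_WORKERS)] += 1
    return counters


def scrape_single_worker(counters: list[int], rng: random.Random) -> int:
    """Without aggregation, a scrape sees only the worker that serves it."""
    return counters[rng.randrange(len(counters))]


def scrape_aggregated(counters: list[int]) -> int:
    """With aggregation across workers, a scrape sees the sum over all of them."""
    return sum(counters)


if __name__ == "__main__":
    counters = simulate(1000)
    print("per-worker:", counters)
    print("single-worker scrape:", scrape_single_worker(counters, random.Random(1)))
    print("aggregated scrape:", scrape_aggregated(counters))
```

In this model the single-worker scrape reports roughly a quarter of the traffic and changes depending on which worker answers, while the aggregated scrape always reports the full total. MultiProcessCollector plays the role of the aggregated scrape in the real setup.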


daxartio · 2025-01-20T10:45:38Z

It’s better to use async handlers because sync handlers always run in a thread pool (which has 40 threads in AnyIO, if I’m not mistaken). However, since Python has a GIL and only one thread can execute Python code at a time, handing such small handlers off to a thread creates unnecessary overhead.

@app.get("/healthz", response_class=PlainTextResponse)
async def health():
    return "OK"


@app.get("/api/devices", response_class=ORJSONResponse)
async def get_devices():
    ...
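
The thread hand-off described above can be observed with a small standard-library sketch. It uses asyncio.to_thread as an analogue; FastAPI itself goes through AnyIO's thread pool, so this is an illustration of the mechanism rather than the framework's actual code path. The names sync_handler and async_handler are placeholders.

```python
import asyncio
import threading


def sync_handler() -> str:
    """Stand-in for a sync endpoint: reports the thread it runs on."""
    return threading.current_thread().name


async def async_handler() -> str:
    """Stand-in for an async endpoint: runs directly on the event loop thread."""
    return threading.current_thread().name


async def main() -> tuple[str, str, str]:
    loop_thread = threading.current_thread().name
    # A sync handler has to be handed to a worker thread so it cannot block the loop.
    via_pool = await asyncio.to_thread(sync_handler)
    # An async handler is simply awaited; no thread hand-off takes place.
    direct = await async_handler()
    return loop_thread, via_pool, direct


if __name__ == "__main__":
    loop_thread, via_pool, direct = asyncio.run(main())
    print(f"event loop thread: {loop_thread}")
    print(f"sync handler ran on: {via_pool}")
    print(f"async handler ran on: {direct}")
```

One caveat worth keeping in mind: an async handler should not make blocking calls, because it would stall the event loop for every other request on that worker.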

daxartio · 2025-01-20T10:58:03Z

It’s not good practice to acquire the database connection through a dependency like this:

async def get_db() -> AsyncGenerator[asyncpg.Connection, None]:
    async with db.get_connection() as conn:
        yield conn


@app.post("/api/devices", status_code=201, response_class=ORJSONResponse)
async def create_device(
    device: DeviceRequest, conn: PostgresDep, cache_client: MemcachedDep
):
    ...
    await conn...
    await cache_client...
    ...

The problem is that await cache_client... executes after await conn... while the dependency still holds the connection, so the connection stays checked out during the cache operation. This prevents it from being returned to the pool in a timely manner.

A better approach is:

@app.post("/api/devices", status_code=201, response_class=ORJSONResponse)
async def create_device(
    device: DeviceRequest, db: PostgresDep, cache_client: MemcachedDep
):
    ...
    async with db.get_connection() as conn:
        await conn...
    await cache_client...
    ...

This way, the connection is properly returned to the pool before executing the cache operation, improving resource management.
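
The difference between the two versions can be checked with a small simulation. FakePool, cache_call, handler_holding, and handler_releasing are hypothetical names for this sketch; the pool only counts checked-out connections and does not talk to a real database or cache.

```python
import asyncio
from contextlib import asynccontextmanager


class FakePool:
    """Hypothetical stand-in for a connection pool; tracks checked-out connections."""

    def __init__(self) -> None:
        self.in_use = 0

    @asynccontextmanager
    async def get_connection(self):
        self.in_use += 1
        try:
            yield "conn"
        finally:
            self.in_use -= 1


async def cache_call(pool: FakePool) -> int:
    """Pretend cache operation; reports how many connections are held while it runs."""
    await asyncio.sleep(0)
    return pool.in_use


async def handler_holding(pool: FakePool) -> int:
    # Mirrors the dependency-injected version: the connection spans the whole handler.
    async with pool.get_connection():
        await asyncio.sleep(0)  # database work
        return await cache_call(pool)


async def handler_releasing(pool: FakePool) -> int:
    # Mirrors the suggested version: the connection is released before the cache call.
    async with pool.get_connection():
        await asyncio.sleep(0)  # database work
    return await cache_call(pool)


if __name__ == "__main__":
    pool = FakePool()
    print("held during cache call (dependency):", asyncio.run(handler_holding(pool)))
    print("held during cache call (explicit):", asyncio.run(handler_releasing(pool)))
```

In the first handler one connection is still checked out while the cache call runs; in the second, none is. Under load, that gap is time during which another request could have used the connection.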

daxartio · 2025-01-20T11:02:13Z

I think Python is not faster than Go or Node.js, and that’s okay. But these changes will slightly improve performance and help manage expectations so people won’t be too disappointed.
