fix(server): sanitize logs and error details by Thump604 · Pull Request #341 · waybarrios/vllm-mlx

Thump604 · 2026-04-14T16:22:37Z

Summary

sanitize control characters before logging untrusted request previews and exception strings in the server
stop returning raw exception text in 500 responses from embeddings, transcription, and speech endpoints
add regression coverage for sanitized request logging and generic internal error responses

Test plan

PYTHONPATH=/private/tmp/vllm-mlx-issue68-log-sanitize /opt/ai-runtime/venv-live/bin/python -m pytest tests/test_server.py -q
/opt/ai-runtime/venv-live/bin/python -m black --check --fast vllm_mlx/server.py tests/test_server.py
/opt/ai-runtime/venv-live/bin/python -m compileall vllm_mlx/server.py tests/test_server.py

Part of #68 (finding #12).

janhilgard

Review: fix(server): sanitize logs and error details

Addresses: Issue #68, finding #12 (log injection) and finding #21 (exception detail leakage)

Summary: Introduces _sanitize_log_text() to escape control characters before logging untrusted input. Replaces raw detail=str(e) in 500 responses with generic messages via _log_and_raise_internal_error().
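For readers without the diff open, a sanitizer of this kind might look roughly like the sketch below. This is not the code from the PR: the name sanitize_log_text, the 500-character default for limit, the "..." suffix, and the exact escape forms are all assumptions chosen to match the behavior described in this review.

```python
def sanitize_log_text(value: object, limit: int = 500) -> str:
    """Escape control and non-printable characters, then cap the length.

    Hypothetical sketch; the default limit and "..." suffix are assumptions.
    """
    pieces = []
    for ch in str(value):
        if ch == "\n":
            pieces.append("\\n")
        elif ch == "\r":
            pieces.append("\\r")
        elif ch == "\t":
            pieces.append("\\t")
        elif not ch.isprintable():
            # Covers ESC (used by ANSI sequences), other C0/C1 controls,
            # and Unicode line/paragraph separators such as U+2028.
            code = ord(ch)
            if code <= 0xFF:
                pieces.append(f"\\x{code:02x}")
            elif code <= 0xFFFF:
                pieces.append(f"\\u{code:04x}")
            else:
                pieces.append(f"\\U{code:08x}")
        else:
            pieces.append(ch)
    escaped = "".join(pieces)
    if len(escaped) > limit:
        # Naive cut: may split an escape sequence near the limit.
        return escaped[:limit] + "..."
    return escaped


# A forged log line and an ANSI color sequence both end up on one line.
print(sanitize_log_text("hello\nINFO forged entry \x1b[31mred"))
```

With this shape, a newline in a request preview can no longer start a new log record, and an ESC byte reaches the terminal as the literal text \x1b.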

Strengths

Good threat model. ANSI escape injection and newline injection in logs are real risks (log forging, terminal escape attacks). Escaping \n, \r, \t, and non-printable chars is the right approach.
Generic 500 responses. Replacing detail=str(e) with static messages like "Embedding generation failed" prevents internal state leakage to API clients. The full sanitized error is still logged server-side.
_log_and_raise_internal_error helper is clean and reduces boilerplate at each call site.
Consistent application across many log sites: prompt previews, user message previews, exception handlers, tool parser errors, MCP init, cache load/save.
Migrated f-strings to %-style logging in sanitized sites, which is better practice (avoids formatting cost when log level is disabled).
Solid test coverage: sanitizer unit test, integration test for prompt preview escaping, integration test verifying generic 500 detail.
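The generic-500 pattern praised above can be sketched as follows. Everything here is illustrative rather than taken from the PR: InternalServerError is a stand-in for whatever HTTP exception the server framework provides, _escape is a simplified placeholder for the real sanitizer, and the helper's signature is an assumption.

```python
import logging
from typing import NoReturn

logger = logging.getLogger("server_sketch")


class InternalServerError(Exception):
    """Hypothetical stand-in for a framework HTTP exception."""

    def __init__(self, status_code: int, detail: str) -> None:
        super().__init__(detail)
        self.status_code = status_code
        self.detail = detail


def _escape(text: str) -> str:
    # Placeholder sanitizer: unicode_escape turns newlines, control
    # characters, backslashes and non-ASCII text into ASCII escapes.
    return text.encode("unicode_escape").decode("ascii")


def log_and_raise_internal_error(public_detail: str, exc: Exception) -> NoReturn:
    """Log the sanitized error server-side and raise a generic 500."""
    # %-style arguments are only interpolated if the record is emitted.
    logger.error("%s: %s", public_detail, _escape(str(exc)))
    # The client-facing detail is a static string, never str(exc).
    raise InternalServerError(500, public_detail) from exc


def embed_or_fail() -> None:
    try:
        raise RuntimeError("weights missing at /srv/models/x\nINFO forged line")
    except Exception as e:
        log_and_raise_internal_error("Embedding generation failed", e)


try:
    embed_or_fail()
except InternalServerError as err:
    # Only the status code and the generic message reach the caller.
    print(err.status_code, err.detail)
```

The file path and the injected newline appear only in the server log, and only in escaped form. Because the helper always raises, NoReturn is the accurate return annotation for it.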

Issues / Suggestions

_sanitize_log_text does not escape the backslash itself. If the input contains a literal backslash (e.g. "foo\nbar" where \n is the two characters \ + n, not a newline), the output will contain foo\nbar, which looks identical to an escaped newline. This is probably fine for log readability, but worth noting. If you want unambiguous round-tripping, escape backslashes first, before any other replacement, so that each \ becomes \\.
Some log sites were not migrated. For example, logger.info(f"Initialized tool call parser: {_tool_call_parser}") on line ~2451 still uses an f-string. The parser name comes from CLI args, so it is operator-controlled rather than user-controlled, and this is fine. Just flagging that the boundary of "what gets sanitized" is implicitly "user-controlled input" vs "operator-controlled config", which is a reasonable distinction.
_log_and_raise_internal_error always raises, but the return type is None. Adding -> NoReturn as the return type annotation would help static analysis and make the control flow clearer to readers:
```
from typing import NoReturn
def _log_and_raise_internal_error(...) -> NoReturn:
```
The limit parameter can truncate in the middle of a multi-character escape sequence. _sanitize_log_text("x" * 499 + "\u2028") would produce 499 x's + \u2028 (6 chars) = 505 chars, truncated to "xxx...2028..." -- the escape sequence gets cut. This is cosmetic, not a security issue, but a note in the docstring would be nice.
exc_info=True was removed from cache load/save warnings. The original code had logger.warning(..., exc_info=True) which logs the full traceback. The new code only logs the sanitized exception string. For debugging cache issues, the traceback is useful. Consider keeping exc_info=True alongside the sanitized message.
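The backslash, truncation, and exc_info suggestions above could be combined along the lines of the sketch below. It is a hypothetical variant, not the PR's implementation: the function names, the 500-character default, the "..." suffix, and the log message text are assumptions.

```python
import logging
from typing import Iterator

logger = logging.getLogger("cache_sketch")


def escape_tokens(text: str) -> Iterator[str]:
    """Yield one output token per input character."""
    for ch in text:
        if ch == "\\":
            # Escaping the backslash itself keeps the output unambiguous.
            yield "\\\\"
        elif ch == "\n":
            yield "\\n"
        elif ch == "\r":
            yield "\\r"
        elif ch == "\t":
            yield "\\t"
        elif not ch.isprintable():
            code = ord(ch)
            if code <= 0xFF:
                yield f"\\x{code:02x}"
            elif code <= 0xFFFF:
                yield f"\\u{code:04x}"
            else:
                yield f"\\U{code:08x}"
        else:
            yield ch


def sanitize_unambiguous(value: object, limit: int = 500) -> str:
    """Escape, then truncate only on token boundaries."""
    out = []
    used = 0
    for token in escape_tokens(str(value)):
        if used + len(token) > limit:
            # Drop the whole token rather than emit half an escape.
            return "".join(out) + "..."
        out.append(token)
        used += len(token)
    return "".join(out)


def warn_cache_failure(exc: Exception) -> None:
    # Sanitized message plus the traceback, via exc_info.
    logger.warning(
        "Failed to load cache: %s", sanitize_unambiguous(exc), exc_info=exc
    )


print(sanitize_unambiguous("foo\\nbar"))  # literal backslash followed by n
print(sanitize_unambiguous("foo\nbar"))   # real newline
```

The two print calls produce different output, so a reader of the log can tell a literal backslash from an escaped newline. One caveat on the last point: the traceback that logging renders for exc_info includes the exception's own message unsanitized, so keeping it trades some of the sanitization for debuggability on those records.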

Verdict: Clean and well-targeted. The change meaningfully improves the security posture of the logging and error response layer.

This was referenced Apr 14, 2026

security: sanitize log injection and stop leaking exception details to clients #342

Closed

Security audit: authentication bypass, SSRF, and other vulnerabilities #68

Closed

janhilgard reviewed Apr 15, 2026

Thump604 force-pushed the codex/issue68-log-sanitize branch 2 times, most recently from 71711d9 to 013177e on April 18, 2026 02:04

fix(server): sanitize logs and error details

2be6d26

Thump604 force-pushed the codex/issue68-log-sanitize branch from 013177e to 2be6d26 on April 18, 2026 02:09

Thump604 merged commit a6b23a3 into waybarrios:main Apr 18, 2026
9 checks passed

arozanov pushed a commit to arozanov/vllm-mlx that referenced this pull request Apr 30, 2026

Merge pull request waybarrios#341 from Thump604/codex/issue68-log-san…

67f3eee

…itize fix(server): sanitize logs and error details
