Actually use FastMCP by keitwb · Pull Request #110 · openshift-assisted/assisted-service-mcp

keitwb · 2025-09-24T18:32:47Z

Previously we were installing the package but using the official SDK for of FastMCP 1.0. Using FastMCP directly gives us a lot more features should we need them.

Summary by CodeRabbit

New Features
- Streamable HTTP responses enabled by default for smoother, real-time interactions.
Bug Fixes
- More reliable token detection by consistently reading tokens from HTTP headers.
- Session identifiers are no longer forwarded during tool calls, reducing exposure of sensitive data.
Refactor
- Simplified tool discovery for improved responsiveness and stability.
Chores
- Updated minimum fastmcp dependency to a newer compatible version.
Tests
- Tests updated to use payload-based dispatch and header-mocking for HTTP behavior.

coderabbitai · 2025-09-24T18:32:54Z

Walkthrough

Updated minimum fastmcp version and refactored server to use fastmcp.FastMCP, header-based token extraction via get_http_headers, added StripSessionIDMiddleware, unified http_app transport, simplified tool listing, and adapted tests to a .run(payload) interface with header-based mocks and response.content usage.

Changes

Cohort / File(s)	Summary
Dependency version bump `pyproject.toml`	Raise `fastmcp` requirement from `>=2.8.0` to `>=2.12.3`.
Server refactor & middleware `server.py`	Replace direct internal FastMCP usage with `from fastmcp import FastMCP`; switch token retrieval to `get_http_headers` (authorization, ocm-offline-token); add `StripSessionIDMiddleware` to remove `session_id` from tool call contexts; simplify tool listing via `mcp.get_tools().keys()`; always use `http_app(transport="streamable-http")`.
Tests updated for header-based HTTP flow `tests/test_server.py`	Change tests to invoke server via `.run(payload_dict)`; mock HTTP headers via `mock_mcp_get_http_headers`; adapt to `TextContent` responses and `result.content[0].text`; update typing/imports and token precedence assertions.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  participant Client
  participant HTTPApp as HTTP App (streamable-http)
  participant Middleware as StripSessionIDMiddleware
  participant Headers as get_http_headers
  participant FastMCP as FastMCP

  Client->>HTTPApp: HTTP tool call (headers + payload)
  HTTPApp->>Middleware: on_call_tool(context)
  Middleware->>Headers: get_http_headers(include_all=true)
  Headers-->>Middleware: headers (authorization, ocm-offline-token)
  note right of Middleware: Remove `session_id` from context
  Middleware->>FastMCP: call tool (context without session_id, tokens)
  FastMCP-->>Middleware: ToolResult
  Middleware-->>HTTPApp: ToolResult
  HTTPApp-->>Client: HTTP response (content/text)

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

MGMT-21086: support different protocols #38 — Similar changes: switch to fastmcp.FastMCP, header-based token retrieval via get_http_headers, tool listing change, and fastmcp version bump.

Suggested labels

lgtm

Suggested reviewers

eranco74
carbonin

Poem

A rabbit taps the headers’ trail,
Snips a session ID—light as a tail.
Tools queue up, tidy keys in sight,
Streamable HTTP hums through the night.
FastMCP hops, the tests run bright. 🐇✨

Pre-merge checks and finishing touches

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title Check	✅ Passed	The title succinctly captures the main change of the pull request, which is switching the codebase to use the FastMCP package directly rather than the official SDK.
Docstring Coverage	✅ Passed	Docstring coverage is 97.56% which is sufficient. The required threshold is 80.00%.

✨ Finishing touches

📝 Generate Docstrings

🧪 Generate unit tests

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

openshift-ci · 2025-09-24T18:32:56Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: keitwb

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~OWNERS~~ [keitwb]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (7)

pyproject.toml (1)
9-9: FastMCP bump aligns with code changes; consider upper-bounding and confirm transitive mcp types availability

Optional: add an upper bound to avoid unexpected v3 breaking changes.

Verify that fastmcp>=2.12.3 brings in mcp (for mcp.types.TextContent in tests) transitively in your environment.

Apply this diff if you want an upper bound:
-    "fastmcp>=2.12.3",
+    "fastmcp>=2.12.3,<3",
server.py (4)
91-96: Normalize header keys to avoid case-sensitivity pitfalls

HTTP headers are case-insensitive but sources may provide mixed casing. Normalize once to lower to make lookups robust.

Apply this diff:
-    headers = get_http_headers(include_all=True)
-    token = headers.get("ocm-offline-token")
+    headers = {k.lower(): v for k, v in get_http_headers(include_all=True).items()}
+    token = headers.get("ocm-offline-token")
117-124: Same header normalization for Authorization parsing

Guard against mixed-case header keys.

Apply this diff:
-    headers = get_http_headers(include_all=True)
-    header = headers.get("authorization")
+    headers = {k.lower(): v for k, v in get_http_headers(include_all=True).items()}
+    header = headers.get("authorization")
941-943: Avoid asyncio.run() if there’s a chance this is called within a running loop

If list_tools() is ever called from an async context (not just at startup), asyncio.run() will raise. Consider exposing an async variant or using a loop-aware helper.

Would you prefer an async list_tools_async() and have startup call asyncio.run(list_tools_async()) while other callers can await?

946-963: Harden middleware against missing/None arguments

Defensively handle cases where arguments might not be a dict.

Apply this diff:
 class StripSessionIDMiddleware(Middleware):
@@
-    async def on_call_tool(
+    async def on_call_tool(
         self,
         context: MiddlewareContext,
         call_next: CallNext,
     ) -> ToolResult:
-        """Strip session_id from tool calls"""
-        if "session_id" in context.message.arguments:
-            del context.message.arguments["session_id"]
+        """Strip session_id from tool calls."""
+        args = getattr(context.message, "arguments", None)
+        if isinstance(args, dict) and "session_id" in args:
+            del args["session_id"]
 
         return await call_next(context)
tests/test_server.py (2)
83-90: Drop redundant test or parametrize

test_get_offline_token_no_request duplicates test_get_offline_token_not_found. Consider removing or parametrizing to keep the suite lean.

Apply this diff to remove the redundant test:
-    def test_get_offline_token_no_request(
-        self, mock_mcp_get_http_headers: dict[str, Any]
-    ) -> None:
-        """Test offline token retrieval when no request is available."""
-        with patch.dict(os.environ, {}, clear=True):
-            with pytest.raises(RuntimeError) as exc_info:
-                server.get_offline_token()
-            assert "No offline token found" in str(exc_info.value)
101-115: Assert raise_for_status() is called in access-token tests

You call response.raise_for_status() in production; assert it in tests to catch regressions.

Example change for one test:
-                mock_response = Mock()
+                mock_response = Mock()
+                mock_response.raise_for_status.return_value = None
                 mock_response.json.return_value = {"access_token": "new-token"}
                 mock_post.return_value = mock_response
@@
                 result = server.get_access_token()
                 assert result == "new-token"
+                mock_response.raise_for_status.assert_called_once()
Apply similarly to the other access-token tests mocking requests.post.

Also applies to: 120-128, 129-154, 156-182

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between b196de4 and 572e4e3.

⛔ Files ignored due to path filters (1)

uv.lock is excluded by !**/*.lock

📒 Files selected for processing (3)

pyproject.toml (1 hunks)
server.py (5 hunks)
tests/test_server.py (35 hunks)

🧰 Additional context used

🧬 Code graph analysis (1)

tests/test_server.py (2)

server.py (16)

get_offline_token (69-98)

get_access_token (101-139)

cluster_info (144-173)

list_clusters (178-207)

cluster_events (212-236)

host_events (241-273)

cluster_iso_download_url (278-338)

create_cluster (343-447)

set_cluster_vips (571-617)

install_cluster (622-651)

list_versions (656-672)

list_operator_bundles (677-692)

add_operator_bundle_to_cluster (697-730)

set_host_role (831-878)

cluster_credentials_download_url (735-789)

set_cluster_ssh_key (883-934)

service_client/assisted_service_api.py (4)

list_clusters (146-156)

create_cluster (242-278)

install_cluster (367-382)

add_operator_bundle_to_cluster (419-446)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Red Hat Konflux / assisted-service-mcp-saas-main-on-pull-request

🔇 Additional comments (5)

server.py (3)

19-22: Switch to FastMCP imports looks correct

Using fastmcp.FastMCP, get_http_headers, and middleware types is consistent with the new usage.

36-36: FastMCP initialization is appropriate

mcp = FastMCP("AssistedService") is a clean, minimal init for registering tools.

966-968: Transport selection change: double-check environment support

streamable-http requires the newer FastMCP HTTP transport. Ensure deploy/runtime aligns with this transport.

If needed, I can add a startup self-check endpoint that reports the selected transport and registered tools.

tests/test_server.py (2)

31-36: Header mocking fixture is clean and focused

Patching server.get_http_headers to a shared mapping keeps tests simple and explicit.

12-12: Confirm availability of mcp.types.TextContent at runtime

Tests import TextContent from mcp.types. Ensure your dependency set (via fastmcp>=2.12.3) guarantees mcp is installed in CI/production.

Would you like me to add an import self-check in a small smoke test?

coderabbitai · 2025-09-24T18:40:38Z

+            result = await server.set_host_role.run(
+                {"host_id": host_id, "cluster_id": infraenv_id, "role": role}
+            )

-            assert result == host.to_str()
+            assert cast(TextContent, result.content[0]).text == host.to_str()
            mock_inventory_client.update_host.assert_called_once_with(
                host_id, infraenv_id, host_role=role
            )


⚠️ Potential issue

Incorrect variable passed for cluster_id in set_host_role test

The test passes infraenv_id as cluster_id, weakening coverage. Pass a real cluster_id and assert the list call uses it.

Apply this diff:

@@ - host_id = "test-host-id" - infraenv_id = "test-infraenv-id" + host_id = "test-host-id" + cluster_id = "test-cluster-id" + infraenv_id = "test-infraenv-id" @@ - result = await server.set_host_role.run( - {"host_id": host_id, "cluster_id": infraenv_id, "role": role} - ) + result = await server.set_host_role.run( + {"host_id": host_id, "cluster_id": cluster_id, "role": role} + ) @@ - mock_inventory_client.update_host.assert_called_once_with( + mock_inventory_client.list_infra_envs.assert_called_once_with(cluster_id) + mock_inventory_client.update_host.assert_called_once_with( host_id, infraenv_id, host_role=role )

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

result = await server.set_host_role.run(

{"host_id": host_id, "cluster_id": infraenv_id, "role": role}

)

assert result == host.to_str()

assert cast(TextContent, result.content[0]).text == host.to_str()

mock_inventory_client.update_host.assert_called_once_with(

host_id, infraenv_id, host_role=role

)

host_id = "test-host-id"

cluster_id = "test-cluster-id"

infraenv_id = "test-infraenv-id"

result = await server.set_host_role.run(

{"host_id": host_id, "cluster_id": cluster_id, "role": role}

)

assert cast(TextContent, result.content[0]).text == host.to_str()

mock_inventory_client.list_infra_envs.assert_called_once_with(cluster_id)

mock_inventory_client.update_host.assert_called_once_with(

host_id, infraenv_id, host_role=role

)

🤖 Prompt for AI Agents

In tests/test_server.py around lines 839 to 846, the test incorrectly passes infraenv_id as the cluster_id when calling server.set_host_role.run, reducing test coverage; change the call to pass the real cluster_id variable (e.g., {"host_id": host_id, "cluster_id": cluster_id, "role": role}) and update the mock assertion to expect mock_inventory_client.update_host to be called with (host_id, cluster_id, host_role=role) so the test validates the actual cluster id is used.

Previously we were installing the package but using the official SDK for of FastMCP 1.0. Using FastMCP directly gives us a lot more features should we need them.

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (6)

server.py (4)

91-96: Harden header lookup: handle missing request context and case-insensitive headers

If no HTTP request context exists, get_http_headers() may raise. Also, header keys are case-insensitive; normalize to lowercase before lookup.

Apply:

-    headers = get_http_headers(include_all=True)
-    token = headers.get("ocm-offline-token")
-    if token:
-        log.debug("Found offline token in request headers")
-        return token
+    try:
+        raw_headers = get_http_headers(include_all=True)
+        headers = {k.lower(): v for k, v in (raw_headers or {}).items()}
+    except Exception:
+        headers = {}
+    token = headers.get("ocm-offline-token")
+    if token:
+        log.debug("Found offline token in request headers")
+        return token

117-124: Same here: robust, case-insensitive Authorization parsing

Normalize header names and be resilient to missing request context.

-    headers = get_http_headers(include_all=True)
-    header = headers.get("authorization")
-    if header is not None:
-        parts = header.split()
-        if len(parts) == 2 and parts[0].lower() == "bearer":
-            log.debug("Found access token in authorization header")
-            return parts[1]
+    try:
+        raw_headers = get_http_headers(include_all=True)
+        headers = {k.lower(): v for k, v in (raw_headers or {}).items()}
+    except Exception:
+        headers = {}
+    header = headers.get("authorization")
+    if header:
+        parts = header.split()
+        if len(parts) == 2 and parts[0].lower() == "bearer":
+            log.debug("Found access token in authorization header")
+            return parts[1]

946-963: Middleware: guard when arguments is None/not a dict

Avoids errors if a tool is invoked without an arguments dict.

     async def on_call_tool(
         self,
         context: MiddlewareContext,
         call_next: CallNext,
     ) -> ToolResult:
         """Strip session_id from tool calls"""
-        if "session_id" in context.message.arguments:
-            del context.message.arguments["session_id"]
+        args = context.message.arguments
+        if isinstance(args, dict) and "session_id" in args:
+            del args["session_id"]
 
         return await call_next(context)

966-971: Make server port configurable

Expose port via env for flexibility in containerized deployments.

     mcp.add_middleware(StripSessionIDMiddleware())
     app = mcp.http_app(transport="streamable-http")
 
     initiate_metrics(list_tools())
     app.add_route("/metrics", metrics)
-    uvicorn.run(app, host="0.0.0.0")
+    uvicorn.run(app, host="0.0.0.0", port=int(os.environ.get("PORT", "8000")))

tests/test_server.py (2)

185-197: “no request context” test still uses the header fixture

This test claims no request context, but the fixture patches get_http_headers to return {}, not to simulate absence/exception. Make it realistic by removing the fixture and making get_http_headers raise.

-    def test_get_access_token_no_request_context(
-        self, mock_mcp_get_http_headers: dict[str, Any]
-    ) -> None:
+    def test_get_access_token_no_request_context(self) -> None:
         """Test access token retrieval when no request context is available."""
-        with patch.object(server, "get_offline_token", return_value="offline-token"):
-            with patch("requests.post") as mock_post:
-                mock_response = Mock()
-                mock_response.json.return_value = {"access_token": "new-token"}
-                mock_post.return_value = mock_response
-
-                result = server.get_access_token()
-                assert result == "new-token"
+        with patch.object(server, "get_http_headers", side_effect=RuntimeError("no ctx")):
+            with patch.object(server, "get_offline_token", return_value="offline-token"):
+                with patch("requests.post") as mock_post:
+                    mock_response = Mock()
+                    mock_response.json.return_value = {"access_token": "new-token"}
+                    mock_post.return_value = mock_response
+                    result = server.get_access_token()
+                    assert result == "new-token"

946-970: Add middleware unit coverage (optional)

Consider a small unit test for StripSessionIDMiddleware to assert it drops session_id before tool execution.

I can draft a minimal test that invokes on_call_tool with a stub MiddlewareContext and verifies the forwarded arguments.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 572e4e3 and b9eaebc.

⛔ Files ignored due to path filters (1)

uv.lock is excluded by !**/*.lock

📒 Files selected for processing (3)

pyproject.toml (1 hunks)
server.py (5 hunks)
tests/test_server.py (35 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

pyproject.toml

🧰 Additional context used

🧬 Code graph analysis (1)

tests/test_server.py (2)

server.py (16)

get_offline_token (69-98)

get_access_token (101-139)

cluster_info (144-173)

list_clusters (178-207)

cluster_events (212-236)

host_events (241-273)

cluster_iso_download_url (278-338)

create_cluster (343-447)

set_cluster_vips (571-617)

install_cluster (622-651)

list_versions (656-672)

list_operator_bundles (677-692)

add_operator_bundle_to_cluster (697-730)

set_host_role (831-878)

cluster_credentials_download_url (735-789)

set_cluster_ssh_key (883-934)

service_client/assisted_service_api.py (4)

list_clusters (146-156)

create_cluster (242-278)

install_cluster (367-382)

add_operator_bundle_to_cluster (419-446)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Red Hat Konflux / assisted-service-mcp-saas-main-on-pull-request

🔇 Additional comments (3)

tests/test_server.py (3)
30-37: Nice, focused header-mock fixture

Localizing header manipulation via a patched get_http_headers keeps tests hermetic.

227-233: Accessing tool result via TextContent is correct

The .run(payload) + result.content[0].text usage aligns with the updated FastMCP result shape.

If future changes introduce non-text content, consider a small helper to extract text safely (e.g., assert type == "text"). Do you want a shared helper function for these assertions?

830-837: Incorrect variable passed for cluster_id in set_host_role test

The test passes infraenv_id as cluster_id, weakening coverage. Pass a real cluster_id and assert the list call uses it.

Apply:
@@
-        host_id = "test-host-id"
-        infraenv_id = "test-infraenv-id"
+        host_id = "test-host-id"
+        cluster_id = "test-cluster-id"
+        infraenv_id = "test-infraenv-id"
@@
-            result = await server.set_host_role.run(
-                {"host_id": host_id, "cluster_id": infraenv_id, "role": role}
-            )
+            result = await server.set_host_role.run(
+                {"host_id": host_id, "cluster_id": cluster_id, "role": role}
+            )
@@
+            mock_inventory_client.list_infra_envs.assert_called_once_with(cluster_id)
             mock_inventory_client.update_host.assert_called_once_with(
                 host_id, infraenv_id, host_role=role
             )

openshift-ci · 2025-09-24T19:12:31Z

@keitwb: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
ci/prow/eval-test	`b9eaebc`	link	false	`/test eval-test`

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

openshift-ci Bot requested review from carbonin and eranco74 September 24, 2025 18:32

openshift-ci Bot added approved Indicates a PR has been approved by an approver from all required OWNERS files. size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Sep 24, 2025

coderabbitai Bot reviewed Sep 24, 2025

View reviewed changes

Actually use FastMCP

b9eaebc

Previously we were installing the package but using the official SDK for of FastMCP 1.0. Using FastMCP directly gives us a lot more features should we need them.

keitwb force-pushed the use-fastmcp branch from 572e4e3 to b9eaebc Compare September 24, 2025 18:42

coderabbitai Bot reviewed Sep 24, 2025

View reviewed changes

keitwb closed this Sep 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Actually use FastMCP#110

Actually use FastMCP#110
keitwb wants to merge 1 commit intoopenshift-assisted:masterfrom
keitwb:use-fastmcp

keitwb commented Sep 24, 2025 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Sep 24, 2025 •

edited

Loading

Uh oh!

openshift-ci Bot commented Sep 24, 2025

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot Sep 24, 2025

Uh oh!

coderabbitai Bot left a comment

Uh oh!

openshift-ci Bot commented Sep 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

keitwb commented Sep 24, 2025 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Sep 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

Pre-merge checks and finishing touches

Uh oh!

openshift-ci Bot commented Sep 24, 2025

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

openshift-ci Bot commented Sep 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

keitwb commented Sep 24, 2025 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Sep 24, 2025 •

edited

Loading