fix: improve router.http.request.duration_milliseconds by alepane21 · Pull Request #2564 · wundergraph/cosmo

alepane21 · 2026-02-26T17:24:04Z

When a query plan makes parallel fetches, the metric router.http.request.duration_milliseconds is recorded incorrectly: it reported as latency the time of the slower fetch. With this change I set as duration the timing that I get from the engine for the fetch, that doesn't have this issue.

Summary by CodeRabbit

Tests
- Added an integration test ensuring parallel subgraph request durations are reported to Prometheus.
Bug Fixes
- Improved latency measurement: logging, telemetry and metrics now use fetch timing for more accurate duration reporting.

Checklist

I have discussed my proposed changes in an issue and have received approval to proceed.
I have followed the coding standards of the project.
Tests or benchmarks have been added or updated.
Documentation has been updated on https://github.com/wundergraph/cosmo-docs.
I have read the Contributors Guide.

…ooks

…outer_http_request_duration-and-other-request

…tion-and-other-request

…request_duration-and-other-request' into ale/eng-8915-router-router_http_request_duration-and-other-request

…tion-and-other-request

…outer_http_request_duration-and-other-request

coderabbitai · 2026-02-26T17:24:25Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 3c901ce and 3965823.

📒 Files selected for processing (1)

router/core/engine_loader_hooks.go

Walkthrough

Adds an integration test for Prometheus metrics of parallel subgraph durations and refactors engine loader hooks to use fetch-timing from context instead of a stored start time for latency reporting.

Changes

Cohort / File(s)	Summary
Prometheus Metrics Test `router-tests/prometheus_parallel_subgraph_metrics_test.go`	New integration test `TestPrometheusParallelSubgraphRequestDurationMetrics` that exercises parallel subgraph requests, collects Prometheus metrics, and validates per-subgraph `router_http_request_duration_milliseconds` histogram samples and counts for `employees` and `products`.
Engine Loader Hooks — Latency Refactor `router/core/engine_loader_hooks.go`	Removed `engineLoaderHooksRequestContext` and start-time tracking in `OnLoad`; `OnFinished` now reads fetch timing from `FetchTimingKey` (fetchLatency) and uses it for access logging, telemetry, and latency metrics instead of computing latency from stored start time. Minor control-flow and logging/metric updates to propagate `fetchLatency`.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

feat: add timings per client fetch for GraphQL http #2183 — Modifies engine loader hooks and fetch-timing handling; likely touches the same latency/FetchTimingKey areas.

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately describes the main change: improving the router.http.request.duration_milliseconds metric to correctly capture parallel fetch timings using engine-provided fetch latency instead of incorrectly using the slower fetch duration.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings (stacked PR)
📝 Generate docstrings (commit on current branch)

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

github-actions · 2026-02-26T17:26:26Z

❌ Internal Query Planner CI failed to run.

codecov · 2026-02-26T17:30:20Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 63.52%. Comparing base (d376585) to head (f185aab).
⚠️ Report is 40 commits behind head on main.

Additional details and impacted files

@@             Coverage Diff             @@
##             main    #2564       +/-   ##
===========================================
+ Coverage   45.74%   63.52%   +17.78%     
===========================================
  Files        1035      251      -784     
  Lines      139075    26759   -112316     
  Branches     8631        0     -8631     
===========================================
- Hits        63613    16998    -46615     
+ Misses      73735     8400    -65335     
+ Partials     1727     1361      -366

Files with missing lines	Coverage Δ
router/core/engine_loader_hooks.go	`93.03% <100.00%> (+1.06%)`	⬆️

... and 789 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

…outer_http_request_duration-and-other-request

coderabbitai

🧹 Nitpick comments (1)

router/core/engine_loader_hooks.go (1)
55-57: Remove unused startTime field and engineLoaderHooksRequestContext struct—both are dead code from the refactoring to use FetchTimingKey.

The startTime field is declared but never populated (line 117 creates an empty struct) or accessed. More significantly, the entire engineLoaderHooksRequestContext struct is unused—it's stored in the context at line 117 but never retrieved anywhere in the codebase. The latency measurement now relies entirely on FetchTimingKey (lines 102–103 and 167–173), making this struct and its associated context key EngineLoaderHooksContextKey redundant.
♻️ Proposed cleanup

Remove the struct and its context key registration:
-type engineLoaderHooksRequestContext struct {
-	startTime time.Time
-}
And update line 117:
-	return context.WithValue(ctx, rcontext.EngineLoaderHooksContextKey, &engineLoaderHooksRequestContext{})
+	return ctx
Also remove the EngineLoaderHooksContextKey from router/internal/context/keys.go if it has no other usages.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@router/core/engine_loader_hooks.go` around lines 55 - 57, Remove the dead
engineLoaderHooksRequestContext struct and its associated context key
EngineLoaderHooksContextKey: delete the type declaration for
engineLoaderHooksRequestContext, remove the code that stores an empty instance
into context (the place creating/storing engineLoaderHooksRequestContext in
request context), and remove the EngineLoaderHooksContextKey from
router/internal/context/keys.go; ensure any latency logic continues to use
FetchTimingKey (do not change FetchTimingKey usages) and run a project-wide
search to confirm no remaining references to engineLoaderHooksRequestContext or
EngineLoaderHooksContextKey before committing.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@router/core/engine_loader_hooks.go`:
- Around line 55-57: Remove the dead engineLoaderHooksRequestContext struct and
its associated context key EngineLoaderHooksContextKey: delete the type
declaration for engineLoaderHooksRequestContext, remove the code that stores an
empty instance into context (the place creating/storing
engineLoaderHooksRequestContext in request context), and remove the
EngineLoaderHooksContextKey from router/internal/context/keys.go; ensure any
latency logic continues to use FetchTimingKey (do not change FetchTimingKey
usages) and run a project-wide search to confirm no remaining references to
engineLoaderHooksRequestContext or EngineLoaderHooksContextKey before
committing.

ℹ️ Review info

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 08efda8 and 3c901ce.

📒 Files selected for processing (1)

router/core/engine_loader_hooks.go

…tion-and-other-request

…request_duration-and-other-request' into ale/eng-8915-router-router_http_request_duration-and-other-request

…tion-and-other-request

SkArchon · 2026-03-11T12:22:20Z

 		if fetchTiming, ok := value.(*atomic.Int64); ok {
-			exprCtx.Subgraph.Request.ClientTrace.FetchDuration = time.Duration(fetchTiming.Load())
+			fetchLatency = time.Duration(fetchTiming.Load())
+			exprCtx.Subgraph.Request.ClientTrace.FetchDuration = fetchLatency


This measures the low level round trip, whereas the current measurement should measure this plus the engine hook logic that would add to the request duration.

However more importantly, would it work with GRPC data sources?

Good catch! It would not work with gRPC data sources.

…tion-and-other-request

github-actions · 2026-03-27T05:46:48Z

This PR was marked stale due to lack of activity. It will be closed in 14 days.

…tion-and-other-request

github-actions · 2026-04-18T05:46:44Z

This PR was marked stale due to lack of activity. It will be closed in 14 days.

alepane21 and others added 12 commits February 20, 2026 21:58

refactor(router): improve latency metrics handling in engine loader h…

24ec69c

…ooks

Merge remote-tracking branch 'origin/main' into ale/eng-8915-router-r…

befb1cb

…outer_http_request_duration-and-other-request

fix: ignore different response order

38d3d54

Merge branch 'main' into ale/eng-8915-router-router_http_request_dura…

4e050ea

…tion-and-other-request

Merge branch 'main' into ale/eng-8915-router-router_http_request_dura…

1dca661

…tion-and-other-request

chore: improve readibility

caea193

Merge remote-tracking branch 'origin/ale/eng-8915-router-router_http_…

f0cc218

…request_duration-and-other-request' into ale/eng-8915-router-router_http_request_duration-and-other-request

Merge branch 'main' into ale/eng-8915-router-router_http_request_dura…

8f4baa8

…tion-and-other-request

chore: replace Eq To Jsoneq to avoid flakiness

3928ae1

Merge remote-tracking branch 'origin/main' into ale/eng-8915-router-r…

6a30f1f

…outer_http_request_duration-and-other-request

Merge remote-tracking branch 'origin/main' into ale/eng-8915-router-r…

f162013

…outer_http_request_duration-and-other-request

fix: make test less flaky

08efda8

github-actions Bot added the router label Feb 26, 2026

alepane21 added 3 commits March 2, 2026 12:02

Merge remote-tracking branch 'origin/main' into ale/eng-8915-router-r…

6823e5e

…outer_http_request_duration-and-other-request

chore: undo changes not needed anymore

fc2df50

chore: remove start var calculation

3c901ce

coderabbitai Bot reviewed Mar 2, 2026

View reviewed changes

alepane21 changed the title ~~Ale/eng 8915 router router http request duration and other request~~ fix: improve router.http.request.duration_milliseconds Mar 2, 2026

alepane21 and others added 5 commits March 2, 2026 13:22

Merge branch 'main' into ale/eng-8915-router-router_http_request_dura…

3cd8de9

…tion-and-other-request

fix: remove unused field

dc7fa25

Merge remote-tracking branch 'origin/ale/eng-8915-router-router_http_…

c59e8d1

…request_duration-and-other-request' into ale/eng-8915-router-router_http_request_duration-and-other-request

chore: remove unused context type

3965823

Merge branch 'main' into ale/eng-8915-router-router_http_request_dura…

317cb0b

…tion-and-other-request

alepane21 marked this pull request as ready for review March 3, 2026 17:23

alepane21 requested review from Noroth, StarpTech and devsergiy as code owners March 3, 2026 17:23

alepane21 requested review from endigma and jensneuse as code owners March 3, 2026 17:23

Merge branch 'main' into ale/eng-8915-router-router_http_request_dura…

d4266d8

…tion-and-other-request

SkArchon reviewed Mar 11, 2026

View reviewed changes

alepane21 added 2 commits March 12, 2026 11:35

Merge branch 'main' into ale/eng-8915-router-router_http_request_dura…

c413eb3

…tion-and-other-request

Merge branch 'main' into ale/eng-8915-router-router_http_request_dura…

d4f717d

…tion-and-other-request

github-actions Bot added the Stale label Mar 27, 2026

alepane21 and others added 4 commits April 3, 2026 14:19

Merge branch 'main' into ale/eng-8915-router-router_http_request_dura…

b874625

…tion-and-other-request

feat: use latency coming up from engine

60ab2ba

chore: remove local engine

3fa19c4

fix: remove wrong file

f185aab

github-actions Bot removed the Stale label Apr 4, 2026

github-actions Bot added the Stale label Apr 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: improve router.http.request.duration_milliseconds#2564

fix: improve router.http.request.duration_milliseconds#2564
alepane21 wants to merge 27 commits intomainfrom
ale/eng-8915-router-router_http_request_duration-and-other-request

alepane21 commented Feb 26, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Feb 26, 2026 •

edited

Loading

❌ Failed checks (1 warning)

Uh oh!

github-actions Bot commented Feb 26, 2026 •

edited

Loading

Uh oh!

codecov Bot commented Feb 26, 2026 •

edited

Loading

Uh oh!

coderabbitai Bot left a comment

Uh oh!

SkArchon Mar 11, 2026

Uh oh!

alepane21 Mar 12, 2026

Uh oh!

github-actions Bot commented Mar 27, 2026

Uh oh!

github-actions Bot commented Apr 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

alepane21 commented Feb 26, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Checklist

Uh oh!

coderabbitai Bot commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

❌ Failed checks (1 warning)

Uh oh!

github-actions Bot commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov Bot commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

SkArchon Mar 11, 2026

Choose a reason for hiding this comment

Uh oh!

alepane21 Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Mar 27, 2026

Uh oh!

github-actions Bot commented Apr 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

alepane21 commented Feb 26, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Feb 26, 2026 •

edited

Loading

github-actions Bot commented Feb 26, 2026 •

edited

Loading

codecov Bot commented Feb 26, 2026 •

edited

Loading