:loud_sound: Add metrics to log MM embedding time by gkumbhat · Pull Request #899 · torch-spyre/sendnn-inference

gkumbhat · 2026-04-06T20:31:49Z

Description

Add metric to log mm embedding calculation time.

Related Issues

Test Plan

Checklist

I have read the contributing guidelines
My code follows the project's code style (run bash format.sh)
I have added tests for my changes (if applicable)
I have updated the documentation (if applicable)
My commits include a Signed-off-by: line (DCO compliance)

github-actions · 2026-04-06T20:35:03Z

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, run ./format.sh.
Now you are good to go 🚀.

We also recommend installing prek and configuring it to check your code before every local commit.

Signed-off-by: Gaurav-Kumbhat <Gaurav.Kumbhat@ibm.com>

joerunde · 2026-04-06T20:51:49Z

-            logger.info("maybe_mm_embedding processing time: %.2fms", (t1 * 1000))
+            logger.info("maybe_mm_embedding processing time: %.2fms", (t_elapsed * 1000))
+            self.perf_logger.log(
+                "get_mm_embeddings_time_ms",


Do we need to be publishing a non-standard vllm metric here?

I was trying to find a similar metric from vllm that we could reuse, or push the numbers there. Haven't found one yet..

The idea here is to have a way to measure impact of MM processing.

Open to suggestions, if there is a better and more standard way to deal with this.

@joerunde actually usage of create_perf_metric_logger already handles the optional enablement of this metric. So this will only get printed if we pass VLLM_SPYRE_PERF_METRIC_LOGGING_ENABLED env variable.

joerunde

lpgtm- from talking with @gkumbhat this is meant for development debugging, not for opsviz on deployed models, so we're using the spyre-specific perf logger here

joerunde · 2026-04-14T16:13:31Z

ah shoot GK I forgot to hit merge last week, sorry!

🔊 Add metrics to log MM embedding time

5525708

Signed-off-by: Gaurav-Kumbhat <Gaurav.Kumbhat@ibm.com>

gkumbhat force-pushed the add_mm_log_metrics branch from 0c75c65 to 5525708 Compare April 6, 2026 20:47

joerunde reviewed Apr 6, 2026

View reviewed changes

gkumbhat marked this pull request as ready for review April 7, 2026 22:23

gkumbhat requested review from nikolaospapandreou, sducouedic, tdoublep and yannicks1 as code owners April 7, 2026 22:23

joerunde approved these changes Apr 8, 2026

View reviewed changes

joerunde merged commit ed732b0 into torch-spyre:main Apr 14, 2026
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🔊 Add metrics to log MM embedding time#899

🔊 Add metrics to log MM embedding time#899
joerunde merged 1 commit intotorch-spyre:mainfrom
gkumbhat:add_mm_log_metrics

gkumbhat commented Apr 6, 2026

Uh oh!

github-actions bot commented Apr 6, 2026

Uh oh!

joerunde Apr 6, 2026

Uh oh!

gkumbhat Apr 6, 2026

Uh oh!

gkumbhat Apr 7, 2026

Uh oh!

joerunde Apr 8, 2026

Uh oh!

joerunde left a comment

Uh oh!

joerunde commented Apr 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

gkumbhat commented Apr 6, 2026

Description

Related Issues

Test Plan

Checklist

Uh oh!

github-actions bot commented Apr 6, 2026

Uh oh!

joerunde Apr 6, 2026

Choose a reason for hiding this comment

Uh oh!

gkumbhat Apr 6, 2026

Choose a reason for hiding this comment

Uh oh!

gkumbhat Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

joerunde Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

joerunde left a comment

Choose a reason for hiding this comment

Uh oh!

joerunde commented Apr 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants