Skip to content

Conversation

@can-anyscale
Copy link
Contributor

@can-anyscale can-anyscale commented Oct 27, 2025

Change the unit of scheduler_placement_time from seconds to mili-seconds. The current bucket is in the range of 0.1s to 2.5 hours which doesn't make sense. According to a sample of data, the range we are interested in would be from us to s. Thanks @ZacAttack for pointing this out.

Note: This is an internal (non–public-facing) metric, so we only need to update its usage within Ray (e.g., the dashboard). A simple code change should suffice.
505491038-c5d81017-b86c-406f-acf4-614560752062

Test:

  • CI

@can-anyscale can-anyscale marked this pull request as ready for review October 27, 2025 17:34
@can-anyscale can-anyscale requested a review from a team as a code owner October 27, 2025 17:34
cursor[bot]

This comment was marked as outdated.

@ray-gardener ray-gardener bot added core Issues that should be addressed in Ray Core observability Issues related to the Ray Dashboard, Logging, Metrics, Tracing, and/or Profiling labels Oct 27, 2025
"resolved to when it actually reserves resources on a node to run.",
/*unit=*/"s",
/*unit=*/"ms",
/*boundaries=*/{0.1, 1, 10, 100, 1000, 10000},
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do these seem like reasonable frames? 0.1 and 1ms seem unlikely right? This timer probably encapsulates a few RPC's?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You're right — 0.1 ms is unlikely. Based on the current data, most data points are under 100 ms, but we don’t have much visibility below that range. So, we might want to go two levels deeper, just in case, say, 10 and 1 ms. If there turn out to be no data points below 1 ms, that’s actually helpful information, since it tells us 1 ms is the lower boundary.

The upper bound of 10 s looks good, the data already shows that it’s the upper boundary. There are no data points above 10 s, which in itself is useful information.

Base automatically changed from can-statdie03 to master October 29, 2025 21:26
@can-anyscale can-anyscale added the go add ONLY when ready to merge, run all tests label Oct 29, 2025
@can-anyscale can-anyscale merged commit 50ffca4 into master Nov 7, 2025
6 checks passed
@can-anyscale can-anyscale deleted the can-statdie03-bis branch November 7, 2025 19:59
YoussefEssDS pushed a commit to YoussefEssDS/ray that referenced this pull request Nov 8, 2025
…y-project#58217)

Change the unit of `scheduler_placement_time` from seconds to
mili-seconds. The current bucket is in the range of 0.1s to 2.5 hours
which doesn't make sense. According to a sample of data, the range we
are interested in would be from us to s. Thanks @ZacAttack for pointing
this out.

```
Note: This is an internal (non–public-facing) metric, so we only need to update its usage within Ray (e.g., the dashboard). A simple code change should suffice.
```

<img width="1609" height="421"
alt="505491038-c5d81017-b86c-406f-acf4-614560752062"
src="https://github.com/user-attachments/assets/cc647b97-42ec-42eb-bf01-4d1867940207"
/>

Test:
- CI

Signed-off-by: Cuong Nguyen <[email protected]>
landscapepainter pushed a commit to landscapepainter/ray that referenced this pull request Nov 17, 2025
…y-project#58217)

Change the unit of `scheduler_placement_time` from seconds to
mili-seconds. The current bucket is in the range of 0.1s to 2.5 hours
which doesn't make sense. According to a sample of data, the range we
are interested in would be from us to s. Thanks @ZacAttack for pointing
this out.

```
Note: This is an internal (non–public-facing) metric, so we only need to update its usage within Ray (e.g., the dashboard). A simple code change should suffice.
```

<img width="1609" height="421"
alt="505491038-c5d81017-b86c-406f-acf4-614560752062"
src="https://github.com/user-attachments/assets/cc647b97-42ec-42eb-bf01-4d1867940207"
/>

Test:
- CI

Signed-off-by: Cuong Nguyen <[email protected]>
Aydin-ab pushed a commit to Aydin-ab/ray-aydin that referenced this pull request Nov 19, 2025
…y-project#58217)

Change the unit of `scheduler_placement_time` from seconds to
mili-seconds. The current bucket is in the range of 0.1s to 2.5 hours
which doesn't make sense. According to a sample of data, the range we
are interested in would be from us to s. Thanks @ZacAttack for pointing
this out.

```
Note: This is an internal (non–public-facing) metric, so we only need to update its usage within Ray (e.g., the dashboard). A simple code change should suffice.
```

<img width="1609" height="421"
alt="505491038-c5d81017-b86c-406f-acf4-614560752062"
src="https://github.com/user-attachments/assets/cc647b97-42ec-42eb-bf01-4d1867940207"
/>

Test:
- CI

Signed-off-by: Cuong Nguyen <[email protected]>
Signed-off-by: Aydin Abiar <[email protected]>
ykdojo pushed a commit to ykdojo/ray that referenced this pull request Nov 27, 2025
…y-project#58217)

Change the unit of `scheduler_placement_time` from seconds to
mili-seconds. The current bucket is in the range of 0.1s to 2.5 hours
which doesn't make sense. According to a sample of data, the range we
are interested in would be from us to s. Thanks @ZacAttack for pointing
this out.

```
Note: This is an internal (non–public-facing) metric, so we only need to update its usage within Ray (e.g., the dashboard). A simple code change should suffice.
```

<img width="1609" height="421"
alt="505491038-c5d81017-b86c-406f-acf4-614560752062"
src="https://github.com/user-attachments/assets/cc647b97-42ec-42eb-bf01-4d1867940207"
/>

Test:
- CI

Signed-off-by: Cuong Nguyen <[email protected]>
Signed-off-by: YK <[email protected]>
SheldonTsen pushed a commit to SheldonTsen/ray that referenced this pull request Dec 1, 2025
…y-project#58217)

Change the unit of `scheduler_placement_time` from seconds to
mili-seconds. The current bucket is in the range of 0.1s to 2.5 hours
which doesn't make sense. According to a sample of data, the range we
are interested in would be from us to s. Thanks @ZacAttack for pointing
this out.

```
Note: This is an internal (non–public-facing) metric, so we only need to update its usage within Ray (e.g., the dashboard). A simple code change should suffice.
```

<img width="1609" height="421"
alt="505491038-c5d81017-b86c-406f-acf4-614560752062"
src="https://github.com/user-attachments/assets/cc647b97-42ec-42eb-bf01-4d1867940207"
/>

Test:
- CI

Signed-off-by: Cuong Nguyen <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

core Issues that should be addressed in Ray Core go add ONLY when ready to merge, run all tests observability Issues related to the Ray Dashboard, Logging, Metrics, Tracing, and/or Profiling

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants