Allow up to 254 vCPUs to a VM #9385

iximeow · 2025-11-11T20:21:12Z

This follows on turning the crank to max vCPUs in Helios and Propolis; if the hardware has so many vCPUs available, what's to stop someone from allocating them all for a single VM?

Similar to creating a VM requiring more memory than is available, one can create (or resize) a VM into a size that is much larger than any hardware has, or is available at runtime. Attempting to run such an instance will error because the instance can't get placed.

One could imagine a future operator control to limit max VM sizes for a silo; larger VMs get more difficult to migrate, can be more difficult to place. Without something like "anti-fragmentation" to group smaller VMs together it's quite possible that a sled could have 255 CPUs, 2 vCPUs for one small VM, 253 CPUs not spoken for, and unable to fit a 254 vCPU VM.

Further, 254 busy vCPUs leaves zero to one CPUs available for Propolis, driving emulated hardware, processing I/O, co-located Crucible, sled-agent, other services, etc. There is no mechanism to earmark CPUs for control plane and I/O purposes, so this isn't any worse than the status quo. But when such a mechanism comes to exist, we'll need to gracefully tolerate prior existence of sled-or-larger-size VMs.

Note that Helios is fine with being asked to oversubscribe hardware threads to vCPUs, and that's how I'd tested that a 254-vCPU VM works reasonably (on a 32-thread CPU). test_cannot_provision_instance_beyond_cpu_capacity is the demonstration that the control plane isn't willing to oversubscribe hardware in practice.

(Dan pointed out to me a bit ago that we could allow 255 vCPUs - my choice of 254 on the Helios side was really a fencepost error on my part. But I'd like to disallow odd vCPU counts in the first place, related to Propolis#940, so 254 is fine.)

This follows on turning the crank to max vCPUs in Helios and Propolis; if the hardware has so many vCPUs available, what's to stop someone from allocating them all for a single VM? Similar to creating a VM requiring more memory than is available, one can create (or resize) a VM into a size that is much larger than any hardware has, or is available at runtime. Attempting to run such an instance will error because the instance can't get placed. One could imagine a future operator control to limit max VM sizes for a silo; larger VMs get more difficult to migrate, can be more difficult to place. Without something like "anti-fragmentation" to group smaller VMs together it's quite possible that a sled could have 255 CPUs, 2 vCPUs for one small VM, 253 CPUs not spoken for, and unable to fit a 254 vCPU VM. Further, 254 busy vCPUs leaves zero to one CPUs available for Propolis, driving emulated hardware, processing I/O, co-located Crucible, sled-agent, other services, etc. There is no mechanism to earmark CPUs for control plane and I/O purposes, so this isn't any worse than the status quo. But when such a mechanism comes to exist, we'll need to gracefully tolerate prior existence of sled-or-larger-size VMs.

iximeow added virtualization Propolis Integration & VM Management release notes reminder to include this in the release notes labels Nov 11, 2025

karencfv approved these changes Nov 13, 2025

View reviewed changes

david-crespo mentioned this pull request Nov 13, 2025

Bump max CPUs per VM to 254 oxidecomputer/console#2972

Open

iximeow merged commit 2f7f807 into main Nov 13, 2025
16 checks passed

iximeow deleted the ixi/max_vcpu_crank_turn branch November 13, 2025 21:55

iximeow mentioned this pull request Nov 13, 2025

Controls for available CPU platforms #9078

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow up to 254 vCPUs to a VM #9385

Allow up to 254 vCPUs to a VM #9385

Uh oh!

iximeow commented Nov 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Allow up to 254 vCPUs to a VM #9385

Allow up to 254 vCPUs to a VM #9385

Uh oh!

Conversation

iximeow commented Nov 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants