Skip to content

Conversation

@robertfernandez-vast
Copy link
Contributor

@robertfernandez-vast robertfernandez-vast commented Oct 30, 2025

There were some new implementation details regarding how Serverless interacts with suspended endpoints that now need to be captured in the documentation. The following properties are now true for suspended endpoints:

When an Endpoint is suspended:

  • The Serverless System will no longer manage the GPU instances contained within the Endpoint.
  • GPU instances in this Endpoint will still be able to receive requests.

There are also inaccuracies with the language of the Stopping an Endpoint section. I updated the language so that the stopping behavior is accurate and the billing description of inactive GPU instances is consistent with the rest of the documentation.

Stopping an Endpoint will pause the recruitment of GPU instances, and put the existing instances into the "Stopped" state, preventing any work from being sent to
the Endpoint group. The instances will still charge the small storage cost, but active rental and bandwidth costs will not be charged to the user account. 
Stopping an Endpoint will:
- Cause the **Serverless System** to no longer manage the GPU instances contained within the Endpoint.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  • The Serverless Engine will no longer manage the GPU instances contained within the Endpoint.

Just so it matches above

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will make the change. Just for my understanding, is there a difference between the Serverless system and Serverless Engine? In the Serverless overview page, the language is specifically calling out the Serverless system.

Image

Also on the System Architecture page, it shows that Vast.ai Serverless includes:

  • Serverless System
  • GPU Instances
  • User (Client Application)
Image

Is there a difference between saying Serverless system and saying Serverless Engine. They appear to stated as synonymous in the documentation.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was using Serverless system a while ago until I realized Serverless Engine sounds cooler. All systems should be turned into engines.

Comment on lines 52 to 56
## Destroying GPU instances
Before destroying a GPU instance, move any data off of that GPU instance. For more help on how to do this:
- [Data Movement](https://docs.vast.ai/documentation/instances/storage/data-movement)
- [Cloud Sync](https://docs.vast.ai/documentation/instances/cloud-sync)
- [CLI Commands](https://docs.vast.ai/cli/commands)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should delete this section.

Copy link
Contributor Author

@robertfernandez-vast robertfernandez-vast Oct 31, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would it be appropriate to link to the Managing Instances page? I want to make sure we do our due diligence and make it abundantly clear that it is the user's responsibility to manage inactive GPU instances and GPU instances contained within suspended Endpoint groups.

Comment on lines 59 to 63
To destroy a GPU instance within the **serverless GUI**:
1) Select the Endpoint group tab of the desired Endpoint group.
2) Find the row of the desired GPU instance.
3) Click on the **🛈** button.
4) Click on **Destroy your instance** (trash can button).
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should delete this section

@Colter-Downing
Copy link
Contributor

We should also think if the Pricing page is the right spot for detailing Endpoint States.

@robertfernandez-vast
Copy link
Contributor Author

Member

I will talk to @guthrie-vast about this and see how to best do this.

Copy link
Contributor

@Colter-Downing Colter-Downing left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

just a couple small things

…nds capacity updates to suspended Endpoint groups.
@robertfernandez-vast robertfernandez-vast force-pushed the AUTO-754--Correct_Suspend_and_Stopped_endpoint_state_docs branch from 74b20f0 to 1493d46 Compare November 4, 2025 22:22
@robertfernandez-vast robertfernandez-vast merged commit 87b6de6 into main Nov 4, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants