Merged
1 change: 1 addition & 0 deletions docs/components/backends/llm/README.md
1 change: 1 addition & 0 deletions docs/components/backends/sglang/docs/multinode-examples.md
1 change: 1 addition & 0 deletions docs/components/backends/trtllm/README.md
1 change: 1 addition & 0 deletions docs/components/backends/vllm/README.md
1 change: 1 addition & 0 deletions docs/deploy/metrics/docker-compose.yml
1 change: 1 addition & 0 deletions docs/examples/runtime/hello_world/README.md
1 change: 1 addition & 0 deletions docs/guides/dynamo_deploy/operator_deployment.md
2 changes: 1 addition & 1 deletion docs/guides/dynamo_deploy/quickstart.md
@@ -67,7 +67,7 @@ Ensure you have the source code checked out and are in the `dynamo` directory:

### Set Environment Variables

Our examples use the [`nvcr.io`](nvcr.io/nvidia/ai-dynamo/) but you can setup your own values if you use another docker registry.
Our examples use the [`nvcr.io`](https://nvcr.io/nvidia/ai-dynamo/) but you can setup your own values if you use another docker registry.

```bash
export NAMESPACE=dynamo-cloud # or whatever you prefer.
34 changes: 17 additions & 17 deletions docs/index.rst
@@ -45,26 +45,26 @@ The examples below assume you build the latest image yourself from source. If us
:margin: 0
:padding: 3 4 0 0

.. grid-item-card:: :doc:`Hello World </examples/hello_world>`
:link: /examples/hello_world
.. grid-item-card:: :doc:`Hello World <examples/runtime/hello_world/README>`
:link: examples/runtime/hello_world/README
:link-type: doc

Demonstrates the basic concepts of Dynamo by creating a simple multi-service pipeline.
Demonstrates the basic concepts of Dynamo by creating a simple GPU-unaware graph

.. grid-item-card:: :doc:`LLM Deployment </examples/llm_deployment>`
:link: /examples/llm_deployment
.. grid-item-card:: :doc:`LLM Serving with VLLM <components/backends/vllm/README>`
:link: components/backends/vllm/README
:link-type: doc

Presents examples and reference implementations for deploying Large Language Models (LLMs) in various configurations.
Presents examples and reference implementations for deploying Large Language Models (LLMs) in various configurations with VLLM.

.. grid-item-card:: :doc:`Multinode </examples/multinode>`
:link: /examples/multinode
.. grid-item-card:: :doc:`Multinode with SGLang <components/backends/sglang/docs/multinode-examples>`
:link: components/backends/sglang/docs/multinode-examples
:link-type: doc

Demonstrates deployment for disaggregated serving on 3 nodes using `nvidia/Llama-3.1-405B-Instruct-FP8`.
Demonstrates disaggregated serving on several nodes.

.. grid-item-card:: :doc:`TensorRT-LLM </examples/trtllm>`
:link: /examples/trtllm
.. grid-item-card:: :doc:`TensorRT-LLM <components/backends/trtllm/README>`
:link: components/backends/trtllm/README
:link-type: doc

Presents TensorRT-LLM examples and reference implementations for deploying Large Language Models (LLMs) in various configurations.
@@ -110,7 +110,7 @@ The examples below assume you build the latest image yourself from source. If us

Dynamo Deploy Quickstart <guides/dynamo_deploy/quickstart.md>
Dynamo Cloud Kubernetes Platform <guides/dynamo_deploy/dynamo_cloud.md>
Manual Helm Deployment <guides/dynamo_deploy/manual_helm_deployment.md>
Manual Helm Deployment <deploy/helm/README.md>
GKE Setup Guide <guides/dynamo_deploy/gke_setup.md>
Minikube Setup Guide <guides/dynamo_deploy/minikube.md>
Model Caching with Fluid <guides/dynamo_deploy/model_caching_with_fluid.md>
@@ -126,22 +126,22 @@ The examples below assume you build the latest image yourself from source. If us
:hidden:
:caption: API

Python API <API/python_bindings.md>
NIXL Connect API <API/nixl_connect/README.md>

.. toctree::
:hidden:
:caption: Examples

Aggregated and Disaggregated Deployment <examples/disagg_skeleton.md>
LLM Deployment Examples <examples/llm_deployment.md>
Multinode Examples <examples/multinode.md>
LLM Deployment Examples using TensorRT-LLM <examples/trtllm.md>
Hello World <examples/runtime/hello_world/README.md>
LLM Deployment Examples using VLLM <components/backends/vllm/README.md>
Multinode Examples using SGLang <components/backends/sglang/docs/multinode-examples.md>
LLM Deployment Examples using TensorRT-LLM <components/backends/trtllm/README.md>

.. toctree::
:hidden:
:caption: Reference


Glossary <dynamo_glossary.md>
KVBM Reading <architecture/kvbm_reading.md>
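
The toctree entries touched in this diff point at relocated files (for example `components/backends/vllm/README.md`), and a stale path is easy to miss in review. The following is a minimal sketch of a pre-build check, assuming entries use the `Title <path>` form shown above and that paths are relative to the docs root. The helper names `extract_targets` and `missing_targets` are hypothetical; they are not part of Dynamo or Sphinx.

```python
import re
from pathlib import Path

# Matches toctree-style entries such as "Glossary <dynamo_glossary.md>".
# Assumption: one entry per line, with the path enclosed in angle brackets.
ENTRY_RE = re.compile(r"^\s*[^<>\n]+<([^<>\s]+)>\s*$")


def extract_targets(rst_text):
    """Return the <path> part of every 'Title <path>' line in rst_text."""
    targets = []
    for line in rst_text.splitlines():
        match = ENTRY_RE.match(line)
        if match:
            targets.append(match.group(1))
    return targets


def missing_targets(rst_text, docs_root):
    """Return the targets that do not exist as files under docs_root."""
    root = Path(docs_root)
    missing = []
    for target in extract_targets(rst_text):
        # Entries may omit the extension, so try the path as written,
        # then with ".md" and ".rst" appended.
        candidates = [root / target, root / (target + ".md"), root / (target + ".rst")]
        if not any(candidate.is_file() for candidate in candidates):
            missing.append(target)
    return missing
```

Calling `missing_targets(Path("docs/index.rst").read_text(), "docs")` from the repository root would list any entries whose files cannot be found. Option lines such as `:hidden:` and directive lines are ignored because they do not end in a `<path>` group.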
