Skip to content

Commit 4faf885

Browse files
committed
Merge branch 'release/1.3' of github.com:NVIDIA/NeMo-Agent-Toolkit into david-fe-simple-calc-custom-e2e
Signed-off-by: David Gardner <[email protected]>
2 parents 2d9f22f + c17732d commit 4faf885

File tree

3 files changed

+27
-5
lines changed

3 files changed

+27
-5
lines changed

.gitlab-ci.yml

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -42,6 +42,7 @@ variables:
4242
NAT_CI_ETCD_HOST: "etcd"
4343
NAT_CI_MILVUS_HOST: "milvus"
4444
NAT_CI_MYSQL_HOST: "mysql"
45+
NAT_CI_OPENSEARCH_URL: "http://opensearch:9200"
4546
NAT_CI_REDIS_HOST: "redis"
4647
NAT_CI_S3_HOST: "minio"
4748
UV_CACHE_DIR: .uv-cache
@@ -94,6 +95,11 @@ test:python_tests:
9495
ETCD_ENDPOINTS: etcd:2379
9596
MINIO_ADDRESS: minio:9000
9697
command: ["milvus", "run", "standalone"]
98+
- name: opensearchproject/opensearch:2.11.1
99+
alias: opensearch
100+
variables:
101+
discovery.type: "single-node"
102+
plugins.security.disabled: "true"
97103

98104
script:
99105
- echo "Running tests"

docs/source/workflows/llms/using-local-llms.md

Lines changed: 7 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -32,10 +32,11 @@ Regardless of the model you choose, the process is the same for downloading the
3232

3333
### Install the Simple Web Query Example
3434

35-
First, ensure the current working directory is the root of the NeMo Agent toolkit repository. Then, install the simple web query example so we have the `webpage_query` tool available.
35+
First, ensure the current working directory is the root of the NeMo Agent toolkit repository. Then, install NAT and the simple web query example.
3636

3737
```bash
38-
pip install -e examples/getting_started/simple_web_query
38+
uv pip install -e .
39+
uv pip install -e examples/getting_started/simple_web_query
3940
```
4041

4142
### Downloading the NIM Containers
@@ -112,7 +113,7 @@ llms:
112113
nim_llm:
113114
_type: nim
114115
base_url: "http://localhost:8000/v1"
115-
model_name: nvidia/llama3.1-nemotron-nano-4b-v1.1
116+
model_name: nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1
116117

117118
embedders:
118119
nv-embedqa-e5-v5:
@@ -142,10 +143,11 @@ vLLM provides an [OpenAI-Compatible Server](https://docs.vllm.ai/en/latest/getti
142143

143144
### Install the Simple Web Query Example
144145

145-
First, ensure the current working directory is the root of the NeMo Agent toolkit repository. Then, install the simple web query example so we have the `webpage_query` tool available.
146+
First, ensure the current working directory is the root of the NeMo Agent toolkit repository. Then, install NAT and the simple web query example.
146147

147148
```bash
148-
pip install -e examples/getting_started/simple_web_query
149+
uv pip install -e .
150+
uv pip install -e examples/getting_started/simple_web_query
149151
```
150152

151153
### Serving the Models

docs/source/workflows/sizing-calc.md

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -21,6 +21,16 @@ The NVIDIA NeMo Agent toolkit provides a sizing calculator to estimate the GPU c
2121

2222
The sizing calculator uses the [evaluation](evaluate.md) and [profiling](./profiler.md) systems in the NeMo Agent toolkit.
2323

24+
## Prerequisites
25+
Sizing calculator uses the profiler subsystem. Ensure that it is installed by running the following command:
26+
```bash
27+
uv pip install -e ".[profiling]"
28+
```
29+
If you are installing from a package, you need to install the `nvidia-nat[profiling]` package by running the following command:
30+
```bash
31+
uv pip install "nvidia-nat[profiling]"
32+
```
33+
2434
## Overview
2535

2636
This guide assumes that you have an LLM hosted by an isolated GPU cluster, for which you want to perform the sizing calculations for.
@@ -40,6 +50,10 @@ mkdir -p ${CALC_OUTPUT_DIR}
4050
4151
cp examples/evaluation_and_profiling/simple_calculator_eval/configs/config-sizing-calc.yml $CONFIG_FILE
4252
```
53+
Install the simple calculator example:
54+
```bash
55+
uv pip install -e examples/evaluation_and_profiling/simple_calculator_eval
56+
```
4357

4458
Edit `.tmp/sizing_calc/config-sizing-calc.yml` file by adding a `base_url` parameter for the `llms.nim_llm` section for your cluster. Then, if needed, change the `llms.nim_llm.model_name`.
4559

0 commit comments

Comments
 (0)