
Getting Error when trying to run holmes command line #148

Open
bhanupratapm02 opened this issue Oct 1, 2024 · 1 comment
bhanupratapm02 commented Oct 1, 2024

Hi,

I just started learning, and I was able to install HolmesGPT using the Cutting Edge option (using pip3 instead of pip):

pip3 install "https://github.com/robusta-dev/holmesgpt/archive/refs/heads/master.zip"

holmes version
HEAD -> master-2d2a5b3

I am trying to run a command but am getting the error below. Any insight into this would be very helpful.

export OPENAI_API_BASE=https://generativelanguage.googleapis.com/v1beta/models
holmes ask --api-key="apikey" --model=gemini-1.5-flash-latest "explain what is AI"
and I tried the option below as well:
holmes ask --api-key="apikey" --model=openai/gemini-1.5-flash-latest "explain what is AI"

But I am getting this:

verbosity is Verbosity.NORMAL                                      main.py:77
User: explain what is AI
╭─────────────── Traceback (most recent call last) ────────────────╮
│ /Users/Library/Python/3.9/lib/python/site-packages/holmes/main.py:266 in ask
│
│   263 │   │   prompt += f"\n\nAttached file '{path.absolute()}':\n{f.read()}"
│   264 │   │   console.print(f"[bold yellow]Loading file {path}[/bold yellow]")
│   265 │
│ ❱ 266 │   response = ai.call(system_prompt, prompt, post_processing_prompt)
│   267 │
│   268 │   if json_output_file:
│   269 │   │   write_json_file(json_output_file, response.model_dump())
│
│ /Users/Library/Python/3.9/lib/python/site-packages/holmes/core/tool_calling_llm.py:120 in call
│
│   117 │   │   │   tool_choice = NOT_GIVEN if tools == NOT_GIVEN else "auto"
│   118 │   │   │
│   119 │   │   │   total_tokens = self.count_tokens_for_message(messages)
│ ❱ 120 │   │   │   max_context_size = self.get_context_window_size()
│   121 │   │   │   maximum_output_token = self.get_maximum_output_token()
│   122 │   │   │
│   123 │   │   │   if (total_tokens + maximum_output_token) > max_context_size:
│
│ /Users/Library/Python/3.9/lib/python/site-packages/holmes/core/tool_calling_llm.py:89 in get_context_window_size
│
│    86 │   │   #if not litellm.supports_function_calling(model=model):
│    87 │   │   #    raise Exception(f"model {model} does not support function calling. You must
│    88 │   def get_context_window_size(self) -> int:
│ ❱  89 │   │   return litellm.model_cost[self.model]['max_input_tokens']
│    90 │
│    91 │   def count_tokens_for_message(self, messages: list[dict]) -> int:
│    92 │   │   return litellm.token_counter(model=self.model,
╰───────────────────────────────────────────────────────────────────╯
KeyError: 'openai/gemini-1.5-flash-latest'
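
The KeyError means the model string passed on the command line is not a key in LiteLLM's cost map, which is what get_context_window_size looks up. A minimal sketch to list the Gemini entries the installed litellm actually recognizes (assuming the same litellm package shown in the traceback):

import litellm

# litellm.model_cost maps model names to context-window and cost
# metadata; the lookup above fails because the --model string is not
# one of these keys.
for name in sorted(litellm.model_cost):
    if "gemini" in name:
        print(name)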
aantn (Contributor) commented Oct 6, 2024

Hi @bhanupratapm02, does it work with --model=gemini/gemini-1.5-flash-latest? Or with --model=gemini/gemini-pro?

I have not used Gemini myself, but we use LiteLLM to interact with models, and their docs imply that is the way to go - see https://docs.litellm.ai/docs/providers/gemini
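
For reference, a minimal LiteLLM sketch using the provider-prefixed model name (assuming a Google AI Studio key exported as GEMINI_API_KEY, the variable the linked docs describe):

import litellm

# The "gemini/" prefix routes the request to Google's Gemini provider
# rather than the OpenAI-compatible one, so no OPENAI_API_BASE is needed.
response = litellm.completion(
    model="gemini/gemini-1.5-flash-latest",
    messages=[{"role": "user", "content": "explain what is AI"}],
)
print(response.choices[0].message.content)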
