
Add support for other LLM services #29

Closed
yaroslavyaroslav opened this issue Oct 6, 2023 · 19 comments · Fixed by #40
Labels: enhancement (New feature or request)

Comments

@yaroslavyaroslav
Owner

yaroslavyaroslav commented Oct 6, 2023

A few competing services have released their APIs just recently.

@yaroslavyaroslav changed the title from "Add others services LLM support" to "Add support for other LLM services" Oct 6, 2023
@yaroslavyaroslav pinned this issue Oct 6, 2023
@yigitkonur

Prompt engineers need a dedicated IDE, or someone like you needs to take the initiative and bring these features to Sublime Text through a plugin, which would be truly wonderful for all of us. I hope you never lose your motivation for this project; I will become a sponsor as soon as possible. I sincerely thank you for your contribution.

@yaroslavyaroslav added the enhancement label Oct 10, 2023
@ishaan-jaff

Hi @yaroslavyaroslav @yigitkonur - I believe we can make this easier.
I'm the maintainer of LiteLLM; it lets you deploy an LLM proxy to call 100+ LLMs in one format (PaLM, Bedrock, OpenAI, Anthropic, etc.): https://github.com/BerriAI/litellm/tree/main/openai-proxy.

If this looks useful (we're used in production), please let me know how we can help.

Usage

PaLM request

curl http://0.0.0.0:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
     "model": "palm/chat-bison",
     "messages": [{"role": "user", "content": "Say this is a test!"}],
     "temperature": 0.7
   }'

gpt-3.5-turbo request

curl http://0.0.0.0:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
     "model": "gpt-3.5-turbo",
     "messages": [{"role": "user", "content": "Say this is a test!"}],
     "temperature": 0.7
   }'

claude-2 request

curl http://0.0.0.0:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
     "model": "claude-2",
     "messages": [{"role": "user", "content": "Say this is a test!"}],
     "temperature": 0.7
   }'

@yaroslavyaroslav
Owner Author

@ishaan-jaff wow, thanks for highlighting this. Indeed, this could be a simpler solution than implementing all of those myself. There are some caveats on the Sublime Text side, though, e.g. the list of dependencies we can rely on within plugin code is quite limited. That said, I've seen plugins that rely on a completely third-party solution, like running Node.js code, so this should be solvable.

Just a few questions that I hope will save some time on both sides:

  1. Is it fully cross-platform when run outside a container, or are there pitfalls to overcome to make it work on Windows, Linux, and macOS?
  2. Do I understand correctly that this is essentially a local server that manages all the networking on its own based on the content of the requests it receives, i.e. the model field? I took a quick look at the docs; I just want to confirm this point specifically.

@ishaan-jaff

  • Yes, the proxy is cross-platform.
  • Yes, it's a proxy server that allows you to call all LLMs in one format. You can choose to deploy it or run it locally.
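
For illustration, here is a minimal sketch of how a Sublime Text plugin could call such a locally running proxy with nothing but Python's standard library (no extra dependencies); the host, port, and model name are assumptions carried over from the curl examples above, not settings from this plugin.

```python
import json
from http.client import HTTPConnection

def chat_via_proxy(model: str, prompt: str) -> str:
    # Assumes the proxy is already running on localhost:8000, as in the curl examples
    conn = HTTPConnection("localhost", 8000)
    payload = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.7,
    })
    conn.request(
        "POST",
        "/v1/chat/completions",
        body=payload,
        headers={"Content-Type": "application/json"},
    )
    response = json.loads(conn.getresponse().read())
    conn.close()
    # OpenAI-style responses keep the text under choices[0].message.content
    return response["choices"][0]["message"]["content"]

print(chat_via_proxy("gpt-3.5-turbo", "Say this is a test!"))
```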

@yaroslavyaroslav
Owner Author

@ishaan-jaff Thanks, I'll look into it in depth once I get closer to implementing this one.

@james2doyle

It would be cool if this plugin supported Ollama. You can run it locally as a standalone server, and make API calls to it: https://github.com/jmorganca/ollama/blob/main/docs/api.md
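
For reference, a minimal sketch of calling Ollama's local API with the standard library only; it assumes Ollama is running on its default port 11434 and that the example model (codellama here) has already been pulled with `ollama pull codellama`.

```python
import json
from http.client import HTTPConnection

def ollama_generate(model: str, prompt: str) -> str:
    # Ollama listens on localhost:11434 by default
    conn = HTTPConnection("localhost", 11434)
    payload = json.dumps({"model": model, "prompt": prompt, "stream": False})
    conn.request("POST", "/api/generate", body=payload,
                 headers={"Content-Type": "application/json"})
    result = json.loads(conn.getresponse().read())
    conn.close()
    # With "stream": false the generated text comes back in a single "response" field
    return result["response"]

print(ollama_generate("codellama", "Write a hello world in Python"))
```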

@ishaan-jaff

Hi @james2doyle, litellm already supports Ollama.

@james2doyle

@ishaan-jaff Oh nice. I misunderstood what litellm was; I thought it was a hosted service.

@ishaan-jaff

No worries. litellm is a Python package for calling 100+ LLMs in the same I/O format. We also offer a proxy server if you don't want to make code changes to your app.
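
As a rough sketch of what that unified interface looks like in code (the model names and the local Ollama endpoint below are examples, not recommendations from this thread):

```python
# litellm exposes one completion() call for many providers; responses follow the OpenAI schema.
from litellm import completion

messages = [{"role": "user", "content": "Say this is a test!"}]

# Hosted model: reads OPENAI_API_KEY from the environment
openai_resp = completion(model="gpt-3.5-turbo", messages=messages)

# Local model served by Ollama: point api_base at the local server
ollama_resp = completion(
    model="ollama/codellama",
    messages=messages,
    api_base="http://localhost:11434",
)

print(openai_resp["choices"][0]["message"]["content"])
print(ollama_resp["choices"][0]["message"]["content"])
```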

@yaroslavyaroslav
Owner Author

yaroslavyaroslav commented Nov 3, 2023

FYI: for now there's no way to use arbitrary dependencies within an ST plugin, but TIL that the Package Control 4.0 beta allows exactly that; judging by its current state, I believe it will be released within a quarter or so.

So all work on this task will start right after that release, along with some other missing features, like precise token counting.

UPD: I found the PC 4.0 beta project, and I now believe it's quite far from being released in the next quarter.

@yaroslavyaroslav
Owner Author

The good news is that PC 4.0.0 has just been released, which means it will soon be possible to add support for custom Python libraries in the package. We're not there yet, because packagecontrol.io itself still doesn't fully support the 4.0.0 scheme (e.g. arbitrary libraries as dependencies), but I believe it will take about a month or two to make that happen.

@rubjo

rubjo commented Feb 22, 2024

Any news on this? For now I'm using https://github.com/icebaker/nano-bots-api via https://github.com/icebaker/sublime-nano-bots to talk to Ollama / Mistral locally. That works, but I'd like to see whether something built on this plugin could be better.

@yaroslavyaroslav
Owner Author

Unfortunately not. Every time I've tried a local LLM as an assistant for my language of interest, the suggestion quality was way below GPT-4. More than that, I'm observing the same picture with all the competing services, such as Perplexity.

So honestly, I have no plans to implement this until things change.

I'm keeping an eye on the latest Bard/Gemini 1M context window, though. Maybe it'll be worth it.

@Aiq0
Contributor

Aiq0 commented Feb 25, 2024

Any news on this? For now I'm using https://github.com/icebaker/nano-bots-api via https://github.com/icebaker/sublime-nano-bots to talk to Ollama / Mistral locally. That works, but I'd like to see whether something built on this plugin could be better.

I was able to get Ollama working with a few small changes, since the Ollama API is compatible with the OpenAI API:

  • install the package manually, so you can make changes to it
  • change the request in openai_network_client.py to use self.connection = HTTPConnection('localhost:11434') (initially written here as HTTPClient, which was a typo) instead of the logic present there
  • add a dummy token "ollama-dummy-longer-than-10-characters" (or remove the token check)
  • change the models in assistants:
"assistants": [
	{
		"assistant_role": "Apply the change requested by the user to the code with respect to senior knowledge of programming",
		"chat_model": "codellama",
		"max_tokens": 4000,
		"name": "Replace",
		"prompt_mode": "replace"
	},
	{
		"assistant_role": "Insert code or whatever user will request with the following command instead of placeholder with respect to senior knowledge of programming",
		"chat_model": "codellama",
		"max_tokens": 4000,
		"name": "Insert",
		"prompt_mode": "insert",
		"placeholder": "## placeholder"
	},
	{
		"assistant_role": "Append code or whatever user will request with the following command instead of placeholder with respect to senior knowledge of programming",
		"chat_model": "codellama",
		"max_tokens": 4000,
		"name": "Append",
		"prompt_mode": "append"
	},
]

So it would work nicely if there were config options to:

  • toggle between HTTP and HTTPS
  • change the URL
  • not require a token when the URL is not api.openai.com (or just advise adding some dummy token)
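
A rough sketch of how those options could be covered on the plugin side with a single configurable URL (the function name and default below are hypothetical, not what eventually landed in the plugin): parse the URL and pick HTTPConnection or HTTPSConnection accordingly, which handles both the scheme toggle and the address change with one setting.

```python
# Hypothetical sketch: derive the connection from one configurable URL.
from http.client import HTTPConnection, HTTPSConnection
from urllib.parse import urlparse

def make_connection(url: str = "https://api.openai.com"):
    parsed = urlparse(url)
    if parsed.scheme == "https":
        return HTTPSConnection(parsed.hostname, parsed.port or 443)
    # Plain HTTP, e.g. a local Ollama or LiteLLM proxy endpoint
    return HTTPConnection(parsed.hostname, parsed.port or 80)

conn_local = make_connection("http://localhost:11434")   # local Ollama
conn_openai = make_connection()                          # default OpenAI endpoint
```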

@rubjo

rubjo commented Feb 26, 2024

@Aiq0 Confirmed working, thank you!
(Used HTTPConnection, not HTTPClient)

@Aiq0
Contributor

Aiq0 commented Feb 26, 2024

@Aiq0 Confirmed working, thank you! (Used HTTPConnection, not HTTPClient)

You're welcome. (Sorry, that was a typo.)

@yaroslavyaroslav
Owner Author

@rubjo @Aiq0 glad to hear it, folks!

It would be just awesome if you'd go the extra mile and open a PR with this functionality. The network-layer code is probably the least confusing piece of the whole code base, so I believe it could be done without too much effort.

@Aiq0
Contributor

Aiq0 commented Feb 26, 2024

@rubjo @Aiq0 glad to hear it, folks!

It would be just awesome if you'd go the extra mile and open a PR with this functionality. The network-layer code is probably the least confusing piece of the whole code base, so I believe it could be done without too much effort.

OK, I am going to add some config settings for tweaking the connection and create a PR (most likely tomorrow). Is there anything else that should be considered?

@yaroslavyaroslav
Owner Author

I believe not much. Just please try to avoid overcomplicating things, i.e. don't add extra settings if they can be avoided (e.g. I believe it's perfectly fine for users to provide a dummy token for local models rather than adding a separate toggle for that).

If you're about to add some global settings options, please consider putting them at the first level if possible.

A few words about this new feature in the Readme would definitely be worth it as well.
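
Purely as an illustration of the "first-level settings plus dummy token" idea (the key names below are hypothetical; the options that actually landed in the referenced PR may differ), a settings file could carry the server address and placeholder token at the top level, next to the existing assistants list:

```json
{
    "url": "http://localhost:11434",
    "token": "ollama-dummy-longer-than-10-characters"
}
```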
