-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
sidekick - whole project files view #46
Comments
Thanks for the input! As a neat coincidence, I started playing around with vector stores and embeddings tonight, with a new It's not working for large repos. Hitting API limits for # of tokens with OpenAI. I'd like to explore a local embedding solution anyway. Once I get the |
I'm also exploring using ctags to pass along as context |
Mmm what are you trying to improve using ctags? Using them as a sort of summary to reduce context window?
I’m not sure if I want to lose resolution tho. For the use case of ‘fixing my current code’ , I -really- do not want it hallucinating and re-generate the ’things that currently work’.
Do you have a test case example where context window is too large to solve the question you are trying to solve? Would like to take a stab at trying to use prompt engineering to essentially, break it up into smaller questions to try to fit into the context window.
Saw you have a basic unit test setup, I’m thinking what it really needs, is https://github.com/openai/human-eval
I’ve only just started mucking with python recently, and am not familiar with the library eco-system yet, but if you have some suggestions as to how to get a proper ‘code generation test’ going, I think that’s step #1 before taking pot shots at improving it.
… On 18 Jul 2023, at 9:42 AM, Nick Sullivan ***@***.***> wrote:
I'm also exploring using ctags to pass along as context
—
Reply to this email directly, view it on GitHub <#46 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAMJWFMJ3RNDEMXKRPU6BCLXQXSZHANCNFSM6AAAAAA2MNUFBE>.
You are receiving this because you authored the thread.
|
Still working on this. In the meantime, I added commands to the sidekick prompt, so you can add / drop files for context, without restarting. /add file |
FYI - I got And then built a But the results are currently terrible, because the resulting text from the VectorStore is summarized text that answers a question, but it doesn't actually give the code/documents. I think the next thing to try is to have a custom Vector Store tool that searches the vector store database, returns the relevant code, and returns that for use by the LLM. |
mm think i see your problem. i think it would be best resolved by breaking up code into functions. one function per file seems like the easy prototype. Then you would probably want to have a 'fill context' method that, tries to add all the dependent functions once you can identify the functions that matter. Once that limit is hit, multi shot approaches seems like the way forward. "What other functions do you need information on?" |
Makes sense, and I'm headed in that direction In the meantime, the current My current set up is that I use openrouter.ai so that I can use I'll sit down to solve a problem, think about what files are going to be needed to do it, and load those in, then chat with those files. |
It’s think that’s prohibitively expensive tho. Costs USD$2-3 per call @32k input, and when actively using it, you are probably calling it 5-10 times an hour.
That being said, the models that can be run on 24GB consumer gpu’s probably suck vs gpt4. Heh only one way to find out. I need to figure out how to hack into the LLM/ChatModel class to make a custom remote call.
… On 27 Jul 2023, at 2:53 PM, Nick Sullivan ***@***.***> wrote:
Makes sense, and I'm headed in that direction
In the meantime, the current aicodebot sidekick command is pretty good, because you can add files for context. Today I put a lot of energy into making that a better process, including the ability to add/drop files cleanly, a better management of the token economy, etc.
My current set up is that I use openrouter.ai so that I can use gpt4-32k. With this amount of context, I'm able to solve most of my programming tasks very efficiently.
I'll sit down to solve a problem, think about what files are going to be needed to do it, and load those in, then chat with those files.
—
Reply to this email directly, view it on GitHub <#46 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAMJWFMPNWJRMRMTEZMXHY3XSIF73ANCNFSM6AAAAAA2MNUFBE>.
You are receiving this because you authored the thread.
|
i believe using a vectorDB to store the files, then only search for relevant portions would work to get around the input context limit.
https://python.langchain.com/docs/modules/chains/additional/vector_db_text_generation
Could break it down into multiple queries, probably using something like the chain of thoughts to break up different parts. seems like what the community is doing.
falcon-40b-code seems interesting. the encoder can be used standalone to run that vector search, before piping that output to the decoder.
The text was updated successfully, but these errors were encountered: