-
Notifications
You must be signed in to change notification settings - Fork 43.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PoC freeform output w/ XML inserts (instead of strict JSON) #6253
Comments
What about considering markdown? It's even closer to natural language |
I considered it but structuring nested markdown seemed non-trivial to identify key value responses when nested and in a list. Would love an example of the outputs above how you’d imagine them in markdown |
I think there are some quasi markdown formats out there that might be able to help with that e.g.slack has a modified version for their UI builder. I can research and respond with what you asked. |
This issue has automatically been marked as stale because it has not had any activity in the last 50 days. You can unstale it by commenting or removing the label. Otherwise, this issue will be closed in 10 days. |
Unstale! 🪄 |
Did someone start this issue? Do we need to change promts? I'm working with mistral. I was wondering can I start this one and if tests get passed it shows everything is ok? Or this is more than this? |
@MKdir98 a PoC for this issue needs changes to both the prompt and the parsing stage. For example you could try converting the |
This issue has automatically been marked as stale because it has not had any activity in the last 50 days. You can unstale it by commenting or removing the label. Otherwise, this issue will be closed in 10 days. |
Duplicates
Summary 💡
LLMs are bad at generating JSON. Not surprising, because compared to natural language it's like Brainfuck. It's not human-readable without whitespace. Enter XML: it shares syntax with all of the web, and the opening and closing tags make it more readable (and thus more writable too).
Credits to @ntindle for the idea!
Examples 🌈
User: I'm curious about countries’ capitals. Like, what's the capital of the U.S.?
Assistant:
I don't know what the capital of the U.S. is. I'll look for this through a web search.
Function:
The capital of the U.S. is Washington, D.C.
Assistant:
Let's also feed your curiosity by searching for capitals of 10 more countries!
Function:
[insert 10 capitals here]
Assistant:
<answer>The capital of the U.S. is Washington, D.C. Some other national capitals are: [insert 10 capitals here]</answer>
Motivation 🔦
Reasoning: The closer our output format is to the "natural" format that the LLM is trained to output, the higher the reliability and performance can be.
The text was updated successfully, but these errors were encountered: