Thank you for your wonderful work, which provides an open-source solution for enabling collaboration between strong and weak models.
Beyond reasoning tasks, retrieval is another crucial application scenario for large language models. However, not every user query actually requires retrieval. Approaches that always retrieve, such as standard RAG, can introduce irrelevant documents, which may mislead the model, produce incorrect responses, and waste resources. This makes automatic retrieval, where the system decides per query whether to retrieve at all, a key capability for improving both efficiency and accuracy.
I hope that semantic-router can support retrieval routing. In discussions with Chinese developers from vLLM on Xiaohongshu (Little Red Book, a Chinese social media platform), we agreed that this is a particularly important use case.
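To make the request more concrete, here is a rough sketch of what I mean by retrieval routing. It is only an illustration, not semantic-router's API: `needs_retrieval`, `route`, `RoutedPrompt`, and the cue list are all hypothetical names, and the keyword check merely stands in for whatever classifier the router would actually use.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical cue phrases suggesting the query depends on external or
# time-sensitive facts. A real router would likely use a trained classifier.
RETRIEVAL_CUES = ("latest", "current", "today", "according to", "news", "price")


@dataclass
class RoutedPrompt:
    used_retrieval: bool  # whether the retriever was called for this query
    prompt: str           # the prompt that would be sent to the model


def needs_retrieval(query: str) -> bool:
    """Return True if the query contains any cue phrase (simple substring match)."""
    lowered = query.lower()
    return any(cue in lowered for cue in RETRIEVAL_CUES)


def route(query: str, retrieve: Callable[[str], List[str]]) -> RoutedPrompt:
    """Call the retriever only when the gate decides the query needs it."""
    if needs_retrieval(query):
        context = "\n".join(retrieve(query))
        return RoutedPrompt(True, f"Context:\n{context}\n\nQuestion: {query}")
    # No retrieval: pass the query through unchanged, avoiding irrelevant documents.
    return RoutedPrompt(False, query)
```

With something like this, a query such as "What is 2 + 2?" would go straight to the model, while "What is the latest vLLM release?" would trigger retrieval first.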
Xunzhuo