PyMySearch

My low-key LLM-friendly wrapper for search engines.

Installation

pip install git+https://github.com/pnhuy/pymysearch

Usage

Search

from pymysearch.search import SearchClient
client = SearchClient('duckduckgo')
response = client.search('langchain', max_results=5)
results = response.results
for res in results:
    print(res.url)
    print(res.content[:10])

Q&A with Search

Give a quick answer based on search results:

# pip install langchain-text-splitters langchain-chroma langchain-openai langchain-huggingface

from langchain_chroma import Chroma
from langchain_huggingface import HuggingFaceEmbeddings
from langchain_text_splitters import RecursiveCharacterTextSplitter
from pymysearch.search import SearchClient

# you should config OPENAI_API_KEY
# os.environ['OPENAI_API_KEY'] = 'sk-xxx'
# or using other OpenAI compatible local model
llm = ChatOpenAI(model_name="gpt-3.5-turbo")

client = SearchClient(
    llm=llm,
    text_splitter=RecursiveCharacterTextSplitter(chunk_size=5000, chunk_overlap=500),
    vectorstore=Chroma(
        persist_directory="./chroma_db",
        embedding_function=HuggingFaceEmbeddings(),
    ),
)

print(client.qna_search("What is the capital of Vietnam?"))
# => The capital of Vietnam is Hanoi.

You can customize the prompt to change the output:

from langchain_core.prompts import PromptTemplate

prompt = PromptTemplate.from_template("""You are an assistant for question-answering tasks.
Use the following pieces of retrieved context to answer the question.
If you don't know the answer, just say that you don't know.
Make the answer informative and detailed.

Question: {question}

Context:
{context}

Answer:""")
print(client.qna_search("What is the capital of Vietnam?", prompt=prompt))
# => The capital of Vietnam is Hanoi. Hanoi is the second-largest city in Vietnam by population...

If you want to use another LLM instead of OpenAI, please make sure that that model supports function calls. Please see more at Berkeley Function-Calling Leaderboard. The prompt should be tailored for each model.

from langchain_community.llms.ollama import Ollama

llm = Ollama(model='llama3')
client = SearchClient(
    llm=llm,
    text_splitter=RecursiveCharacterTextSplitter(chunk_size=5000, chunk_overlap=500),
    vectorstore=Chroma(
        persist_directory="./chroma_db",
        embedding_function=HuggingFaceEmbeddings(),
    ),
)

prompt = PromptTemplate.from_template("""You are an assistant for question-answering tasks.

Context:
{context}

Use the following pieces of retrieved context to answer the question.
If you don't know the answer, just say that you don't know.
Make the answer consise.
                                      
Question: {question}
""")

print(client.qna_search("What is the capital of Vietnam?", prompt=prompt))
# => Hanoi

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
pymysearch		pymysearch
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PyMySearch

Installation

Usage

Search

Q&A with Search

About

Languages

License

pnhuy/pymysearch

Folders and files

Latest commit

History

Repository files navigation

PyMySearch

Installation

Usage

Search

Q&A with Search

About

Topics

Resources

License

Stars

Watchers

Forks

Languages