Fetch fork master #3

Hase-U · 2023-02-16T23:39:35Z

No description provided.

Co-authored-by: Harrison Chase <[email protected]>

Co-authored-by: Stefan Keselj <[email protected]> Co-authored-by: Harrison Chase <[email protected]>

…ngchain-ai#1000) Co-authored-by: Francisco Ingham <>

simple typo fix: because --> between

…1011) Updates the Unstructured example notebook with a PDF example. Includes additional dependencies for PDF processing (and images, etc).

Chroma is a simple to use, open-source, zero-config, zero setup vectorstore. Simply `pip install chromadb`, and you're good to go. Out-of-the-box Chroma is suitable for most LangChain workloads, but is highly flexible. I tested to 1M embs on my M1 mac, with out issues and reasonably fast query times. Look out for future releases as we integrate more Chroma features with LangChain!

Co-authored-by: William FH <[email protected]>

Imho retries should be performed for ServiceUnavailableError (which tends to happen to me quite often).

Co-authored-by: blob42 <[email protected]> Co-authored-by: blob42 <spike@w530>

This PR adds persistence to the Chroma vector store. Users can supply a `persist_directory` with any of the `Chroma` creation methods. If supplied, the store will be automatically persisted at that directory. If a user creates a new `Chroma` instance with the same persistence directory, it will get loaded up automatically. If they use `from_texts` or `from_documents` in this way, the documents will be loaded into the existing store. There is the chance of some funky behavior if the user passes a different embedding function from the one used to create the collection - we will make this easier in future updates. For now, we log a warning.

Add GooseAI, CerebriumAI, Petals, ForefrontAI

Co-authored-by: Ibis Prevedello <[email protected]>

Currently the chain is getting the column names and types on the one side and the example rows on the other. It is easier for the llm to read the table information if the column name and examples are shown together so that it can easily understand to which columns do the examples refer to. For an instantiation of this, please refer to the changes in the `sqlite.ipynb` notebook. Also changed `eval` for `ast.literal_eval` when interpreting the results from the sample row query since it is a better practice. --------- Co-authored-by: Francisco Ingham <> --------- Co-authored-by: Francisco Ingham <[email protected]>

Co-authored-by: jped <[email protected]> Co-authored-by: Justin Torre <[email protected]> Co-authored-by: Ivan Vendrov <[email protected]>

* Support a callback `on_llm_new_token` that users can implement when `OpenAI.streaming` is set to `True`

### Summary Adds tracked metadata from `unstructured` elements to the document metadata when `UnstructuredFileLoader` is used in `"elements"` mode. Tracked metadata is available in `unstructured>=0.4.9`, but the code is written for backward compatibility with older `unstructured` versions. ### Testing Before running, make sure to upgrade to `unstructured==0.4.9`. In the code snippet below, you should see `page_number`, `filename`, and `category` in the metadata for each document. `doc[0]` should have `page_number: 1` and `doc[-1]` should have `page_number: 2`. The example document is `layout-parser-paper-fast.pdf` from the [`unstructured` sample docs](https://github.com/Unstructured-IO/unstructured/tree/main/example-docs). ```python from langchain.document_loaders import UnstructuredFileLoader loader = UnstructuredFileLoader(file_path=f"layout-parser-paper-fast.pdf", mode="elements") docs = loader.load() ```

…angchain-ai#1053) This PR updates the usage instructions for PromptLayerOpenAI in Langchain's documentation. The updated instructions provide more detail and conform better to the style of other LLM integration documentation pages. No code changes were made in this PR, only improvements to the documentation. This update will make it easier for users to understand how to use `PromptLayerOpenAI`

We introduced a breaking change but missed this call. This PR fixes `langchain` to work with upstream `chroma`.

Updating this base file as well as the .ipynb file of the example on the website: langchain-ai/langchain@master...akshayvkt:langchain:patch-1 https://langchain.readthedocs.io/en/latest/modules/document_loaders/examples/everynote.html

Co-authored-by: Andrew Huang <[email protected]>

langchain-ai#909) Adds Google Search integration with [Serper](https://serper.dev) a low-cost alternative to SerpAPI (10x cheaper + generous free tier). Includes documentation, tests and examples. Hopefully I am not missing anything. Developers can sign up for a free account at [serper.dev](https://serper.dev) and obtain an api key. ## Usage ```python from langchain.utilities import GoogleSerperAPIWrapper from langchain.llms.openai import OpenAI from langchain.agents import initialize_agent, Tool import os os.environ["SERPER_API_KEY"] = "" os.environ['OPENAI_API_KEY'] = "" llm = OpenAI(temperature=0) search = GoogleSerperAPIWrapper() tools = [ Tool( name="Intermediate Answer", func=search.run ) ] self_ask_with_search = initialize_agent(tools, llm, agent="self-ask-with-search", verbose=True) self_ask_with_search.run("What is the hometown of the reigning men's U.S. Open champion?") ``` ### Output ``` Entering new AgentExecutor chain... Yes. Follow up: Who is the reigning men's U.S. Open champion? Intermediate answer: Current champions Carlos Alcaraz, 2022 men's singles champion. Follow up: Where is Carlos Alcaraz from? Intermediate answer: El Palmar, Spain So the final answer is: El Palmar, Spain > Finished chain. 'El Palmar, Spain' ```

Co-authored-by: Akshay <[email protected]>

…ain-ai#1066) This PR updates `PromptLayerOpenAI` to now support requests using the [Async API](https://langchain.readthedocs.io/en/latest/modules/llms/async_llm.html) It also updates the documentation on Async API to let users know that PromptLayerOpenAI also supports this. `PromptLayerOpenAI` now redefines `_agenerate` a similar was to how it redefines `_generate`

Alternate implementation to PR langchain-ai#960 Again - only FAISS is implemented. If accepted can add this to other vectorstores or leave as NotImplemented? Suggestions welcome...

…ze calculation. (langchain-ai#991) I modified the logic of the batch calculation for embedding according to this cookbook https://github.com/openai/openai-cookbook/blob/main/examples/Embedding_long_inputs.ipynb

This is a work in progress PR to track my progres. ## TODO: - [x] Get results using the specifed searx host - [x] Prioritize returning an `answer` or results otherwise - [ ] expose the field `infobox` when available - [ ] expose `score` of result to help agent's decision - [ ] expose the `suggestions` field to agents so they could try new queries if no results are found with the orignial query ? - [ ] Dynamic tool description for agents ? - Searx offers many engines and a search syntax that agents can take advantage of. It would be nice to generate a dynamic Tool description so that it can be used many times as a tool but for different purposes. - [x] Limit number of results - [ ] Implement paging - [x] Miror the usage of the Google Search tool - [x] easy selection of search engines - [x] Documentation - [ ] update HowTo guide notebook on Search Tools - [ ] Handle async - [ ] Tests ### Add examples / documentation on possible uses with - [ ] getting factual answers with `!wiki` option and `infoboxes` - [ ] getting `suggestions` - [ ] getting `corrections` --------- Co-authored-by: blob42 <spike@w530> Co-authored-by: Harrison Chase <[email protected]>

Co-authored-by: Ivan Vendrov <[email protected]> Co-authored-by: Sasmitha Manathunga <[email protected]>

Co-authored-by: Chen Wu (吴尘) <[email protected]>

This addresses langchain-ai#948. I set the documentation max width to 2560px, but can be adjusted - see screenshot below. <img width="1741" alt="Screenshot 2023-02-14 at 13 05 57" src="https://user-images.githubusercontent.com/23406704/218749076-ea51e90a-a220-4558-b4fe-5a95b39ebf15.png">

Co-authored-by: Maxime Vidal <[email protected]>

Co-authored-by: Francisco Ingham <[email protected]>

Fixes langchain-ai#1087

Changed number of types of chains to make it consistent with the rest of the docs

…in-ai#1093) The langchain-ai#1088 introduced a bug in Qdrant integration. That PR reverts those changes and provides class attributes to ensure consistent payload keys. In addition to that, an exception will be thrown if any of texts is None (that could have been an issue reported in langchain-ai#1087)

without --no-sandbox param, load documents from url by selenium in chrome occured error below: ```Traceback (most recent call last): File "/data//playgroud/try_langchain.py", line 343, in <module> langchain_doc_loader() File "/data//playgroud/try_langchain.py", line 67, in langchain_doc_loader documents = loader.load() File "/install/anaconda3-env/envs/python3.10/lib/python3.10/site-packages/langchain/document_loaders/url_selenium.py", line 102, in load driver = self._get_driver() File "/install/anaconda3-env/envs/python3.10/lib/python3.10/site-packages/langchain/document_loaders/url_selenium.py", line 76, in _get_driver return Chrome(options=chrome_options) File "/install/anaconda3-env/envs/python3.10/lib/python3.10/site-packages/selenium/webdriver/chrome/webdriver.py", line 80, in __init__ super().__init__( File "/install/anaconda3-env/envs/python3.10/lib/python3.10/site-packages/selenium/webdriver/chromium/webdriver.py", line 104, in __init__ super().__init__( File "/install/anaconda3-env/envs/python3.10/lib/python3.10/site-packages/selenium/webdriver/remote/webdriver.py", line 286, in __init__ self.start_session(capabilities, browser_profile) File "/install/anaconda3-env/envs/python3.10/lib/python3.10/site-packages/selenium/webdriver/remote/webdriver.py", line 378, in start_session response = self.execute(Command.NEW_SESSION, parameters) File "/install/anaconda3-env/envs/python3.10/lib/python3.10/site-packages/selenium/webdriver/remote/webdriver.py", line 440, in execute self.error_handler.check_response(response) File "/install/anaconda3-env/envs/python3.10/lib/python3.10/site-packages/selenium/webdriver/remote/errorhandler.py", line 245, in check_response raise exception_class(message, screen, stacktrace) selenium.common.exceptions.WebDriverException: Message: unknown error: Chrome failed to start: exited abnormally. (unknown error: DevToolsActivePort file doesn't exist) (The process started from chrome location /usr/bin/google-chrome is no longer running, so ChromeDriver is assuming that Chrome has crashed.) Stacktrace: #0 0x55cf8da1bfe3 <unknown> #1 0x55cf8d75ad36 <unknown> #2 0x55cf8d783b20 <unknown> #3 0x55cf8d77fa9b <unknown> #4 0x55cf8d7c1af7 <unknown> #5 0x55cf8d7c111f <unknown> langchain-ai#6 0x55cf8d7b8693 <unknown> langchain-ai#7 0x55cf8d78b03a <unknown> langchain-ai#8 0x55cf8d78c17e <unknown> langchain-ai#9 0x55cf8d9dddbd <unknown> langchain-ai#10 0x55cf8d9e1c6c <unknown> langchain-ai#11 0x55cf8d9eb4b0 <unknown> langchain-ai#12 0x55cf8d9e2d63 <unknown> langchain-ai#13 0x55cf8d9b5c35 <unknown> langchain-ai#14 0x55cf8da06138 <unknown> langchain-ai#15 0x55cf8da062c7 <unknown> langchain-ai#16 0x55cf8da14093 <unknown> langchain-ai#17 0x7f3da31a72de start_thread ``` add option `chrome_options.add_argument("--no-sandbox")` for chrome.

hwchase17 and others added 30 commits February 11, 2023 08:29

Harrison/0083 (langchain-ai#996)

e51fad1

Co-authored-by: Harrison Chase <[email protected]>

Harrison/fake llm (langchain-ai#990)

10e7297

Co-authored-by: Stefan Keselj <[email protected]> Co-authored-by: Harrison Chase <[email protected]>

Added initial capital letter to bullet points that had it missing (la…

0b6aa6a

…ngchain-ai#1000) Co-authored-by: Francisco Ingham <>

pdfminer (langchain-ai#1003)

bbb06ca

Harrison/unstructured structured (langchain-ai#1004)

0998577

bump version to 0084 (langchain-ai#1005)

6d44a22

typo fix on chat vector db docs (langchain-ai#1007)

03e5794

simple typo fix: because --> between

Unstructured example notebook: add a pdf, related deps (langchain-ai#…

05d8969

…1011) Updates the Unstructured example notebook with a PDF example. Includes additional dependencies for PDF processing (and images, etc).

Harrion/kg (langchain-ai#1016)

0c553d2

Co-authored-by: William FH <[email protected]>

chroma docs (langchain-ai#1012)

7fb33fc

agent refactors (langchain-ai#997)

0f0e69a

bump version to 0085 (langchain-ai#1017)

fc2502c

Added retry for openai.error.ServiceUnavailableError (langchain-ai#1022)

2088920

Imho retries should be performed for ServiceUnavailableError (which tends to happen to me quite often).

add links (langchain-ai#1027)

6a31a59

Harrison/makefile (langchain-ai#1033)

012a6df

Co-authored-by: blob42 <[email protected]> Co-authored-by: blob42 <spike@w530>

Add GooseAI, CerebriumAI, Petals, ForefrontAI (langchain-ai#981)

f30dcc6

Add GooseAI, CerebriumAI, Petals, ForefrontAI

Harrison/standarize prompt loading (langchain-ai#1036)

8c45f06

Co-authored-by: Ibis Prevedello <[email protected]>

Harrison/llm integrations (langchain-ai#1039)

88bebb4

Co-authored-by: jped <[email protected]> Co-authored-by: Justin Torre <[email protected]> Co-authored-by: Ivan Vendrov <[email protected]>

docs: fix typo in notebook (langchain-ai#1046)

c67c538

bump version to 0086 (langchain-ai#1050)

f05f025

Enable streaming for OpenAI LLM (langchain-ai#986)

caa8e47

* Support a callback `on_llm_new_token` that users can implement when `OpenAI.streaming` is set to `True`

add to async chain notebook (langchain-ai#1056)

d8ac274

bump version (langchain-ai#1057)

bac676c

Fix typo in integration with Chroma (langchain-ai#1070)

34cba2d

We introduced a breaking change but missed this call. This PR fixes `langchain` to work with upstream `chroma`.

hwchase17 and others added 20 commits February 15, 2023 22:44

Harrison/handle stop tokens ai21 (langchain-ai#1077)

5275306

Co-authored-by: Andrew Huang <[email protected]>

Harrison/evernote nb (langchain-ai#1078)

98186ef

Co-authored-by: Akshay <[email protected]>

Support similarity search by vector (in FAISS) (langchain-ai#961)

f0a2585

Alternate implementation to PR langchain-ai#960 Again - only FAISS is implemented. If accepted can add this to other vectorstores or leave as NotImplemented? Suggestions welcome...

add anthropic example (langchain-ai#1041)

19c2797

Co-authored-by: Ivan Vendrov <[email protected]> Co-authored-by: Sasmitha Manathunga <[email protected]>

Harrison/semantic subset (langchain-ai#1079)

c96ac3e

Co-authored-by: Chen Wu (吴尘) <[email protected]>

Harrison/telegram loader (langchain-ai#1080)

c60954d

Co-authored-by: Maxime Vidal <[email protected]>

Harrison/align table (langchain-ai#1081)

5e10e19

Co-authored-by: Francisco Ingham <[email protected]>

docs for batch size (langchain-ai#1082)

971458c

fix stuff count (langchain-ai#1083)

badeeb3

chat qa with sources (langchain-ai#1084)

7745505

Update qdrant.py (langchain-ai#1088)

5d11e5d

Fixes langchain-ai#1087

Modify number of types of chains (langchain-ai#1089)

3462130

Changed number of types of chains to make it consistent with the rest of the docs

bump version 0.0.88 (langchain-ai#1090)

6322b6f

Merge branch 'fork_master' into fetch__fork_master

eed7746

Hase-U merged commit c305398 into master Feb 16, 2023

Hase-U deleted the fetch__fork_master branch February 16, 2023 23:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fetch fork master #3

Fetch fork master #3

Hase-U commented Feb 16, 2023

Fetch fork master #3

Fetch fork master #3

Conversation

Hase-U commented Feb 16, 2023