Adding support for Llama3 models in BedrockChat #32

fedor-intercom · 2024-04-30T11:15:37Z

Issue described here:
#31

This is a proposal that will allow to query Llama3 models reliably. The main issue was related to the new tokens required Llama3. If not using them, it will return an empty string for long prompts.

3coins · 2024-05-03T15:01:17Z

@fedor-intercom
Thanks for working on this change. Other than the lint errors, code looks good. Can you add some integration tests so we can verify these changes, let me know if you need help with this.

fedor-intercom · 2024-05-08T07:22:30Z

@3coins Should be it? just added a llama specific integration test.
Linting, unit test and integration tests all pass.

jonathancaevans · 2024-05-09T04:45:27Z

libs/aws/langchain_aws/chat_models/bedrock.py

@@ -47,6 +47,42 @@ def convert_messages_to_prompt_llama(messages: List[BaseMessage]) -> str:
    )


+def _convert_one_message_to_text_llama3(message: BaseMessage) -> str:


I propose one small change here, plus extra logic to support word in mouthing the assistant turn, for things like agent scratchpads etc living in the assistant turn,

def _convert_one_message_to_text_llama3(message: BaseMessage, lastMessage: bool) -> str: if isinstance(message, ChatMessage): message_text = f"<|start_header_id|>{message.role.capitalize()}<|end_header_id|>{message}<|eot_id|>" elif isinstance(message, HumanMessage): message_text = f"<|start_header_id|>user<|end_header_id|>{message.content}<|eot_id|>" elif isinstance(message, AIMessage): message_text = f"<|start_header_id|>assistant<|end_header_id|>{message.content}" if not lastMessage: message_text += "<|eot_id|>" elif isinstance(message, SystemMessage): message_text = f"<|start_header_id|>system<|end_header_id|>{message.content}" else: raise ValueError(f"Got unknown type {message}") return message_text def convert_messages_to_prompt_llama3(messages: List[BaseMessage]) -> str: """Convert a list of messages to a prompt for llama3.""" return "<|begin_of_text|>" + "".join( [_convert_one_message_to_text_llama3(message, i == len(messages) - 1) for i, message in enumerate(messages)] ) + "<|start_header_id|>assistant<|end_header_id|>"

This would allow stuff along these lines.

messages = [HumanMessage(content="list 5 colors"),AIMessage(content="No, I don't want to!")]

I am not totally sure I understand the use case.

Shall we do this in a separate PR with a more illustrative example?

@jonathancaevans
Thanks for explaining your use case, would you mind opening another issue for this, or even better if you can open a PR to make this change.

3coins

Looks good! 🚀

fedor-intercom added 3 commits April 30, 2024 12:12

Adding support for Llama3 models.

916d3c1

Remove extraneous <|begin_of_text|> tokens as only one is needed

7dfefbc

linting

d3a8290

fedor-intercom added 2 commits May 7, 2024 13:20

lint fix

52ce5c5

add integration test

1f212f4

jonathancaevans reviewed May 9, 2024

View reviewed changes

3coins approved these changes May 10, 2024

View reviewed changes

3coins merged commit 2efb770 into langchain-ai:main May 10, 2024
12 checks passed

ToyVo mentioned this pull request May 16, 2024

Input formatting fix for Llama3 with Bedrock #44

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding support for Llama3 models in BedrockChat #32

Adding support for Llama3 models in BedrockChat #32

fedor-intercom commented Apr 30, 2024 •

edited

Loading

3coins commented May 3, 2024

fedor-intercom commented May 8, 2024 •

edited

Loading

jonathancaevans May 9, 2024

fedor-intercom May 9, 2024

jonathancaevans May 9, 2024

3coins May 10, 2024

3coins left a comment

		@@ -47,6 +47,42 @@ def convert_messages_to_prompt_llama(messages: List[BaseMessage]) -> str:
		)


		def _convert_one_message_to_text_llama3(message: BaseMessage) -> str:

Adding support for Llama3 models in BedrockChat #32

Adding support for Llama3 models in BedrockChat #32

Conversation

fedor-intercom commented Apr 30, 2024 • edited Loading

3coins commented May 3, 2024

fedor-intercom commented May 8, 2024 • edited Loading

jonathancaevans May 9, 2024

Choose a reason for hiding this comment

fedor-intercom May 9, 2024

Choose a reason for hiding this comment

jonathancaevans May 9, 2024

Choose a reason for hiding this comment

3coins May 10, 2024

Choose a reason for hiding this comment

3coins left a comment

Choose a reason for hiding this comment

fedor-intercom commented Apr 30, 2024 •

edited

Loading

fedor-intercom commented May 8, 2024 •

edited

Loading