请问max_context_length是什么 #67

volagold · 2023-07-28T09:53:27Z

如题，默认值是512，请问这是指生成下一个token时只看512个token长度的上文吗？

    def chat(
        self,
        history: List[str],
        *,
        max_length: int = 2048,
        max_context_length: int = 512,
        do_sample: bool = True,
        top_k: int = 0,
        top_p: float = 0.7,
        temperature: float = 0.95,
        num_threads: int = 0,
    ) -> str:
        gen_config = _C.GenerationConfig(
            max_length=max_length,
            max_context_length=max_context_length,
            do_sample=do_sample,
            top_k=top_k,
            top_p=top_p,
            temperature=temperature,
            num_threads=num_threads,
        )

li-plus · 2023-08-07T06:08:06Z

max_context_length 是输入(prompt)的最大长度，在cpu上太长的输入会导致首字符延迟太大，如果用gpu可以调大一些，另外max_length是输入+输出的最大长度。

volagold · 2023-08-07T06:11:17Z

好的，谢谢解答

compass-star · 2023-08-16T12:30:41Z

max_context_length 是输入(prompt)的最大长度，在cpu上太长的输入会导致首字符延迟太大，如果用gpu可以调大一些，另外max_length是输入+输出的最大长度。

请问是否可以这么理解max_length-max_context_length=输出的最大长度即max_new_tokens?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

请问max_context_length是什么 #67

请问max_context_length是什么 #67

volagold commented Jul 28, 2023

li-plus commented Aug 7, 2023

volagold commented Aug 7, 2023

compass-star commented Aug 16, 2023

请问max_context_length是什么 #67

请问max_context_length是什么 #67

Comments

volagold commented Jul 28, 2023

li-plus commented Aug 7, 2023

volagold commented Aug 7, 2023

compass-star commented Aug 16, 2023