This repository has been archived by the owner on Jun 24, 2024. It is now read-only.
API footgun: infer_next_token still works after end of text #44
Labels: issue:bug (Something isn't working)
In llamacord, I have some logic that calls `infer_next_token` in a loop. Unfortunately, I didn't check for EOT, so the code kept generating tokens and producing (fascinatingly well-structured) garbage. I think we should check whether the last token is EOT and return an error if so. If you feed it a prompt, the EOT would no longer be the last token, and you should be able to infer without issues. (I wonder whether A[EOT]B infers differently from AB...)