
Conversation

@chenmoneygithub (Contributor)

resolve #712

@chenmoneygithub changed the title from "Fix the sample truncation strategy" to "Fix the sampler truncation strategy" on Feb 2, 2023
@mattdangerw (Member) left a comment


Left a few comments. Overall, the thing that is still striking me about this design is how special-cased the ragged code for doing the pre- and post-processing is getting (even in token id space). It seems like we have a few phases for generation:

  1. tokenize inputs
  2. mask and pack inputs
  3. sample
  4. truncate and manipulate outputs
  5. detokenize

It seems unlikely that we will ever do 2) and 4) in a way that is perfectly general, so it's unfortunate that 2), 3), and 4) all come together. It does not feel very modular.

Not something we need to solve on this PR, but let's keep thinking about that.
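For context, step 4 is the part this PR changes (see the linked issue below): truncation should only cut at the first `end_token` that appears *after* the prompt. A minimal sketch of that idea, assuming dense inputs and hypothetical names (`truncate_after_end_token`, `prompt_lengths`) that are not the PR's actual implementation:

```python
import tensorflow as tf

def truncate_after_end_token(token_ids, prompt_lengths, end_token_id):
    """Cut each row at the first end_token generated after the prompt.

    token_ids: dense [batch, max_length] int32 generated sequences.
    prompt_lengths: [batch] int32 prompt length per row.
    Returns a tf.RaggedTensor; rows without a generated end_token keep
    their full length.
    """
    max_length = tf.shape(token_ids)[1]
    positions = tf.range(max_length)[tf.newaxis, :]
    # Ignore end tokens inside the prompt itself, so a prompt that
    # happens to contain end_token_id is not blindly truncated.
    is_end = tf.logical_and(
        tf.equal(token_ids, end_token_id),
        positions >= prompt_lengths[:, tf.newaxis],
    )
    # argmax over the boolean mask returns the index of the first True.
    first_end = tf.argmax(
        tf.cast(is_end, tf.int32), axis=-1, output_type=tf.int32
    )
    has_end = tf.reduce_any(is_end, axis=-1)
    lengths = tf.where(has_end, first_end, max_length)
    return tf.ragged.boolean_mask(token_ids, positions < lengths[:, tf.newaxis])
```

For example, with `end_token_id = 2`, a row `[5, 6, 2, 7, 2, 0]` and prompt length 3 would be trimmed to `[5, 6, 2, 7]`: the end token at index 2 is part of the prompt and is ignored.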

@chenmoneygithub (Contributor, Author)
@mattdangerw Yeah, this sampler implementation now contains a lot of things: preprocessing, the sampling algorithm, and postprocessing. It's a bit hard to figure out the best user flow within the class itself; we can keep thinking about it while we roll out support for GPT-2 text generation, including caching, long-sequence generation, and so on.
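To make the modularity point concrete, here is a minimal sketch of what step 3 could look like in isolation, assuming a `next_token_logits_fn` callable; the names are illustrative, not the KerasNLP API:

```python
import tensorflow as tf

def greedy_sample(next_token_logits_fn, prompt_ids, num_steps):
    """Minimal greedy decoding loop over dense token ids.

    next_token_logits_fn: maps [batch, length] ids -> [batch, vocab] logits.
    prompt_ids: dense [batch, length] int32 prompt.
    """
    ids = prompt_ids
    for _ in range(num_steps):
        logits = next_token_logits_fn(ids)
        # Pick the most likely next token for each sequence in the batch.
        next_id = tf.argmax(logits, axis=-1, output_type=tf.int32)
        ids = tf.concat([ids, next_id[:, tf.newaxis]], axis=-1)
    return ids
```

Everything ragged (packing beforehand, end-token truncation afterwards) would then live outside this loop.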

@mattdangerw (Member) left a comment


Thanks!

@chenmoneygithub merged commit 10d451e into keras-team:master on Feb 12, 2023

Development

Successfully merging this pull request may close this issue:

Sampler should not blindly truncate out everything after the first end_token