Can this library be used for encoder only models? #464

elyxlz · 2023-05-01T00:29:41Z

elyxlz
May 1, 2023

I would like to use tune an encoder-only model (like T5) via RLHF and use its produced embeddings to calculate the reward.
Would this be possible with this library?
Thank you!

Answered by maxreciprocate

May 1, 2023

Hi! Unfortunately no, this library, at its current state, operates solely on discrete token level rewards. Although I recall that @shahbuland was doing something interesting with reinforcement on continuous action spaces, so he might give some pointers here

View full answer

maxreciprocate · 2023-05-01T09:50:43Z

maxreciprocate
May 1, 2023
Maintainer

Hi! Unfortunately no, this library, at its current state, operates solely on discrete token level rewards. Although I recall that @shahbuland was doing something interesting with reinforcement on continuous action spaces, so he might give some pointers here

0 replies

elyxlz · 2023-05-01T12:39:28Z

elyxlz
May 1, 2023
Author

Thank you! I'll contact him

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can this library be used for encoder only models? #464

{{title}}

Replies: 2 comments

{{title}}

{{title}}

Select a reply

Can this library be used for encoder only models? #464

elyxlz May 1, 2023

Replies: 2 comments

maxreciprocate May 1, 2023 Maintainer

elyxlz May 1, 2023 Author

elyxlz
May 1, 2023

maxreciprocate
May 1, 2023
Maintainer

elyxlz
May 1, 2023
Author