We need to simplify our model docstring so they are more easily understandable, and we can start by porting the changes in https://github.com/keras-team/keras-nlp/pull/843 to all other models. Let's do this one model at a time, to keep things granular. Checklist: - [ ] Make sure to update any "custom vocabulary" examples to match the model actual vocabulary type and special token requirements (varies per model). - [ ] Test out all docstring snippets! - [ ] Make sure to follow our code style guidelines re indentation etc.