
Conversation

@mattdangerw mattdangerw commented Feb 2, 2024

This proposes a new style for backbones and tasks:

```python
# === Layers ===
self.token_embedding = ...
self.transformer_layers = []
for i in range(num_layers):
    self.transformer_layers.append(...)

# === Functional Model ===
inputs = keras.Input(...)
x = self.token_embedding(inputs)
for layer in self.transformer_layers:
    x = layer(x)
outputs = x
super().__init__(inputs=inputs, outputs=outputs)

# === Config ===
self.num_layers = num_layers
```
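
For concreteness, here is a minimal, self-contained sketch of a backbone written in this style. Everything in it is illustrative rather than taken from this PR: the class name `ToyBackbone`, its arguments, and the use of `keras_nlp.layers.TransformerEncoder` as a stand-in transformer block are assumptions, and the pattern assumes Keras 3, which tolerates assigning sublayers before the functional `super().__init__()` call.

```python
import keras
import keras_nlp


class ToyBackbone(keras.Model):
    """Hypothetical backbone following the proposed three-section style."""

    def __init__(self, vocabulary_size, hidden_dim, num_layers, num_heads, **kwargs):
        # === Layers ===
        # Create all sublayers up front and keep them as attributes.
        self.token_embedding = keras.layers.Embedding(vocabulary_size, hidden_dim)
        self.transformer_layers = []
        for i in range(num_layers):
            self.transformer_layers.append(
                keras_nlp.layers.TransformerEncoder(
                    intermediate_dim=hidden_dim * 4,
                    num_heads=num_heads,
                    name=f"transformer_layer_{i}",
                )
            )

        # === Functional Model ===
        # Wire the sublayers into a functional graph, then hand the
        # inputs/outputs to keras.Model via super().__init__().
        token_ids = keras.Input(shape=(None,), dtype="int32", name="token_ids")
        x = self.token_embedding(token_ids)
        for layer in self.transformer_layers:
            x = layer(x)
        super().__init__(inputs=token_ids, outputs=x, **kwargs)

        # === Config ===
        # Store constructor arguments so get_config() can serialize them.
        self.vocabulary_size = vocabulary_size
        self.hidden_dim = hidden_dim
        self.num_layers = num_layers
        self.num_heads = num_heads

    def get_config(self):
        return {
            "vocabulary_size": self.vocabulary_size,
            "hidden_dim": self.hidden_dim,
            "num_layers": self.num_layers,
            "num_heads": self.num_heads,
        }
```

Because the sublayers are plain attributes, something like `backbone.transformer_layers[0]` is directly addressable, which is what the per-layer manipulations below rely on.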

The main goal is to allow more readable manipulations of the model, e.g. a custom LoRA routine:

```python
backbone = GPT2Backbone(...)
for layer in backbone.transformer_layers:
    # Use a different LoRA rank for different matrices.
    layer.self_attention.query_projection.enable_lora(16)
    layer.self_attention.key_projection.enable_lora(4)
```
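
As a further illustration (not from the PR itself), the same attribute access makes other per-layer manipulations just as readable, e.g. freezing the embedding and the lower half of the stack before fine-tuning:

```python
# Sketch: freeze the token embedding and the first half of the
# transformer stack; attribute names follow the proposed style above.
backbone.token_embedding.trainable = False
for layer in backbone.transformer_layers[: len(backbone.transformer_layers) // 2]:
    layer.trainable = False
```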

@mattdangerw
Member Author

Draft PR, just for feedback for now. I've touched less than half of the models so far, so tests will fail.

@mattdangerw mattdangerw force-pushed the new-backbone-style branch 4 times, most recently from 4c844a6 to acf72d8 on February 9, 2024 at 22:37
@mattdangerw
Member Author

I also removed unused features from our pipeline model class, just to keep our tasks as clean as possible.

@mattdangerw mattdangerw force-pushed the new-backbone-style branch 3 times, most recently from 4e5c555 to 58ab131 on February 9, 2024 at 23:25
@mattdangerw mattdangerw added the kokoro:force-run Runs Tests on GPU label Feb 9, 2024
@kokoro-team kokoro-team removed the kokoro:force-run Runs Tests on GPU label Feb 9, 2024
@mattdangerw mattdangerw force-pushed the new-backbone-style branch 2 times, most recently from 39f2205 to 5f1f2b3 on February 10, 2024 at 00:31
@mattdangerw mattdangerw marked this pull request as ready for review February 10, 2024 01:36
@mattdangerw mattdangerw requested a review from fchollet February 10, 2024 02:30

@fchollet fchollet left a comment

LGTM, thank you!

@mattdangerw mattdangerw merged commit c3c268a into keras-team:master Feb 12, 2024