Fix the issue of propagating `training` argument in subclasses #1623

james77777778 · 2024-05-09T07:12:40Z

I encountered this issue while trying to create a float8 training/inference example for keras.io.

In Keras3 (I haven't verified this in Keras2), training argument isn't propagated when using subclasses like keras_nlp.layers.TransformerDecoder, unless we explicitly expose training=None in the signature of call.

I've added the tests to confirm that this issue has been resolved and they can only pass with this PR.

james77777778 · 2024-05-15T07:24:31Z

Kindly ping @mattdangerw

This issue needs to be fixed for the float8 example on keras.io
keras-team/keras-io#1858

mattdangerw

LGTM!

Fix training args in subclasses

abaf480

james77777778 mentioned this pull request May 14, 2024

Add float8 example keras-team/keras-io#1858

Merged

mattdangerw approved these changes May 17, 2024

View reviewed changes

mattdangerw merged commit b043a4f into keras-team:master May 17, 2024

james77777778 deleted the fix-training-args branch June 21, 2024 05:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix the issue of propagating `training` argument in subclasses #1623

Fix the issue of propagating `training` argument in subclasses #1623

Uh oh!

james77777778 commented May 9, 2024

Uh oh!

james77777778 commented May 15, 2024

Uh oh!

mattdangerw left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix the issue of propagating training argument in subclasses #1623

Fix the issue of propagating training argument in subclasses #1623

Uh oh!

Conversation

james77777778 commented May 9, 2024

Uh oh!

james77777778 commented May 15, 2024

Uh oh!

mattdangerw left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix the issue of propagating `training` argument in subclasses #1623

Fix the issue of propagating `training` argument in subclasses #1623