Add BloomModel hydra support #129
Conversation
@Dahoas can you give this a quick glance
Bloom rewards seem a bit low relative to other models :(. But not much we can really do, I guess.
To be expected
# 1.0 in head_mask indicate we keep the head
# attention_probs has shape batch_size x num_heads x N x N
# head_mask has shape n_layer x batch x num_heads x N x N
head_mask = self.get_head_mask(head_mask, hf_get_num_hidden_layers(self.config))
What is the head mask?
The head mask is a binary mask that can be used to drop the self-attention weights (softmax(qk)) from specified heads before computing the full attention output. For example, see here. get_head_mask just expands the dimensions to line up with the proper shape.
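To make the mechanics concrete, here is a minimal sketch of how a binary head mask multiplies into the attention probabilities. `apply_head_mask` is a hypothetical helper name, not part of the PR; the shapes follow the comments in the diff above.

```python
import torch

def apply_head_mask(attention_probs: torch.Tensor, head_mask: torch.Tensor) -> torch.Tensor:
    """Zero out the attention weights of masked heads.

    attention_probs: (batch_size, num_heads, N, N) softmax(qk) weights.
    head_mask: broadcastable mask, 1.0 keeps a head, 0.0 drops it.
    """
    return attention_probs * head_mask

# Drop head 0 across the batch while leaving the other heads untouched:
probs = torch.softmax(torch.randn(2, 4, 8, 8), dim=-1)
mask = torch.ones(1, 4, 1, 1)
mask[0, 0] = 0.0
masked = apply_head_mask(probs, mask)
```

`get_head_mask` does essentially the expansion step for you: given a per-layer or per-head mask, it broadcasts it out to the `(n_layer, batch, num_heads, N, N)` shape noted in the comment.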
else:
    attention_mask = attention_mask.to(hidden_states.device)

alibi = modeling_bloom.build_alibi_tensor(
I didn't know bloom used alibi
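Yes, BLOOM replaces learned position embeddings with ALiBi: each head adds a fixed linear bias to the attention scores that grows with key distance. A simplified sketch of the bias construction, assuming `num_heads` is a power of two (the real `modeling_bloom.build_alibi_tensor` also takes the attention mask and dtype, and handles non-power-of-two head counts):

```python
import torch

def build_alibi_bias(num_heads: int, seq_len: int) -> torch.Tensor:
    """Simplified ALiBi bias: head h gets slope 2**(-8 * (h + 1) / num_heads);
    the bias is added to attention scores before the softmax."""
    slopes = torch.tensor(
        [2.0 ** (-8.0 * (h + 1) / num_heads) for h in range(num_heads)]
    )
    distances = torch.arange(seq_len, dtype=torch.float32)  # key positions 0..seq_len-1
    # (num_heads, 1, seq_len): one linear ramp per head
    return slopes[:, None, None] * distances[None, None, :]

bias = build_alibi_bias(4, 16)  # shape (4, 1, 16)
```

Because the bias depends only on relative position, BLOOM needs no position-embedding table, which is one reason the hydra branch can recompute attention from cached hidden states alone.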
Looks good to me! I just left some questions so I can better understand how things are functioning.
This PR adds hydra-based PPO model branching support for BloomModels.

wandb reports:

Note: The <Arch>ModelBranch implementations should be refactored to share a common interface if we plan on adding more in the near future.
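The shared interface suggested above could be sketched as an abstract base class; the names `ModelBranch`/`BloomModelBranch` and the `forward` signature here are illustrative assumptions, not the PR's actual API.

```python
from abc import ABC, abstractmethod

import torch


class ModelBranch(ABC):
    """Hypothetical common interface for per-architecture hydra branches.

    Each <Arch>ModelBranch would run its copied top transformer layers
    over frozen hidden states from the trunk model.
    """

    @abstractmethod
    def forward(
        self,
        hidden_states: torch.Tensor,
        attention_mask: torch.Tensor,
    ) -> torch.Tensor:
        """Apply the branched layers to the trunk's hidden states."""


class BloomModelBranch(ModelBranch):
    def forward(self, hidden_states, attention_mask):
        # Placeholder: a real branch would apply copied BloomBlock layers
        # (with ALiBi bias) before the value head.
        return hidden_states
```

With a shared base class like this, adding a new architecture only means subclassing and implementing `forward`, rather than duplicating the branching plumbing per model family.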