Commit

Merge pull request #662 from allenai/tiny-olmo-config-fix
updated config for olmo tiny suite of models
ananyahjha93 authored Jul 16, 2024
2 parents 56d1fe0 + 43bc92c commit b55fb5f
Showing 5 changed files with 11 additions and 11 deletions.
4 changes: 2 additions & 2 deletions configs/tiny/OLMo-150M.yaml
@@ -46,11 +46,11 @@ compile: null

 optimizer:
   name: adamw
-  learning_rate: 1.0e-3
+  learning_rate: 6.0e-4
   weight_decay: 0.1
   eps: 1e-8
   decay_norm_and_bias: true
-  decay_embeddings: false
+  decay_embeddings: true
   betas:
   - 0.9
   - 0.95

4 changes: 2 additions & 2 deletions configs/tiny/OLMo-20M.yaml
@@ -46,11 +46,11 @@ compile: null

 optimizer:
   name: adamw
-  learning_rate: 1.0e-3
+  learning_rate: 6.0e-4
   weight_decay: 0.1
   eps: 1e-8
   decay_norm_and_bias: true
-  decay_embeddings: false
+  decay_embeddings: true
   betas:
   - 0.9
   - 0.95

2 changes: 1 addition & 1 deletion configs/tiny/OLMo-300M.yaml
@@ -50,7 +50,7 @@ optimizer:
   weight_decay: 0.1
   eps: 1e-8
   decay_norm_and_bias: true
-  decay_embeddings: false
+  decay_embeddings: true
   betas:
   - 0.9
   - 0.95

4 changes: 2 additions & 2 deletions configs/tiny/OLMo-60M.yaml
@@ -46,11 +46,11 @@ compile: null

 optimizer:
   name: adamw
-  learning_rate: 1.0e-3
+  learning_rate: 6.0e-4
   weight_decay: 0.1
   eps: 1e-8
   decay_norm_and_bias: true
-  decay_embeddings: false
+  decay_embeddings: true
   betas:
   - 0.9
   - 0.95

8 changes: 4 additions & 4 deletions configs/tiny/OLMo-750M.yaml → configs/tiny/OLMo-700M.yaml
@@ -1,4 +1,4 @@
-run_name: OLMo-750M
+run_name: OLMo-700M
 seed: 6198
 dry_run: false

@@ -8,8 +8,8 @@ wandb:

 model:
   d_model: 1536
-  n_heads: 24
-  n_layers: 24
+  n_heads: 16
+  n_layers: 16
   mlp_ratio: 8
   weight_tying: false
   alibi: false
@@ -50,7 +50,7 @@ optimizer:
   weight_decay: 0.1
   eps: 1e-8
   decay_norm_and_bias: true
-  decay_embeddings: false
+  decay_embeddings: true
   betas:
   - 0.9
   - 0.95
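
Every file in this commit flips decay_embeddings from false to true. As a rough illustration of what that could change, here is a minimal Python sketch, assuming the two decay_* flags decide which parameters receive weight_decay: 0.1 and which receive none. The function name, the name-matching rules, and the example parameter names are all hypothetical and are not taken from the OLMo code base.

```python
from typing import Dict, List


def split_decay_groups(
    param_names: List[str],
    decay_norm_and_bias: bool,
    decay_embeddings: bool,
) -> Dict[str, List[str]]:
    """Partition parameter names into decayed and non-decayed groups.

    Hypothetical matching rules, for illustration only:
      - names containing "wte" or "embedding" count as embeddings
      - names ending in ".bias" or containing "norm" count as norm/bias
      - everything else is always decayed
    """
    decay: List[str] = []
    no_decay: List[str] = []
    for name in param_names:
        is_embedding = "wte" in name or "embedding" in name
        is_norm_or_bias = name.endswith(".bias") or "norm" in name
        if is_embedding:
            target = decay if decay_embeddings else no_decay
        elif is_norm_or_bias:
            target = decay if decay_norm_and_bias else no_decay
        else:
            target = decay
        target.append(name)
    return {"decay": decay, "no_decay": no_decay}


# Hypothetical parameter names, chosen only to exercise each branch.
names = [
    "wte.weight",
    "blocks.0.attn_norm.weight",
    "blocks.0.att_proj.weight",
    "blocks.0.att_proj.bias",
]

before = split_decay_groups(names, decay_norm_and_bias=True, decay_embeddings=False)
after = split_decay_groups(names, decay_norm_and_bias=True, decay_embeddings=True)
print(before["no_decay"])
print(after["no_decay"])
```

Under these assumptions, the old settings leave only the embedding weight undecayed, while the new settings apply weight decay to every parameter.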