Commit

Added gc_interval for manual garbage collection (#278)
Signed-off-by: Selvaraj Anandaraj <[email protected]>
Co-authored-by: Selvaraj Anandaraj <[email protected]>
sanandaraj5597 and Selvaraj Anandaraj committed Apr 1, 2024
1 parent 8d73d6c commit d4ec7c8
Showing 2 changed files with 2 additions and 1 deletion.
1 change: 1 addition & 0 deletions launcher_scripts/conf/training/gpt3/175b.yaml
@@ -156,6 +156,7 @@ model:
   use_cpu_initialization: False # Init weights on the CPU (slow for large models)
   onnx_safe: False # Use work-arounds for known problems with Torch ONNX exporter.
   apex_transformer_log_level: 30 # Python logging level displays logs with severity greater than or equal to this
+  gc_interval: 10 #Manual garbage collection

   # Nsys profiling options
   nsys_profile:
2 changes: 1 addition & 1 deletion launcher_scripts/conf/training/llama/llama2_70b.yaml
@@ -139,7 +139,7 @@ model:
   use_flash_attention: true
   overlap_p2p_comm: true
   batch_p2p_comm: false
-  gc_interval: 100
+  gc_interval: 10
   optim:
     name: distributed_fused_adam
     lr: 0.00015
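
For context, a minimal sketch of what a `gc_interval` setting like the one above typically controls. It assumes the training loop turns off Python's automatic garbage collector and instead calls `gc.collect()` once every `gc_interval` steps. The function name `train_with_manual_gc` and the `train_step` callback are hypothetical and are not taken from this commit or from the launcher's code.

```python
import gc


def train_with_manual_gc(num_steps, gc_interval, train_step=lambda step: None):
    """Run num_steps steps, collecting garbage manually every gc_interval steps.

    Hypothetical illustration only. Returns the list of steps at which a
    manual collection ran.
    """
    was_enabled = gc.isenabled()
    collected_at = []
    if gc_interval > 0:
        # Assumption: a positive interval means automatic collection is
        # switched off so collections happen only at known steps.
        gc.disable()
    try:
        for step in range(1, num_steps + 1):
            train_step(step)
            if gc_interval > 0 and step % gc_interval == 0:
                gc.collect()
                collected_at.append(step)
    finally:
        # Restore the collector state that was in effect before the loop.
        if was_enabled:
            gc.enable()
    return collected_at


if __name__ == "__main__":
    print(train_with_manual_gc(25, 10))    # [10, 20]
    print(train_with_manual_gc(250, 100))  # [100, 200]
```

Under this assumption, lowering the interval from 100 to 10 makes collections ten times more frequent, so each one has less accumulated garbage to process.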