Update README for FSDP#832

Merged
regisss merged 3 commits into huggingface:synapse_1.15 from hlahkar:readme_update on Mar 25, 2024

Conversation

@hlahkar hlahkar commented Mar 25, 2024

This corrects the command for FSDP

@hlahkar hlahkar requested a review from vivekgoe March 25, 2024 03:17
@hlahkar hlahkar requested a review from regisss as a code owner March 25, 2024 03:17
@HuggingFaceDocBuilderDev
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@regisss regisss added the run-test Run CI for PRs from external contributors label Mar 25, 2024

@regisss regisss left a comment

LGTM!

Is this compatible with Synapse 1.14 (so merge directly into main) or from 1.15 (so merge into synapse_1.15)?

@libinta libinta left a comment

***** train metrics *****
epoch = 2.0
max_memory_allocated (GB) = 82.02
memory_allocated (GB) = 25.98
total_memory_available (GB) = 94.62
train_loss = 0.8538
train_runtime = 1:21:26.40
train_samples_per_second = 1.273
train_steps_per_second = 0.016
03/25/2024 19:01:48 - INFO - __main__ - *** Evaluate ***
[INFO|trainer.py:1688] 2024-03-25 19:01:48,298 >> ***** Running Evaluation *****
[INFO|trainer.py:1690] 2024-03-25 19:01:48,298 >> Num examples = 125
[INFO|trainer.py:1693] 2024-03-25 19:01:48,298 >> Batch size = 1
100%|██████████| 16/16 [00:35<00:00, 2.37s/it]
100%|██████████| 16/16 [00:37<00:00, 2.35s/it]
***** eval metrics *****
epoch = 2.0
eval_accuracy = 0.7927
eval_loss = 0.7583
eval_runtime = 0:00:39.94
eval_samples = 125
eval_samples_per_second = 3.129
eval_steps_per_second = 0.401
max_memory_allocated (GB) = 82.02
memory_allocated (GB) = 25.98
perplexity = 2.1347
total_memory_available (GB) = 94.62
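
As a quick sanity check on the eval metrics above, the reported perplexity can be recomputed from the reported eval_loss. This is a minimal sketch that assumes perplexity is the exponential of the mean evaluation loss, which is the usual convention for causal language modeling. The helper name `perplexity_from_loss` is hypothetical and is not taken from the training script. Because the logged loss is rounded to four decimals, the recomputed value can differ from the logged one in the last digit.

```python
import math

# Values copied from the eval metrics in the log above.
eval_loss = 0.7583
reported_perplexity = 2.1347


def perplexity_from_loss(loss: float) -> float:
    """Return exp(loss); fall back to infinity if the exponential overflows.

    Hypothetical helper for illustration only.
    """
    try:
        return math.exp(loss)
    except OverflowError:
        return float("inf")


recomputed = perplexity_from_loss(eval_loss)
print(f"recomputed perplexity = {recomputed:.4f}")

# The logged loss is rounded, so compare with a small absolute tolerance.
assert math.isclose(recomputed, reported_perplexity, abs_tol=1e-3)
```

If the assertion holds, the two logged numbers are consistent with each other under that assumption. It says nothing about whether the run itself is correct.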

@regisss regisss changed the base branch from main to synapse_1.15 March 25, 2024 22:56
@regisss regisss merged commit 1f52a94 into huggingface:synapse_1.15 Mar 25, 2024

Labels

run-test (Run CI for PRs from external contributors), synapse 1.15

4 participants