Add training args to capture device profiling traces#117

Merged
vivekgoe merged 1 commit into mlperf/llamav2_fused from device_logging on Mar 19, 2024

@vivekgoe

What does this PR do?

Add training args to capture device profiling traces

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Number of steps to ignore for profiling.
profiling_steps (`int`, *optional*, defaults to 0):
Number of steps to be captured when enabling profiling.
profiling_record_shapes (`boolean`, *optional*, defaults to True):

Where is this used? Is the default value taken from the original transformers?

Author

This is unrelated to the current change; I just added a missing comment for the PT profiler. The argument is used in the PT profiler initialization in trainer.py and is Habana-specific (it is not present in the original transformers).
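
For readers unfamiliar with this kind of argument, here is a minimal, standard-library-only sketch of how a "steps to ignore" count and a `profiling_steps` count could define a capture window. It is an illustration under stated assumptions, not the code in trainer.py: `ignore_steps` is a hypothetical name (the excerpt above cuts off the real argument name), its default of 0 is assumed, and the `ProfilingArgs` class and `step_is_profiled` helper are invented for this example. The defaults for `profiling_steps` and `profiling_record_shapes` follow the docstring quoted in the review.

```python
from dataclasses import dataclass


@dataclass
class ProfilingArgs:
    # Hypothetical name: the excerpt does not show the real argument name.
    ignore_steps: int = 0
    # Names and defaults below follow the docstring quoted in the review.
    profiling_steps: int = 0
    profiling_record_shapes: bool = True


def step_is_profiled(args: ProfilingArgs, step: int) -> bool:
    """Return True if the zero-based training step is inside the capture window."""
    if args.profiling_steps <= 0:
        return False  # a value of 0 is treated here as "profiling disabled"
    start = args.ignore_steps
    return start <= step < start + args.profiling_steps


# Skip the first 3 steps, then capture the next 2.
args = ProfilingArgs(ignore_steps=3, profiling_steps=2)
captured = [s for s in range(10) if step_is_profiled(args, s)]
print(captured)
```

The actual trainer may schedule capture differently (for example, with separate wait and warmup phases), so treat the window arithmetic above as an assumption.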

@vivekgoe vivekgoe merged commit 614d3b1 into mlperf/llamav2_fused Mar 19, 2024
astachowiczhabana pushed a commit that referenced this pull request Jan 23, 2025
* Initial commit => add overrides to support bnb on HPU

* Change quantizer file name

* Added HPU specific checks and updates  (#52)

* supports hpu  nf4 quant/dequant

* added tranformer/quantizers to bitsandbytes

* Deleted transformer/quantizers

* Fix BnB inference (#60)

* update inference test (#63)

* Added 4-bit training script (#69)

* adapted test files with framework

* Adjusted test files format

---------

Co-authored-by: Vivek <vgoel@habana.ai>
xinyu-intel pushed a commit that referenced this pull request Mar 4, 2025
* Initial commit => add overrides to support bnb on HPU

* Change quantizer file name

* Added HPU specific checks and updates  (#52)

* supports hpu  nf4 quant/dequant

* added tranformer/quantizers to bitsandbytes

* Deleted transformer/quantizers

* Fix BnB inference (#60)

* update inference test (#63)

* Added 4-bit training script (#69)

* adapted test files with framework

* Adjusted test files format

---------

Co-authored-by: Vivek <vgoel@habana.ai>