Update readme files for explicit lazy mode#1921
Conversation
|
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
libinta
left a comment
There was a problem hiding this comment.
can you also check if the fast/slow test needs to be changed to add this env?
|
@jasi306 , Please resolve conflicts. Also did you already check if hpu graphs are turned on then the lazy mode flag is turned on for any tests in the readme. |
|
@jasi306 , Please resolve conflicts. |
1 similar comment
|
@jasi306 , Please resolve conflicts. |
|
@jasi306 CC: @libinta @dsocek leads to |
e5cafb6 to
858d09e
Compare
I believe the conflicts are resolved now. |
@jasi306 is OOO at this moment. I'll try to follow up. |
858d09e to
39e9089
Compare
In recent versions of the Gaudi software stack, lazy mode has been disabled by default. This commit explicitly sets the PT_HPU_LAZY_MODE=1 flag to enable it back. - add PT_HPU_LAZY_MODE=1 flag - add missing bash/python specification to markdown blocks - remove redundant PT_HPU_LAZY_MODE=0 flag Author: Jan Kaminski <jkaminski@habana.ai> Co-authored-by: Karol Brejna <karol.brejna@intel.com> Co-authored-by: Piotr Bielak <piotr.bielak@intel.com>
39e9089 to
cad3eb0
Compare
|
@regisss kind reminder :) |
Co-authored-by: Karol Brejna <karol.brejna@intel.com> Co-authored-by: Piotr Bielak <piotr.bielak@intel.com>
Co-authored-by: Karol Brejna <karol.brejna@intel.com> Co-authored-by: Piotr Bielak <piotr.bielak@intel.com>
Co-authored-by: Karol Brejna <karol.brejna@intel.com> Co-authored-by: Piotr Bielak <piotr.bielak@intel.com>
Co-authored-by: Karol Brejna <karol.brejna@intel.com> Co-authored-by: Piotr Bielak <piotr.bielak@intel.com>
Co-authored-by: Karol Brejna <karol.brejna@intel.com> Co-authored-by: Piotr Bielak <piotr.bielak@intel.com>
* Merge v1.18-release * Hot fix regional compilation (huggingface#2005) Co-authored-by: regisss <15324346+regisss@users.noreply.github.com> * Enable mixtral 8x7b accuracy evaluation (huggingface#1986) Co-authored-by: Rafal <rbogdanowicz@habana.ai> * Update readme files for explicit lazy mode (huggingface#1921) Co-authored-by: Karol Brejna <karol.brejna@intel.com> Co-authored-by: Piotr Bielak <piotr.bielak@intel.com> * [llama-vision] Remove token_idx_cpu parameter (huggingface#2018) Integer parameter token_idx_cpu passed to mllama's forward() method caused an issue with hpu graph cache which led to performance drop. Signed-off-by: Urszula <urszula.golowicz@intel.com> * Update README examples (huggingface#2020) * Fix examples in README audio-classification: - add space between "False" and backslash image-to-text: - add "datasets" to requirements.txt pytorch-image-models: - add "datasets" to requirements.txt sentence-transformers-training/nli: - add command to properly discover HABANA_VISIBLE_MODULES sentence-transformers-training/sts: - add command to properly discover HABANA_VISIBLE_MODULES speech-recognition: - add `--trust_remote_code` for seq2seq examples stable-diffusion/training: - add missing OpenCV requirement for ControlNet Training Co-authored-by: Karol Brejna <karol.brejna@intel.com> * Review fixes: remove grabbing all modules --------- Co-authored-by: Karol Brejna <karol.brejna@intel.com> Co-authored-by: karol-brejna-i <karolbrejna@apache.org> * Pin latest optimum to force mutual updates (huggingface#2016) pin latest optimum to force mutual updates * Fix FP8 support and address related issues (huggingface#2010) - Resolve bugs related to FP8 (floating point 8-bit) computation - Improve stability and correctness of FP8 operations - Add/fix tests to validate FP8 functionality - Update relevant documentation and comments Co-authored-by: IlyasMoutawwakil --------- Signed-off-by: Urszula <urszula.golowicz@intel.com> Co-authored-by: Adam Stachowicz <astachowicz@habana.ai> Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com> Co-authored-by: regisss <15324346+regisss@users.noreply.github.com> Co-authored-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com> Co-authored-by: Rafal <rbogdanowicz@habana.ai> Co-authored-by: Jan Kamiński <jkaminski@habana.ai> Co-authored-by: Karol Brejna <karol.brejna@intel.com> Co-authored-by: Piotr Bielak <piotr.bielak@intel.com> Co-authored-by: Urszula Golowicz <urszula.golowicz@intel.com> Co-authored-by: Piotr Bielak <pbielak@users.noreply.github.com> Co-authored-by: karol-brejna-i <karolbrejna@apache.org>
What does this PR do?
Fixes # (issue)
Before submitting