avoid calling `gc.collect` and `cuda.empty_cache` by ydshieh · Pull Request #34514 · huggingface/transformers

ydshieh · 2024-10-30T16:18:08Z

What does this PR do?

Let's avoid calling gc.collect and cuda.empty_cache while the tests are running on CPU:

those operations are slow
(actually, in most cases, they are only relevant for integration tests which use large models)

Running on GPT2 tests,

60 seconds on main, 20 seconds on this PR

Rocketknight1

Yes, this seems like a good speed fix! cc @LysandreJik @ArthurZucker for core maintainer review

LysandreJik

Smart! Should a helper method be made that only runs on CPU?

Both the gc.collect and the torch device checks could be moved into the backend_empty_cache method (or an other method that wraps both)

ydshieh · 2024-10-31T09:29:25Z

Yes, a helper method is nice. Will update

ydshieh · 2024-10-31T10:03:53Z

updated.

So far it doesn't call gc.collect() at all (default value False).
I would like to see if this would cause issue.
In general, we don't need to call it after each test method (so in tearDown) as it is slow.

HuggingFaceDocBuilderDev · 2024-10-31T15:45:01Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

* update * update * update * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ydshieh requested a review from Rocketknight1 October 30, 2024 16:44

Rocketknight1 approved these changes Oct 30, 2024

View reviewed changes

ydshieh changed the title ~~Speed no empty~~ avoid calling gc.collect and cuda.empty_cache Oct 30, 2024

ydshieh changed the title ~~avoid calling gc.collect and cuda.empty_cache~~ avoid calling gc.collect and cuda.empty_cache Oct 30, 2024

ydshieh requested review from ArthurZucker and LysandreJik October 31, 2024 08:32

LysandreJik approved these changes Oct 31, 2024

View reviewed changes

ydshieh force-pushed the speed_no_empty branch from 82e3add to 6620320 Compare October 31, 2024 10:01

ydshieh force-pushed the speed_no_empty branch from 6620320 to 4403c5a Compare October 31, 2024 10:19

ydshieh added 5 commits October 31, 2024 16:14

update

7183264

update

d3af1c4

update

7ef02ac

update

86b6744

update

18d6d5d

ydshieh force-pushed the speed_no_empty branch from 1f47700 to 18d6d5d Compare October 31, 2024 15:18

ydshieh merged commit ab98f0b into main Oct 31, 2024

ydshieh deleted the speed_no_empty branch October 31, 2024 15:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

avoid calling `gc.collect` and `cuda.empty_cache`#34514

avoid calling `gc.collect` and `cuda.empty_cache`#34514
ydshieh merged 5 commits intomainfrom
speed_no_empty

ydshieh commented Oct 30, 2024 •

edited

Loading

Uh oh!

Rocketknight1 left a comment

Uh oh!

LysandreJik left a comment

Uh oh!

ydshieh commented Oct 31, 2024

Uh oh!

ydshieh commented Oct 31, 2024

Uh oh!

HuggingFaceDocBuilderDev commented Oct 31, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ydshieh commented Oct 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

Rocketknight1 left a comment

Choose a reason for hiding this comment

Uh oh!

LysandreJik left a comment

Choose a reason for hiding this comment

Uh oh!

ydshieh commented Oct 31, 2024

Uh oh!

ydshieh commented Oct 31, 2024

Uh oh!

HuggingFaceDocBuilderDev commented Oct 31, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ydshieh commented Oct 30, 2024 •

edited

Loading