feat(hf-demo): Optimize VRAM usage #40

Artemonim · 2025-07-02T14:29:04Z

Description:

This update introduces significant VRAM optimization for the Gradio demo and improves the user experience with a more reliable completion notification.

VRAM Optimization: Implemented dynamic model swapping between the CPU and GPU. The Segment-Anything-Model (SAM) and the MatAnyone model are no longer kept in VRAM simultaneously. The active model is loaded onto the GPU only when required for its specific task (segmentation or matting), while the inactive model is offloaded to the CPU. This change dramatically reduces the VRAM footprint.
Reduced Hardware Requirements: As a result of the optimization, the Gradio demo can now run on GPUs with approximately 6-7 GB of VRAM, making it accessible to a wider range of users.

Optional

Cross-platform Completion Sound: Standard terminal bell (\a), ensuring reliable and cross-platform audio notification upon task completion.

Refactors the Gradio demo to dynamically swap models (SAM and MatAnyone) between GPU and CPU, significantly lowering VRAM requirements to ~6-7 GB.

Artemonim added 2 commits July 2, 2025 17:24

feat(hf-demo): Optimize VRAM usage

83019cd

Refactors the Gradio demo to dynamically swap models (SAM and MatAnyone) between GPU and CPU, significantly lowering VRAM requirements to ~6-7 GB.

feat(hf-demo): Add completion sound notification

068704b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(hf-demo): Optimize VRAM usage #40

feat(hf-demo): Optimize VRAM usage #40

Uh oh!

Artemonim commented Jul 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat(hf-demo): Optimize VRAM usage #40

Are you sure you want to change the base?

feat(hf-demo): Optimize VRAM usage #40

Uh oh!

Conversation

Artemonim commented Jul 2, 2025

Description:

Optional

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant