
Conversation


@deanq deanq commented Aug 15, 2025

Introduces download-acceleration support in the Tetra runtime. This also speeds up remote-execution startup times by pre-caching pip dependencies and HuggingFace models.

  • Parallel downloads for large files (see the sketch after this list)
  • Accelerated downloads applied to known large pip libraries and HuggingFace models
  • Smart caching of HuggingFace models in the container or on a network volume
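
For context, acceleration of this kind typically splits a large file into byte ranges and fetches them concurrently. The following is a minimal, hypothetical sketch of that idea (function name and chunking strategy are assumptions, and it presumes the server supports Content-Length and Range requests); the actual accelerator lives in the worker runtime, see the linked issue below:

import concurrent.futures
import requests

def download_parallel(url: str, dest: str, chunks: int = 8) -> None:
    """Fetch `url` into `dest` using `chunks` parallel ranged GETs."""
    # Ask the server for the total size up front.
    size = int(requests.head(url, allow_redirects=True).headers["Content-Length"])
    step = size // chunks

    def fetch(i: int) -> tuple[int, bytes]:
        start = i * step
        end = size - 1 if i == chunks - 1 else start + step - 1
        resp = requests.get(url, headers={"Range": f"bytes={start}-{end}"})
        resp.raise_for_status()
        return start, resp.content

    # Write each chunk at its offset as it arrives.
    with open(dest, "wb") as f, concurrent.futures.ThreadPoolExecutor(chunks) as pool:
        for start, data in pool.map(fetch, range(chunks)):
            f.seek(start)
            f.write(data)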

Adds new accelerate_downloads and hf_models_to_cache parameters to the @remote decorator, with full backward compatibility.

# Accelerated downloads are on by default. To turn it off...
from tetra_rp import remote, LiveServerless

gpu_config = LiveServerless(name="my_server")

@remote(
    resource_config=gpu_config,
    dependencies=["vllm"],
    accelerate_downloads=False,
)
...
# Cache the model in the container after the first download
from tetra_rp import remote, LiveServerless

gpu_config = LiveServerless(name="my_server")

@remote(
    resource_config=gpu_config,
    dependencies=["vllm"],
    hf_models_to_cache=["facebook/opt-125m"],
)
...
# Cache the model on the network volume after the first download
from tetra_rp import remote, LiveServerless, NetworkVolume

gpu_config = LiveServerless(
    name="my_server",
    networkVolume=NetworkVolume(name="my_volume")
)

@remote(
    resource_config=gpu_config,
    dependencies=["diffusers", "torch", "transformers", "accelerate", "xformers"],
    hf_models_to_cache=["runwayml/stable-diffusion-v1-5"]
)
...

Related to runpod-workers/worker-tetra#22

deanq added 6 commits August 15, 2025 16:53
- feat: add pydantic dependency and bump to v0.10.0
- feat: extend protobuf protocol for download acceleration
- feat: implement download acceleration in client interface
- feat: update class execution system for download acceleration
- feat: update stubs to support download acceleration parameters
- test: update tests for download acceleration compatibility

@pandyamarut pandyamarut left a comment


/LGTM

@deanq deanq merged commit e47c9e3 into main Aug 19, 2025
7 checks passed
@deanq deanq deleted the deanq/ae-1075-download-accelerator branch August 19, 2025 00:17
@github-actions github-actions bot mentioned this pull request Aug 19, 2025
pandyamarut pushed a commit that referenced this pull request Sep 9, 2025
…ls (#83)

* feat: add pydantic dependency and bump to v0.10.0

- Add pydantic>=2.0.0 for enhanced protocol models
- Version bump to 0.10.0 for download acceleration feature

* feat: extend protobuf protocol for download acceleration

- Add accelerate_downloads and hf_models_to_cache fields to FunctionRequest
- Enhance Pydantic models with improved type annotations and documentation
- Maintain backward compatibility with existing protocol
- Support HuggingFace model pre-caching for faster inference startup
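
A minimal sketch of how the two new fields might sit on the Pydantic request model (the surrounding fields and exact defaults are assumptions, chosen to match the backward-compatibility note above):

from typing import List, Optional
from pydantic import BaseModel

class FunctionRequest(BaseModel):
    function_name: str                              # assumed existing field
    # ...other existing request fields elided...
    accelerate_downloads: bool = True               # new; acceleration is on by default
    hf_models_to_cache: Optional[List[str]] = None  # new; HF models to pre-cache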

* feat: implement download acceleration in client interface

- Add accelerate_downloads and hf_models_to_cache parameters to @Remote decorator
- Update function and class decoration to pass acceleration options
- Extend docstring with comprehensive parameter documentation
- Enable HuggingFace model pre-caching through decorator configuration

* feat: update class execution system for download acceleration

- Add acceleration parameters to create_remote_class function
- Store acceleration settings in RemoteClassWrapper instances
- Pass acceleration options through to remote execution requests
- Maintain compatibility with existing class decoration patterns
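
A hypothetical sketch of how the wrapper might carry these settings (create_remote_class and RemoteClassWrapper are named in the commit; the constructor signature and attribute names are assumptions):

class RemoteClassWrapper:
    def __init__(self, cls, resource_config,
                 accelerate_downloads: bool = True,
                 hf_models_to_cache: list[str] | None = None):
        self.cls = cls
        self.resource_config = resource_config
        # Stored here so every remote call on the class can include
        # them in its execution request.
        self.accelerate_downloads = accelerate_downloads
        self.hf_models_to_cache = hf_models_to_cache or []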

* feat: update stubs to support download acceleration parameters

- Extend prepare_request methods to accept acceleration parameters
- Update request building to include new acceleration fields
- Maintain consistency across execution pathways
- Preserve existing stub interface contracts
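
A hypothetical shape for the extended stub method (the stub class name is invented for illustration, real signatures may differ; FunctionRequest as sketched above):

class ServerlessStub:
    def prepare_request(self, function_name: str,
                        accelerate_downloads: bool = True,
                        hf_models_to_cache: list[str] | None = None) -> FunctionRequest:
        # Forward the acceleration options into the request payload.
        return FunctionRequest(
            function_name=function_name,
            accelerate_downloads=accelerate_downloads,
            hf_models_to_cache=hf_models_to_cache,
        )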

* test: update tests for download acceleration compatibility

- Update create_remote_class calls to include new acceleration parameters
- Ensure all existing tests pass with enhanced function signatures
- Add proper parameter defaults for backward compatibility
- Maintain test coverage for class execution patterns