Conversation

@dtrifiro

If files are local, there's no need to go through the (optional) blobfile dependency; we can just read them with the standard open/read API.

This fixes failures when loading files that have already been downloaded to the HF cache.

See for example vllm-project/vllm#21750
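For illustration, the core of the proposed change is roughly the following sketch (not the exact diff in this PR): skip blobfile entirely when the path has no URL scheme.

def read_file(blobpath: str) -> bytes:
    # a path without :// is a plain local path, e.g. a file already
    # in the HF cache, so read it with the standard file API
    if "://" not in blobpath:
        with open(blobpath, "rb") as f:
            return f.read()

    # only remote blobs need the optional blobfile dependency
    import blobfile

    with blobfile.BlobFile(blobpath, "rb") as f:
        return f.read()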

@dtrifiro dtrifiro force-pushed the make-blobfile-optional branch from 06cb4c1 to 7f6835e on September 1, 2025 13:27
@tiran

tiran commented Sep 1, 2025

@dtrifiro here is a better version that also handles file:// URIs correctly:

import urllib.parse

def read_file(blobpath: str) -> bytes:
    # convert file:// URI to local file path
    if blobpath.startswith("file://"):
        parsed = urllib.parse.urlparse(blobpath)
        if parsed.hostname and parsed.hostname != "localhost":
            # must be None, empty, or localhost
            raise ValueError(f"invalid file URI {blobpath}")
        blobpath = urllib.parse.unquote(parsed.path)

    # a path without :// is a local path
    if "://" not in blobpath:
        with open(blobpath, "rb") as f:
            return f.read()

    if blobpath.startswith(("http://", "https://")):
        # avoiding blobfile for public files helps avoid auth issues, like MFA prompts.
        import requests

        resp = requests.get(blobpath)
        resp.raise_for_status()
        return resp.content

    try:
        import blobfile
    except ImportError as e:
        raise ImportError(
            "blobfile is not installed. Please install it by running `pip install blobfile`."
        ) from e
    with blobfile.BlobFile(blobpath, "rb") as f:
        return f.read()
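For example, the file:// branch round-trips percent-encoded local paths, and non-local hosts are rejected (the path below is illustrative, any existing local file works):

import urllib.parse

path = "/tmp/cl100k_base.tiktoken"  # illustrative local file
uri = "file://" + urllib.parse.quote(path)
assert read_file(uri) == read_file(path)

# raises ValueError("invalid file URI ..."):
# read_file("file://example.com/etc/hosts")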

@dtrifiro

dtrifiro commented Sep 2, 2025

Updated as suggested

@dtrifiro dtrifiro closed this Sep 3, 2025
@dtrifiro dtrifiro reopened this Sep 4, 2025
@dtrifiro

Hey @hauntsaninja, mind taking a look if you have some bandwidth?

Thanks!

mkumatag pushed a commit to mkumatag/vllm-cpu that referenced this pull request Sep 23, 2025
Deployment of Kimi models fails because of a missing `blobfile` dependency. We don't really need it, so we can add this fix
(openai/tiktoken#446) to load local files directly when they are present.
mkumatag pushed a commit to mkumatag/vllm-cpu that referenced this pull request Sep 23, 2025
- Dockerfile.rocm.ubi: add aiter dependencies, keep aiter disabled by default
- Dockerfile*.ubi: fix tiktoken patch step (upstream:
openai/tiktoken#446)
- docker-bake: add note about `ROCM_VERSION` variable
- Dockerfile.rocm.ubi: remove deprecated
`VLLM_WHEEL_STRATEGY`/`FLASH_ATTENTION_WHEEL_STRATEGY` args
@dtrifiro

dtrifiro commented Oct 3, 2025

@dtrifiro dtrifiro closed this Oct 3, 2025
npanpaliya pushed a commit to odh-on-pz/vllm-cpu that referenced this pull request Oct 27, 2025
npanpaliya pushed a commit to odh-on-pz/vllm-cpu that referenced this pull request Oct 27, 2025