
Conversation

@clefourrier (Member) commented on Aug 8, 2025:

Todos
Predictions

  • feat: adding the cache system
  • test: testing prediction with accelerate
  • feat: making the system lighter by loading cached inputs after processing the other ones (probably with an index system; see the sketch after this list)
  • feat: adding the system for all models
  • fix: change cache path to ~/.cache after debugging
  • test: adding a test suite for all models

We'll need to tokenize inputs later.
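
As a rough illustration of the index-first idea above, here is a minimal sketch (all names and paths are hypothetical, not the PR's actual implementation): keep only an index of sample hashes in memory, run the model on cache misses, and load cached payloads from disk only after processing.

```python
import hashlib
import json
from pathlib import Path


class SampleCache:
    """Hypothetical on-disk sample cache with an in-memory index."""

    def __init__(self, cache_dir: str = "~/.cache/lighteval-sketch"):
        self.cache_dir = Path(cache_dir).expanduser()
        self.cache_dir.mkdir(parents=True, exist_ok=True)
        # Only the index (file stems = sample hashes) is kept in memory,
        # never the cached payloads themselves.
        self.index = {p.stem for p in self.cache_dir.glob("*.json")}

    def _key(self, doc: str) -> str:
        return hashlib.sha256(doc.encode("utf-8")).hexdigest()

    def split(self, docs: list[str]) -> tuple[list[str], list[str]]:
        """Separate cache hits from misses using only the index."""
        hits = [d for d in docs if self._key(d) in self.index]
        misses = [d for d in docs if self._key(d) not in self.index]
        return hits, misses

    def store(self, doc: str, response: dict) -> None:
        key = self._key(doc)
        (self.cache_dir / f"{key}.json").write_text(json.dumps(response))
        self.index.add(key)

    def load(self, doc: str) -> dict:
        return json.loads((self.cache_dir / f"{self._key(doc)}.json").read_text())
```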

@HuggingFaceDocBuilderDev (Collaborator) commented:
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@clefourrier changed the title from "Caching samples PR (ongoing)" to "Caching samples PR" on Aug 11, 2025
GenerativeResponse,
LoglikelihoodResponse,
LoglikelihoodSingleTokenResponse,
ModelResponse,
@clefourrier (Member, Author) commented:
Clean up imports, unrelated to the PR


    config = yaml.safe_load(f)["model_parameters"]
else:
    # We extract the model args
    config: dict = ModelConfig._parse_args(model_args)
@clefourrier (Member, Author) commented:
Clean up unused params to simplify tests

def loglikelihood_rolling(self, docs: list[Doc], override_bs=None) -> list[ModelResponse]:
    return self._loglikelihood(docs, rolling=True)

def _loglikelihood(self, docs: list[Doc], rolling: bool = False) -> list[ModelResponse]:
@clefourrier (Member, Author) commented:
Grouped the logic of both functions, and separated the public API function from the implementation so that changes to the core logic don't break people's pipelines.
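
A self-contained sketch of that pattern, with stand-in types for lighteval's `Doc` and `ModelResponse` (the real classes have more fields):

```python
from dataclasses import dataclass


@dataclass
class Doc:  # stand-in for lighteval's Doc
    query: str


@dataclass
class ModelResponse:  # stand-in for lighteval's ModelResponse
    logprob: float


class Model:
    # Thin public API methods delegate to one private implementation, so the
    # public signatures stay stable even if the core logic changes.
    def loglikelihood(self, docs: list[Doc]) -> list[ModelResponse]:
        return self._loglikelihood(docs, rolling=False)

    def loglikelihood_rolling(self, docs: list[Doc]) -> list[ModelResponse]:
        return self._loglikelihood(docs, rolling=True)

    def _loglikelihood(self, docs: list[Doc], rolling: bool = False) -> list[ModelResponse]:
        # Shared implementation; `rolling` would toggle whole-sequence scoring.
        return [ModelResponse(logprob=0.0) for _ in docs]
```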

)

@cached("predictions")
def loglikelihood(self, requests: List[LoglikelihoodRequest]) -> List[LoglikelihoodResponse]:
@clefourrier (Member, Author) commented:
Removed the single-token loglikelihood since it should have been removed with the refactor, plus some cleanup of imports.
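
For context, a hypothetical sketch of what a `@cached(...)` decorator like the one added here could look like; the key derivation, cache path, and pickle format are all assumptions, not the PR's actual code.

```python
import functools
import hashlib
import pickle
from pathlib import Path


def cached(cache_name: str):
    """Hypothetical decorator: serve model responses from an on-disk cache
    keyed by the request list, filling the cache on misses."""

    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(self, requests):
            cache_dir = Path("~/.cache/lighteval-sketch").expanduser() / cache_name
            cache_dir.mkdir(parents=True, exist_ok=True)
            # Assumed: requests have a stable string form usable as a key.
            key = hashlib.sha256(repr(requests).encode("utf-8")).hexdigest()
            cache_file = cache_dir / f"{key}.pkl"
            if cache_file.exists():
                return pickle.loads(cache_file.read_bytes())
            responses = fn(self, requests)
            cache_file.write_bytes(pickle.dumps(responses))
            return responses

        return wrapper

    return decorator
```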

Attributes:
    base_model (str):
        HuggingFace Hub model ID or path to the base model. This is the original
@clefourrier (Member, Author) commented:
removed unused params

during loading to reconstruct the full fine-tuned model.
Attributes:
    base_model (str):
@clefourrier (Member, Author) commented:
Removed an unused param

@clefourrier merged commit df3a82d into main on Aug 11, 2025 (4 of 6 checks passed).
NathanHB pushed a commit that referenced this pull request on Sep 19, 2025:

Adds a new caching system for generative evals, plus a test suite and a doc page. The system loads indices first, then runs samples as needed, and lastly loads the cached items as needed (we don't keep the cache in memory when running models).
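
Read as pseudocode, the flow in that commit message maps onto the SampleCache sketch earlier in the thread; `cache`, `docs`, and `run_model` are illustrative names, not the PR's API.

```python
# 1. Load only the index, then split the docs into cache hits and misses.
hits, misses = cache.split(docs)

# 2. Run the model on the misses only, storing responses as they come;
#    cached payloads are never held in memory while the model runs.
for doc, response in zip(misses, run_model(misses)):
    cache.store(doc, response)

# 3. Lastly, load the cached items needed to assemble the final results.
results = {doc: cache.load(doc) for doc in hits + misses}
```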