Dynamic Temperature HF loader support by kalomaze · Pull Request #5174 · oobabooga/textgen

kalomaze · 2024-01-05T07:18:18Z

A rough WIP backporting my Dynamic Temperature sampling method [which has gained some mild traction again recently] to the HF loaders.

Currently it doesn't take minTemp and maxTemp as proper arguments, so it's only using 0.0 minTemp and 2.0 maxTemp for now ✅
It's also hardcoded to trigger only if the "dynatemp" UI variable is above 0.8. This should be a bool switch and the UI should also have minTemp + maxTemp configurable ✅
Right now it runs after the truncation samplers always, but it should respect the "temperature_last" argument and come either first or last depending on that option's value ✅
Max Possible Entropy measurement will have to be changed to exclude all tokens set to -inf probability when estimating 'vocab size' in cases where truncation is used, in order for the measurement to match koboldcpp's behavior ✅

EDIT: PR is ready now. It functions as a range instead of minTemp and maxTemp. Set to zero to disable

Merge dev branch

- Currently doesn't take minTemp and maxTemp as proper arguments, so it's hardcoded to 0.0 minTemp and 2.0 maxTemp for now - Atm it's hardcoded to only trigger if the "dynatemp" UI variable is above 0.8. This should be a bool and the UI should also have minTemp and maxTemp - For obvious reasons, the regular temperature shouldn't apply when Dynamic Temp is on either - Right now it runs after the truncation samplers always, but it should respect the "temperature_last" argument and come either first or last depending on that bool

BadisG · 2024-01-05T11:06:20Z

Can you increase the max temp? For highly confident models like Mixtral you can go up to 5 without any issues.

oobabooga · 2024-01-05T14:12:19Z

Thank you for the PR. I think that a good solution would be to monkey patch the original TemperatureLogitsWarper in the transformers library, similar to how RepetitionPenaltyLogitsProcessor is monkeypatched to add support for repetition_penalty_range. Something like this:

--- a/modules/sampler_hijack.py
+++ b/modules/sampler_hijack.py
@@ -233,6 +233,7 @@ def get_logits_warper_patch(self, generation_config):
         temperature_idx = None
         for i in range(len(warpers)):
             if warpers[i].__class__.__name__ == 'TemperatureLogitsWarper':
+                warpers[i] = TemperatureLogitsWarperWithDynatemp(generation_config.temperature)
                 temperature_idx = i
                 break

Then TemperatureLogitsWarperWithDynatemp can be a modified version of
https://github.com/huggingface/transformers/blob/57e9c8321385dfd31bda33df144a4ac849206e06/src/transformers/generation/logits_process.py#L221

that defaults to regular temperature when the relevant parameters are not set.

BadisG · 2024-01-05T17:08:41Z

I tried (with a fixed seed) different values of DynaTemp and I always got the same outputs, dunno if DynaTemp is working as intended there.

Merge dev branch

kalomaze · 2024-01-06T03:43:54Z

I tried (with a fixed seed) different values of DynaTemp and I always got the same outputs, dunno if DynaTemp is working as intended there.

As described in the main post, the UI value is irrelevant atm and was just there for testing.

There was also a proposal to turn Dynamic Temp into one value, a range.

So, let's say your regular temp is 1.0, and your DynaTemp range is 0.5, the minimum Temp would become Temp - 0.5 (0.5), and the maximum temp would be Temp + 0.5 (1.5). That way instead of "ignoring" your regular temp value, it simply augments it.

So if you wanted minTemp = 0 and maxTemp = 5 you would set the regular temp to 2.5 and the range to 2.5, and so on...

Thoughts? This makes sense to me, and makes it a simple value that is disabled when you turn it to zero.

BadisG · 2024-01-06T03:51:31Z

Both solutions give the same results in the end, but I still prefer to choose the range myself, it's clearer for the user who can see exactly what the limits are, and it dissociates the normal temperature which shouldn't be involved in the dynamic temperature in my opinion.

oobabooga · 2024-01-06T05:07:45Z

Agreed with BadisG, having explicit temperatures makes things more interpretable. Maybe have a dynamic_temperature boolean parameter (false by default) and a dynatemp_low parameter that sets the minimum value? Then the existing temperature parameter becomes the maximum value when dynamic_temperature is true.

The use case would be to tick dynamic_temperature in the UI, set dynatemp_low to something low like 0.1, and set the main temperature to a higher value like 1.5. It would be good to have a preset under presets/ with values that have been tested by other people to work well (I assume combined with temperature_last and min_p).

kalomaze · 2024-01-06T05:43:13Z

Agreed with BadisG, having explicit temperatures makes things more interpretable. Maybe have a dynamic_temperature boolean parameter (false by default) and a dynatemp_low parameter that sets the minimum value? Then the existing temperature parameter becomes the maximum value when dynamic_temperature is true.

The use case would be to tick dynamic_temperature in the UI, set dynatemp_low to something low like 0.1, and set the main temperature to a higher value like 1.5. It would be good to have a preset under presets/ with values that have been tested by other people to work well (I assume combined with temperature_last and min_p).

The reasoning for not doing this idea when I asked concedo was that people who want Dynamic Temp turned off would mistakenly believe that 0.0 dynatemp_low is turning it off.

I think it would be best to either dissociate the regular Temperature altogether as originally proposed in this PR, or set a single range value which can be set to 0.0 which would make dynamic temp not trigger, and therefore wouldn't require a bool.

The range value might be smarter because it's just one extra value that is set to 0 to disable the dynamicism.

oobabooga · 2024-01-06T05:57:38Z

The reasoning for not doing this idea when I asked concedo was that people who want Dynamic Temp turned off would mistakenly believe that 0.0 dynatemp_low is turning it off.

dynamic_temperature and dynatemp_low could be placed near each other in the UI, and a hint saying Only used when dynamic_temperature is checked could be added under dynatemp_low, so I don't see that as a problem.

kalomaze · 2024-01-06T06:11:11Z

The reasoning for not doing this idea when I asked concedo was that people who want Dynamic Temp turned off would mistakenly believe that 0.0 dynatemp_low is turning it off.

dynamic_temperature and dynatemp_low could be placed near each other in the UI, and a hint saying Only used when dynamic_temperature is checked could be added under dynatemp_low, so I don't see that as a problem.

I would rather just have minTemp and maxTemp there at that point and go all the way tbh. That way we avoid the monkeypatch

more work to be done elsewhere

Last commit also ensured the default value is zero for dynatemp

kalomaze · 2024-01-06T12:32:12Z

This is ready to merge. Only thing that might need changing is removal of the print statements that I had for debugging.

kalomaze · 2024-01-06T13:33:30Z

This might have a bug actually. The entropy calculation shouldn't change if the temperature value changes because the dynamic temp effect hasn't been applied by the time the print statement prints out the entropy.

But it's doing that on my end. I'm not sure what's wrong, I thought I ensured that the original temp function doesn't get ran, but I guess not. Trying to resolve.

kalomaze · 2024-01-06T13:52:43Z

It seems like I resolved the issue with the latest commit. Before, it was possible for both the regular Temperature and Dynamic Temperature to run, when Dynamic Temp is supposed to take the base value and modify it, not run the original function first.

Now, it forcibly removes the original Temperature function if DynaTemp is above 0, as intended.

SillyTavern's DynaTemp option only works for koboldcpp at the moment, so I think the only thing left is adjusting that for the API, everything seems to work when it comes to the ooba side of things.

If you spot any potential issues @oobabooga let me know, but it seems ready now.

Ph0rk0z · 2024-01-06T17:29:03Z

FWIW it has still been working in exllamaV2 for me still: https://pastebin.com/h259DiUz

Eager to try now in other loaders. The debug stuff is interesting but definitely not something I like keeping on. Also vote for high and low values being exposed, even when they were in the TXT it is good to set upper bounds. I remember using it and getting really low temperatures and having to set a higher minimum.

BadisG · 2024-01-07T03:45:17Z

I tried temperature 3 + dynamtemp 2 (to get a temp range between 1 and 5) but it doesn't work, all I get is this

Output generated in 0.33 seconds (0.00 tokens/s, 0 tokens, context 55, seed 2054783615)

kalomaze · 2024-01-07T03:52:46Z

I tried temperature 3 + dynamtemp 2 (to get a temp range between 1 and 5) but it doesn't work, all I get is this
Output generated in 0.33 seconds (0.00 tokens/s, 0 tokens, context 55, seed 2054783615)

I think there's a weird bug when the value is exactly a whole number and not a decimal point. Try 1.99 Dynatemp and 2.99 Temp.
I really don't know how or why this happens. @oobabooga any clue?

oobabooga · 2024-01-07T07:51:51Z

I have made the following changes:

Join temperature and dynamic temperature into a single sampler
Add an extension for using minimum/maximum temperature explictly:

I think there's a weird bug when the value is exactly a whole number and not a decimal point. Try 1.99 Dynatemp and 2.99 Temp.
I really don't know how or why this happens. @oobabooga any clue?

I have experienced this 0 tokens generated artifact when the temperature is too high. It may be a bug in the transformers library; if using an integer value is the issue, we could add 1e-6 to the temperature in these cases.

oobabooga · 2024-01-07T13:26:14Z

The 0.00 tokens/second bug was a silent exception that is now fixed:

    raise ValueError(except_msg)
ValueError: `temperature` (=1) has to be a strictly positive float, otherwise your next token scores will be invalid.

I also fixed a bug with getting the logits when a prefix-match happens in llamacpp_HF. It may have resolved #5186.

Everything seems to be working well now, so it should be good to merge.

BadisG · 2024-01-07T13:46:28Z

I tried to activate the "dynatemp_with_range" extension but it doesn't seem to work, I still have the dynatemp value only and nothing else.

Edit: Oh ok I see it on the markdown and the chat, why can't it be on the Parameters -> Generation tab instead? It's a bit confusing because the dynamtemp is still there and it's clashing with the other way of doing it.

The extension could simply have removed the dynamtemp value from the user interface and replaced it with MinT | MaxT instead.

I'm not a big fan of having one MinP - MaxP on the markdown (for the notebook) and one for the chat, I'd want to save those values into my own samplers preset like every other samplers.

oobabooga · 2024-01-07T19:58:46Z

Yeah, I agree that it's pretty annoying to work with a dynamic temperature range. It's better to be able to set the low and high value directly.

I have removed the extension and changed the main parameters to dynamic_temperature (on/off) and dynamic_temperature_low here: #5198. The high value is the regular temperature.

--------- Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>

oobabooga and others added 18 commits December 14, 2023 22:39

Merge pull request oobabooga#4927 from oobabooga/dev

c3e0fcf

Merge dev branch

Merge pull request oobabooga#4937 from oobabooga/dev

443be39

Merge dev branch

Merge pull request oobabooga#4961 from oobabooga/dev

7be0983

Merge dev branch

Merge pull request oobabooga#4980 from oobabooga/dev

b28020a

Merge dev branch

Merge pull request oobabooga#4988 from oobabooga/dev

781367b

Merge dev branch

Merge pull request oobabooga#5002 from oobabooga/dev

71eb744

Merge dev branch

Merge pull request oobabooga#5005 from oobabooga/dev

5b791ca

Merge dev branch

Merge pull request oobabooga#5011 from oobabooga/dev

c1f78db

Merge dev branch

Merge pull request oobabooga#5012 from oobabooga/dev

489f4a2

Merge dev branch

Merge pull request oobabooga#5022 from oobabooga/dev

11288d1

Merge dev branch

Merge pull request oobabooga#5039 from oobabooga/dev

4b25acf

Merge dev branch

Merge pull request oobabooga#5073 from oobabooga/dev

af87609

Merge dev branch

Merge pull request oobabooga#5078 from oobabooga/dev

19d1374

Merge dev branch

Merge pull request oobabooga#5100 from oobabooga/dev

3fd7073

Merge dev branch

Merge pull request oobabooga#5132 from oobabooga/dev

3e3a66e

Merge dev branch

Merge pull request oobabooga#5152 from oobabooga/dev

3f28925

Merge dev branch

Merge pull request oobabooga#5163 from oobabooga/dev

c54d1da

Merge dev branch

Merge pull request oobabooga#5181 from oobabooga/dev

8ea3f31

Merge dev branch

kalomaze added 2 commits January 6, 2024 02:16

update max

07929de

more work to be done elsewhere

Initiate float to store temp + preliminary fixes

00ed020

kalomaze added 4 commits January 6, 2024 05:50

Properly ensure div by zero for max entropy calc

25f3cab

Last commit also ensured the default value is zero for dynatemp

Update UI description

ae476d6

Fix tiny merge conflict

529daac

Sync Dynatemp branch with latest mainline ooba

85828d8

Attempt to fix duplicated Temperature logic

1cc7a14

oobabooga added 7 commits January 6, 2024 22:23

Lint

44e8a92

Use a single warper for temperature and dynamic temperature

941d257

Comment the debug statements

33821b0

Various minor changes

4849c57

Minor changes

951b268

Always replace temperature with TemperatureLogitsWarperWithDynatemp

4023be2

Add an extension for dynamic temperature with range

2fc441f

oobabooga added 3 commits January 7, 2024 05:02

Fix silent exception when temperature is int

6306927

Fix a logits issue with llamacpp_HF

ba65b3c

Add a Dynamic Temperature preset

aa78dfd

Document the new extension

09d5dd7

oobabooga changed the base branch from main to dev January 7, 2024 13:34

oobabooga merged commit 48327cc into oobabooga:dev Jan 7, 2024

oobabooga mentioned this pull request Jan 7, 2024

Add dynamic_temperature_low parameter #5198

Merged

PoetOnTheRun pushed a commit to PoetOnTheRun/text-generation-webui that referenced this pull request Feb 22, 2024

Dynamic Temperature HF loader support (oobabooga#5174)

bb6890c

--------- Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>

Conversation

kalomaze commented Jan 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

BadisG commented Jan 5, 2024

Uh oh!

oobabooga commented Jan 5, 2024

Uh oh!

BadisG commented Jan 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kalomaze commented Jan 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

BadisG commented Jan 6, 2024

Uh oh!

oobabooga commented Jan 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kalomaze commented Jan 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

oobabooga commented Jan 6, 2024

Uh oh!

kalomaze commented Jan 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kalomaze commented Jan 6, 2024

Uh oh!

kalomaze commented Jan 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kalomaze commented Jan 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Ph0rk0z commented Jan 6, 2024

Uh oh!

BadisG commented Jan 7, 2024

Uh oh!

kalomaze commented Jan 7, 2024

Uh oh!

oobabooga commented Jan 7, 2024

Uh oh!

oobabooga commented Jan 7, 2024

Uh oh!

BadisG commented Jan 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

oobabooga commented Jan 7, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

kalomaze commented Jan 5, 2024 •

edited

Loading

BadisG commented Jan 5, 2024 •

edited

Loading

kalomaze commented Jan 6, 2024 •

edited

Loading

oobabooga commented Jan 6, 2024 •

edited

Loading

kalomaze commented Jan 6, 2024 •

edited

Loading

kalomaze commented Jan 6, 2024 •

edited

Loading

kalomaze commented Jan 6, 2024 •

edited

Loading

kalomaze commented Jan 6, 2024 •

edited

Loading

BadisG commented Jan 7, 2024 •

edited

Loading