updating gptj-config #109
Conversation
examples/ppo_sentiments.py
Outdated
@@ -17,7 +17,7 @@ def get_positive_score(scores):
     return dict(map(lambda x: tuple(x.values()), scores))["POSITIVE"]


-default_config = yaml.safe_load(open("configs/ppo_config.yml"))
+default_config = yaml.safe_load(open("configs/ppo_gptj.yml"))
Are you sure you want to change the default to gpt-j? I wouldn't mind a second script which imports main from here and changes the config, as the simplest option right now, until we switch to hydra or something else.
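A minimal sketch of the "second script" idea: reuse the example's default config and override only the model-specific fields, instead of changing the default. The helper and all keys/values below are illustrative, not trlX's actual schema.

```python
# Sketch: merge nested overrides into a default config (illustrative only).
def with_overrides(default_config, overrides):
    """Return a copy of default_config with nested overrides applied."""
    merged = dict(default_config)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = with_overrides(merged[key], value)
        else:
            merged[key] = value
    return merged

# A hypothetical gptj_sentiments.py could then do:
#   from ppo_sentiments import main, default_config
#   main(with_overrides(default_config,
#                       {"model": {"model_path": "EleutherAI/gpt-j-6B"}}))
```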
Agreed, this is a bad idea.
Yeah I just forgot to change this back.
Added distributed config logging to wandb
configs/ppo_config.yml
Outdated
@@ -21,6 +21,7 @@ train:

   pipeline: "PromptPipeline"  # prompt pipeline to load
   orchestrator: "PPOOrchestrator"  # orchestrator to load
+  entity_name: "dahoas"  # put your wandb login here
This will give a somewhat cryptic error if you're not logged in as dahoas; maybe we can make this an optional environment variable instead?
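A sketch of the optional-environment-variable idea: only pass an entity to wandb when one is explicitly set, otherwise let W&B fall back to whatever account is logged in. The variable name `WANDB_ENTITY` and the helper are assumptions for illustration.

```python
import os

def wandb_init_kwargs(project):
    """Build wandb.init kwargs, adding an entity only if one is set
    (sketch; the WANDB_ENTITY variable name is an assumption)."""
    kwargs = {"project": project}
    entity = os.environ.get("WANDB_ENTITY")  # None when unset
    if entity:
        kwargs["entity"] = entity
    return kwargs
```

With no variable set, `wandb.init(**wandb_init_kwargs("trlx"))` would use the logged-in account instead of a hard-coded one.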
Let's fix this in a future PR.
Have it automatically determine what W&B account is logged in. cc @ayulockin
It already does so; this option is optional and was added in #78.
Otherwise it forces people to wandb disable, as in #106.
trlx/utils/__init__.py
Outdated
    return {
        "mixed_precision": accelerate_config.mixed_precision,
        "num_gpus": accelerate_config.num_processes,
        "gradient_accumulation_steps": ds_plugin.gradient_accumulation_steps,
I think if DeepSpeed is not used then ds_plugin is None, giving an attribute error here:
https://github.com/huggingface/accelerate/blob/e4e5611e5d4270a846caf42cba3388e54b83f074/src/accelerate/state.py#L62
Maybe some processing of repr(accelerator.state) (which is output by accelerate env) would be equivalent here.
I guess it's true that in some cases a user may not be using DeepSpeed, so a check should be performed. However, with the accelerator.state object there are some items I don't care about (such as the local rank).
Updating the gptj config and verifying gptj runs on 8 A100s with zero2
https://wandb.ai/dahoas/trlx/runs/5vc7xsx8?workspace=user-dahoas