feat: automatic model card generation on save #3857

Merged
merged 36 commits into from
Oct 16, 2023

Conversation

Contributor

@plaguss plaguss commented Sep 28, 2023

Description

This PR adds the option to generate a model card from the ArgillaTrainer when calling the save method.
Closes #3634
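
For context, a minimal sketch of the intended usage, assuming an Argilla 1.x-style feedback trainer (the save/card behavior shown is illustrative, not necessarily the final merged API):

from argilla.feedback import ArgillaTrainer

# Hypothetical usage sketch; argument names follow the Argilla feedback-trainer
# docs of this era, but the model card behavior on save is illustrative.
trainer = ArgillaTrainer(
    dataset=dataset,   # a FeedbackDataset prepared for training
    task=task,         # e.g. a text-classification training task
    framework="setfit",
)
trainer.train(output_dir="my_model")
trainer.save("my_model")  # now also writes a README.md model card next to the weights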

Type of change

(Please delete options that are not relevant. Remember to title the PR according to the type of change)

  • New feature (non-breaking change which adds functionality)
  • Refactor (change restructuring the codebase without changing functionality)
  • Improvement (change adding some improvement to an existing functionality)

How Has This Been Tested

(Please describe the tests that you ran to verify your changes. And ideally, reference tests)

  • tests/integration/client/feedback/integrations/huggingface/test_model_card.py

Checklist

  • I added relevant documentation
  • I followed the style guidelines of this project
  • I did a self-review of my code
  • I made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • I filled out the contributor form (see text above)
  • I have added relevant notes to the CHANGELOG.md file (See https://keepachangelog.com/)

Contributor

@davidberenstein1957 davidberenstein1957 left a comment


Hi @plaguss,

This is looking good already. Some remarks.

For a separate PR:
[ ] I think we should add a push_to_huggingface method to each of the trainers. Here we can nicely include the cards.
[ ] Also, we need a call or predict method for each of the trainers.
I believe there are some issues on GitHub but otherwise feel free to create them.

For this PR:
[ ] Missing some tests.
[ ] missing some docs.
[ ] We should add model_card_data to the ABC base class in argilla.client.feedback.training.

model_card_data might be renamed to get_model_card_data
maybe integrations/huggingface/model_card is more explicit?

trainer.update_config(
    # The non-default hyperparameters will be filled here
)
Contributor


Is there a way to get these arguments too?

Contributor


Don't spend too much time on this.

Contributor Author


I'm working on this right now, but it's too much of a draft to push the commits yet. They are a bit trickier depending on the model. For example, transformers is almost done, as the inner trainer_kwargs is relatively simple, but in the trl framework it will take some work to nicely print config=PPOConfig(batch_size=1, ppo_epochs=1), I think. I'm working on it; if I see that it needs too much effort I'll let you know 😃.

Contributor Author


With the current implementation we cannot write the arguments passed by the user to update_config in the following cases:

  • spacy: it doesn't work due to how the training configuration is defined with nested dicts; the (naive) _updated_arguments function, which grabs the user's arguments, cannot currently deal with nested dicts (a recursive sketch of how this could be handled is shown after this list). For example, the default configuration for spacy is:
{
    'dev_corpus': 'corpora.dev',
    'train_corpus': 'corpora.train',
    'batcher': {
        '@batchers': 'spacy.batch_by_words.v1',
        'discard_oversize': False,
        'tolerance': 0.2,
        'size': {'@schedules': 'compounding.v1', 'start': 100, 'stop': 1000, 'compound': 1.001, 't': 0.0},
        'get_length': None
    },
    'optimizer': {'@optimizers': 'Adam.v1', 'beta1': 0.9, 'beta2': 0.999, 'L2_is_weight_decay': True, 'L2': 0.01, 'grad_clip': 1.0, 'use_averages': False, 'eps': 1e-08, 'learn_rate': 0.001},
    'seed': '${system.seed}',
    'gpu_allocator': '${system.gpu_allocator}',
    'dropout': 0.1,
    'accumulate_gradient': 1,
    'patience': 1600,
    'max_epochs': 0,
    'max_steps': 20000,
    'eval_frequency': 200,
    'score_weights': {
        'cats_score': 1.0,
        'cats_score_desc': None,
        'cats_micro_p': None,
        'cats_micro_r': None,
        'cats_micro_f': None,
        'cats_macro_p': None,
        'cats_macro_r': None,
        'cats_macro_f': None,
        'cats_macro_auc': None,
        'cats_f_per_type': None
    },
    'frozen_components': [],
    'annotating_components': [],
    'before_to_disk': None,
    'before_update': None,
    'logger': {'@loggers': 'spacy.ConsoleLogger.v1', 'progress_bar': False}
}
  • trl (when using PPO only): the internal configuration makes it harder to print a nice representation for this case, mainly due to PPOConfig, so I prefer to leave this out for now.
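
For illustration, a recursive variant of the diffing logic could look like the sketch below (this is not the PR's actual _updated_arguments, just a minimal example of handling nested dicts like the spacy config above):

def updated_arguments(defaults: dict, current: dict) -> dict:
    # Return only the entries of `current` that differ from `defaults`,
    # descending into nested dicts (sketch, not the PR's code).
    diff = {}
    for key, value in current.items():
        default_value = defaults.get(key)
        if isinstance(value, dict) and isinstance(default_value, dict):
            nested = updated_arguments(default_value, value)
            if nested:  # keep only branches that actually changed
                diff[key] = nested
        elif value != default_value:
            diff[key] = value
    return diff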

Contributor Author

plaguss commented Oct 9, 2023

Hi @plaguss,

This is looking good already. Some remarks.

For a separate PR:
[ ] I think we should add a push_to_huggingface method to each of the trainers. Here we can nicely include the cards.
[ ] Also, we need a call or predict method for each of the trainers. I believe there are some issues on GitHub but otherwise feel free to create them.

For the moment I'm generating the call to the predict method if it exists for the framework, and a sample call to the underlying framework (extracted from the docs) for the models that don't have predict implemented. But yes, it would be nice to have the predict method available, at least for testing the model.

For this PR:
[ ] Missing some tests.
[ ] missing some docs.
[ ] We should add model_card_data to the ABC base class in argilla.client.feedback.training.

Working on the tests! I didn't forget them, I'm still refining them. I wasn't sure about adding it to the ABC base class, but I will do it.
And of course, I'll add the docs.

model_card_data might be renamed to get_model_card_data
maybe integrations/huggingface/model_card is more explicit?

Agree, I'll do it.

I've run into a decent amount of details, but it's already taking shape. As soon as I have more time I'll get it ready for review 😄


Update:

  • model_card_data added to the ABC base class in argilla.client.feedback.training.
  • model_card_data renamed to get_model_card_data.
  • maybe integrations/huggingface/model_card is more explicit? -> yes! done

Contributor Author

plaguss commented Oct 11, 2023

Hello @davidberenstein1957,
When you can, please take a look at the tests to see if the approach looks okay to you. They currently only check that the generated code snippet matches the expected one. Also, we need to train the models in order to generate the model cards, which adds a bit of overhead (around 40 seconds currently). We could test this where we are already testing the behavior of the trainers to speed up the tests a bit, but I think it's better to have them separated.
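
For illustration, the snippet-matching approach could look roughly like this (the fixture names are hypothetical):

from tempfile import TemporaryDirectory

def test_model_card_code_snippet(trainer, expected_snippet):
    # `trainer` and `expected_snippet` would be pytest fixtures, e.g. from conftest.py
    with TemporaryDirectory() as tmpdirname:
        content = trainer.generate_model_card(tmpdirname)
    # only the generated code snippet is checked, not the whole card
    assert expected_snippet in str(content)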
And a question for when I tackle the docs: where would you like me to add the new feature? It needs a short guide on how to add variables to the ArgillaTrainer that will be written to the model card, and some examples. Thank you!

Contributor

davidberenstein1957 commented Oct 12, 2023

Hi @plaguss, thanks for the iteration 👍

I think we could also just call get_model_card or generate_model_card after initializing the Trainer. Alternatively, we could do some test-mocking to avoid starting the actual training process; that way we would be able to test just the specific function. I agree it is best to keep the tests separate.
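
For instance, the mocking idea could patch the expensive inner training loop; the patched target below is illustrative and would differ per framework:

from unittest.mock import patch

def test_generate_model_card_without_training(trainer, tmp_path):
    # Skip the real training loop so only the card-generation logic runs
    # (assumes a transformers-backed trainer; other frameworks need other targets).
    with patch("transformers.Trainer.train", return_value=None):
        trainer.train(output_dir=str(tmp_path))
    content = trainer.generate_model_card(str(tmp_path))
    assert content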

The tests look fine; I think these need to be fixtures and can be added to a conftest.py. Similarly, we can try reusing the formatting_func from the other places in the tests where we actually train the models. We could also use an IOBuffer as the output directory and avoid saving the meta file to an actual output directory. Also, this logic is currently copied for all the different frameworks.

In terms of the docs, I would say adding it here would be great.

Contributor Author

plaguss commented Oct 12, 2023

  • The tests have been updated to avoid calling the trainer: much faster now with the same results, and no need to mock here (40s -> 7s).
  • All the patterns to check for the tests have been moved to conftest.py.
  • Regarding the IOBuffer, I'm not so sure what you mean. Anyway, all the file creation is now done inside a TemporaryDirectory:
with TemporaryDirectory() as tmpdirname:
    content = trainer.generate_model_card(tmpdirname)

I think it's a good idea to move all the formatting_func functions to a common module (maybe here instead of a conftest.py; I think they make sense as separate functions more than fixtures, and that way we don't need to update the checks for the model cards, as internally we are reading the source code); a sketch of what such a module could look like is below. But maybe that can be done in a different PR? I can open an issue and get it done after this one if that's ok.
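
A sketch of what such a shared module could look like (the path and the record fields are illustrative):

# e.g. tests/integration/training/formatting_funcs.py (hypothetical path)

def formatting_func_text_classification(sample: dict):
    # Map a feedback record to (text, label) pairs for the trainer.
    text = sample["text"]
    labels = [annotation["value"] for annotation in sample["label"]]
    return [(text, label) for label in labels]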

@davidberenstein1957
Contributor

https://www.digitalocean.com/community/tutorials/python-io-bytesio-stringio

And a separate PR would be great. Also, we can think of creating a common formatting_func as part of #3833.

@plaguss plaguss marked this pull request as ready for review October 14, 2023 18:31
Contributor Author

plaguss commented Oct 14, 2023

We could add more data to the model card (metrics from the models should be there), but I think it works in its current state; we can always upgrade it later 😃.

@davidberenstein1957
Contributor

We could add more data to the model card (metrics from the models should be there), but I think it works in its current state; we can always upgrade it later 😃.

I agree

Contributor Author

plaguss commented Oct 16, 2023

Hi @davidberenstein1957, do you know anything about the errors in the unit tests? They seem unrelated; I don't know if I missed something.

@davidberenstein1957
Contributor

@plaguss, they are unrelated and we have been having them for a while without a clear solution besides re-running them.

@davidberenstein1957 davidberenstein1957 merged commit 37b7074 into argilla-io:develop Oct 16, 2023
@plaguss plaguss deleted the feat/modelcard branch October 16, 2023 18:20
Development

Successfully merging this pull request may close these issues.

[FEATURE] ArgillaTrainer - Automatic model card generation on save