Conversation

@amyeroberts
Contributor

What does this PR do?

Adds the TF equivalent for the PyTorch image classification example script.

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • Did you write any new necessary tests?

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Oct 28, 2022

The documentation is not available anymore as the PR was closed or merged.

@amyeroberts amyeroberts force-pushed the add-examples-tf-image-classification branch from 37a585f to 95c3a62 on November 3, 2022 12:46
@amyeroberts amyeroberts marked this pull request as ready for review November 3, 2022 12:52
Collaborator

@sgugger sgugger left a comment


Thanks a lot for working on this!
It looks like your branch is a bit old and doesn't contain some fixes made to ensure the example tests run when an example is modified (you can see the example tests are not running here 😅). Could you try rebasing on main?

@amyeroberts amyeroberts force-pushed the add-examples-tf-image-classification branch from 95c3a62 to 270bfb0 on November 3, 2022 13:30
@sgugger
Collaborator

sgugger commented Nov 3, 2022

Still no good - I scanned the test fetcher and found a potential bug. Could you add

            elif f.startswith("examples/tensorflow"):
                test_files_to_run.append("examples/tensorflow/test_tensorflow_examples.py")

at line 562 of utils/test_fetcher.py (in between the PyTorch and Flax cases)? I think that's what's causing the issue of the tests not running.
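
For readers following the thread, here is a self-contained sketch of the dispatch being described; the surrounding PyTorch and Flax cases are an assumption for illustration, not code copied from the repository:

```python
def example_tests_for(modified_files):
    """Sketch (an assumption, not the repo's actual code) of how the test fetcher
    maps modified example files to the framework test module that should run."""
    test_files_to_run = []
    for f in modified_files:
        if f.startswith("examples/pytorch"):
            test_files_to_run.append("examples/pytorch/test_pytorch_examples.py")
        elif f.startswith("examples/tensorflow"):
            # The branch suggested above: TF example changes trigger the TF example tests.
            test_files_to_run.append("examples/tensorflow/test_tensorflow_examples.py")
        elif f.startswith("examples/flax"):
            test_files_to_run.append("examples/flax/test_flax_examples.py")
    return sorted(set(test_files_to_run))

print(example_tests_for(["examples/tensorflow/image-classification/run_image_classification.py"]))
# ['examples/tensorflow/test_tensorflow_examples.py']
```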

@amyeroberts
Contributor Author

> It looks like your branch is a bit old and doesn't contain some fixes made to ensure the example tests run when an example is modified (you can see the example tests are not running here 😅). Could you try rebasing on main?

@sgugger I've rebased from upstream main and force-pushed again. If I run `git log --oneline` I can see that these changes are applied on top of the tip of main:

270bfb056 (HEAD -> add-examples-tf-image-classification, origin/add-examples-tf-image-classification) Add tests
a2256258b Fix up
1a1594cb8 Update requirements
b6a2f1ef9 TF image classification script
9ccea7acb (upstream/main, main) Fix some doctests after PR 15775 (#20036)
a639ea9e8 Add **kwargs (#20037)
ec6878f6c Now supporting pathlike in pipelines too. (#20030)
aa39967b2 reorganize glossary (#20010)
305e8718b Show installed libraries and their versions in CI jobs (#20026)

The examples still aren't running, and I can't see anywhere in the diff where I could be overriding this 😅 I'm sure there's something I'm overlooking. I'll keep digging, but let me know if there's something else I should be doing. Sorry to bother you.

@amyeroberts
Contributor Author

@sgugger - sorry, I'm late on the comment as well. I'll add your suggestion!

Collaborator


Is there a way to make the test run in less than 30-40s? It would be nice for it to run on the CI.

Contributor Author


Definitely - I'll play with the model / dataset / number of samples.

Contributor Author


So, this was paused for quite a while because the script was taking ages on the `prepare_tf_dataset` step. Thanks to the sleuthing from @Rocketknight1, the subsequent `to_tf_dataset` updates, and using `with_format("np")`, things now move a lot faster. Switching to an hf-internal-testing dataset and a small model means it runs in under 30s.
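
For context, a minimal sketch of the pattern described above - keep the dataset columns NumPy-formatted so that `prepare_tf_dataset` can build the tf.data pipeline quickly. The checkpoint and the synthetic data below are placeholders, not what the example script or its test actually use:

```python
import numpy as np
from datasets import Dataset
from transformers import TFAutoModelForImageClassification

# Placeholder checkpoint; the actual test uses a tiny hf-internal-testing model instead.
model = TFAutoModelForImageClassification.from_pretrained(
    "google/vit-base-patch16-224-in21k", num_labels=2
)

# Synthetic, already-preprocessed images standing in for the real dataset.
data = {
    "pixel_values": [np.random.rand(3, 224, 224).astype("float32") for _ in range(8)],
    "labels": [0, 1, 0, 1, 0, 1, 0, 1],
}
dataset = Dataset.from_dict(data).with_format("np")  # NumPy-formatted columns convert quickly

# Wraps the dataset as a batched, shuffled tf.data.Dataset ready for model.fit().
tf_dataset = model.prepare_tf_dataset(dataset, batch_size=4, shuffle=True)
```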

Member

@Rocketknight1 Rocketknight1 left a comment


This looks really clean overall! Have you tested that you get similar accuracy after fine-tuning to what the PT example gets? I know the CI test checks this, but the tolerances on quick, small tests like that have to be quite wide, so they can miss subtler performance degradation.

Comment on lines +187 to +207
Member


I know this is from torchvision and not your idea, but trying 10 random rolls and then giving up, rather than figuring out how to constrain your randomness so it outputs valid solutions the first time, is quite funny.

Member


Although on reflection I suppose doing it this way gives you an unbiased sample from the space of valid target_area + aspect ratio pairs, and you'd need to handle things quite carefully to make that the case if you constrained the random rolls to always be valid.
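
For anyone who hasn't seen the torchvision logic being discussed, here is a rough Python sketch in the spirit of RandomResizedCrop.get_params (not the code in this PR): ten unconstrained random rolls over (area, aspect ratio), with a deterministic center-crop fallback if none of them fits.

```python
import math
import random

def random_crop_params(height, width, scale=(0.08, 1.0), ratio=(3 / 4, 4 / 3)):
    """Return (top, left, crop_height, crop_width) for a random resized crop."""
    area = height * width
    log_ratio = (math.log(ratio[0]), math.log(ratio[1]))

    # Up to ten rolls; accept the first crop that fits inside the image.
    for _ in range(10):
        target_area = area * random.uniform(*scale)
        aspect_ratio = math.exp(random.uniform(*log_ratio))  # log-uniform aspect ratio

        w = int(round(math.sqrt(target_area * aspect_ratio)))
        h = int(round(math.sqrt(target_area / aspect_ratio)))
        if 0 < w <= width and 0 < h <= height:
            top = random.randint(0, height - h)
            left = random.randint(0, width - w)
            return top, left, h, w

    # Fallback: center crop, clamped to the closest valid aspect ratio.
    in_ratio = width / height
    if in_ratio < ratio[0]:
        w, h = width, int(round(width / ratio[0]))
    elif in_ratio > ratio[1]:
        h, w = height, int(round(height * ratio[1]))
    else:
        w, h = width, height
    return (height - h) // 2, (width - w) // 2, h, w
```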

Comment on lines 346 to 347
Member


I haven't looked at our image models too much, but I'm guessing that the scaling factors `feature_extractor.image_mean` and `feature_extractor.image_std` were also computed after division by 255? It seems odd to normalize the scale twice like this.

Contributor Author


Exactly. It's pretty standard to have the values this way in vision - most of our `image_mean` and `image_std` values are constants from e.g. ImageNet. I think it comes down to being able to use floats: if I found that the mean of my R-channel pixels was 134.7, how would I subtract it from the pixel values while they're still `uint8`?
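
A tiny worked example of the point above (the statistics are made-up numbers, purely for illustration): rescaling to [0, 1] first and then normalizing with 0-1 statistics is numerically the same as subtracting a 0-255 mean and dividing by a 0-255 std, but keeps everything in floats from the start.

```python
import numpy as np

pixels_uint8 = np.array([0, 128, 255], dtype=np.uint8)   # raw R-channel values
mean_255, std_255 = 134.7, 55.9                          # hypothetical stats on the 0-255 scale
mean_01, std_01 = mean_255 / 255.0, std_255 / 255.0      # the same stats on the 0-1 scale

scaled = pixels_uint8.astype(np.float32) / 255.0         # rescale first...
normalized = (scaled - mean_01) / std_01                 # ...then normalize with 0-1 stats

# Equivalent single step on the raw 0-255 values:
normalized_direct = (pixels_uint8.astype(np.float32) - mean_255) / std_255

print(np.allclose(normalized, normalized_direct))        # True
```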

@github-actions
Contributor

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@Rocketknight1
Member

Hey @amyeroberts, is this PR still going ahead? It looked almost ready!

@amyeroberts
Contributor Author

@Rocketknight1 Yes - sorry, this fell down my priority list for a bit. The code is all ready to go - I was trying to find models that make the tests run quickly, cf. this comment.

@Rocketknight1
Member

@amyeroberts Ah, that makes sense! It's totally okay to upload your own super-mini model and use that - it doesn't really matter if the accuracy is bad; the test will just let us detect if the outputs from this model class change suddenly.
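
For reference, a minimal sketch of what uploading such a super-mini model could look like; the config sizes and the repo name are placeholders, not anything used in this PR:

```python
import tensorflow as tf
from transformers import ViTConfig, TFViTForImageClassification

# Tiny, randomly initialized config: test speed matters here, accuracy does not.
config = ViTConfig(
    image_size=32,
    patch_size=4,
    hidden_size=32,
    num_hidden_layers=2,
    num_attention_heads=2,
    intermediate_size=64,
    num_labels=2,
)
model = TFViTForImageClassification(config)
model(tf.random.uniform((1, 3, 32, 32)))  # run one forward pass so the weights get built

# Placeholder repo name; the example test would point at whatever gets uploaded.
model.push_to_hub("my-username/tiny-random-vit-for-tests")
```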

@github-actions
Contributor

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions bot closed this Jan 13, 2023
@Rocketknight1 Rocketknight1 reopened this Jan 13, 2023
@github-actions github-actions bot closed this Jan 22, 2023
amyeroberts and others added 8 commits January 26, 2023 11:30
@amyeroberts amyeroberts reopened this Jan 26, 2023
@amyeroberts amyeroberts force-pushed the add-examples-tf-image-classification branch from 0dea54f to 8ef50e3 on January 26, 2023 14:58
@amyeroberts amyeroberts merged commit e5db705 into huggingface:main Feb 1, 2023
@amyeroberts amyeroberts deleted the add-examples-tf-image-classification branch February 1, 2023 19:09