Add Image Processor Fast Deformable DETR by yonigozlan · Pull Request #34353 · huggingface/transformers

yonigozlan · 2024-10-23T16:50:33Z

What does this PR do?

Adds a fast image processor for Deformable DETR. Follows issue #33810.
This image processor is a result of this work on comparing different image processing method.

The diffs look bad but this PR is almost exclusively made up of # Copied from based on the fast image processor for DETR!

Implementation

See #34063

Usage

Except for the fact that it only returns torch tensors, this fast processor is fully compatible with the current one.
It can be instantiated through AutoImageProcessor with use_fast=True, or through the Class directly:

from transformers import AutoImageProcessor

processor = AutoImageProcessor.from_pretrained("SenseTime/deformable-detr", use_fast=True)

from transformers import DeformableDetrImageProcessorFast

processor = DeformableDetrImageProcessorFast.from_pretrained("SenseTime/deformable-detr")

Usage is the same as the current processor, except for the device kwarg:

from torchvision.io import read_image
images = torchvision.io.read_image(image_path)
processor = DeformableDetrImageProcessorFast.from_pretrained("SenseTime/deformable-detr")
images_processed = processor(images , return_tensors="pt", device="cuda")

If device is not specified:

If the input images are tensors, the processing will be done on the device of the images.
If the inputs are PIL or Numpy images, the processing is done on CPU.

Performance gains

Average over 100 runs on the same 480x640 image. No padding needed, as "all" the images have the same size.

Average over 10% of the COCO 2017 validation dataset, with batch_size=8. Forcing padding to 1333x1333 (="longest_edge"), as otherwise torch.compile needs to recompile if the different batches have different max sizes.

Average over 10% of the COCO 2017 validation dataset, with batch_size=1. Forcing padding to 1333x1333.

Tests

The new image processor is tested on all the tests of the current processor.
I have also added two consistency tests (panoptic and detection) for processing on GPU vs CPU.

Who can review?

@ArthurZucker Pinging you directly as there is almost no "new" code here.

HuggingFaceDocBuilderDev · 2024-10-23T17:17:16Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

src/transformers/models/deformable_detr/image_processing_deformable_detr_fast.py

ArthurZucker

Thanks, same comment as for the other PR mostly! 🤗

yonigozlan · 2024-10-29T19:32:19Z

Will make the modifications once this PR #34354 is merged, as most of them will be copied from :)

ArthurZucker

One thing I don't understand: literally everything is copied from. Why not directy map to use the detr class?

src/transformers/models/deformable_detr/image_processing_deformable_detr_fast.py

yonigozlan · 2024-11-05T16:01:55Z

One thing I don't understand: literally everything is copied from. Why not directy map to use the detr class?

All pre-processing functions are copied from image_processing_detr_fast, but the post-processing function are copied from image_processing_deformable_detr. I guess the post-processing functions are also the reasons why there is a base DeformableDetrImageProcessorFast in the first place, as all the base pre-processing functions are also copied from image_processing_detr

ArthurZucker

Got it, thanks! Let's work to make it simpler to add these, with maybe a bit of abstraction on the FastImageProcessor class!

* add deformable detr image processor fast * add fast processor to doc * fix copies * nit docstring * Add tests gpu/cpu and fix docstrings * fix docstring * import changes from detr * fix imports * rebase and fix * fix input data format change in detr and rtdetr fast

yonigozlan marked this pull request as ready for review October 23, 2024 17:43

yonigozlan requested a review from ArthurZucker October 23, 2024 17:44

ArthurZucker requested a review from molbap October 24, 2024 13:51

molbap reviewed Oct 24, 2024

View reviewed changes

src/transformers/models/deformable_detr/image_processing_deformable_detr_fast.py Outdated Show resolved Hide resolved

yonigozlan force-pushed the add-copied-detr-fast branch from b9cfe3b to f9848a7 Compare October 24, 2024 22:22

ArthurZucker reviewed Oct 28, 2024

View reviewed changes

yonigozlan force-pushed the add-copied-detr-fast branch from f9848a7 to f7f480a Compare October 30, 2024 18:27

yonigozlan requested a review from ArthurZucker October 30, 2024 18:27

qubvel added Vision Processing optimization labels Oct 31, 2024

ArthurZucker reviewed Nov 5, 2024

View reviewed changes

src/transformers/models/deformable_detr/image_processing_deformable_detr_fast.py Outdated Show resolved Hide resolved

yonigozlan force-pushed the add-copied-detr-fast branch from f7f480a to 4bc94a4 Compare November 5, 2024 15:57

yonigozlan requested a review from ArthurZucker November 5, 2024 16:02

yonigozlan force-pushed the add-copied-detr-fast branch from 4bc94a4 to 449e4e5 Compare November 18, 2024 19:40

ArthurZucker approved these changes Nov 19, 2024

View reviewed changes

yonigozlan added 10 commits November 19, 2024 15:56

add deformable detr image processor fast

5746208

add fast processor to doc

ac7775b

fix copies

624b161

nit docstring

9362e4e

Add tests gpu/cpu and fix docstrings

8381600

fix docstring

a072df2

import changes from detr

487d862

fix imports

8b6cc7e

rebase and fix

90b92ca

fix input data format change in detr and rtdetr fast

e35fda7

yonigozlan force-pushed the add-copied-detr-fast branch from 449e4e5 to e35fda7 Compare November 19, 2024 16:07

yonigozlan merged commit eedc113 into huggingface:main Nov 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Image Processor Fast Deformable DETR#34353

Add Image Processor Fast Deformable DETR#34353
yonigozlan merged 10 commits intohuggingface:mainfrom
yonigozlan:add-copied-detr-fast

yonigozlan commented Oct 23, 2024 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Oct 23, 2024

Uh oh!

Uh oh!

ArthurZucker left a comment

Uh oh!

yonigozlan commented Oct 29, 2024

Uh oh!

ArthurZucker left a comment

Uh oh!

Uh oh!

yonigozlan commented Nov 5, 2024

Uh oh!

ArthurZucker left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

yonigozlan commented Oct 23, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Implementation

Usage

Performance gains

Tests

Who can review?

Uh oh!

HuggingFaceDocBuilderDev commented Oct 23, 2024

Uh oh!

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

yonigozlan commented Oct 29, 2024

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

yonigozlan commented Nov 5, 2024

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

yonigozlan commented Oct 23, 2024 •

edited

Loading