add support for freeze training by youyuxiansen · Pull Request #6001 · ultralytics/yolov5

youyuxiansen · 2021-12-16T06:28:54Z

This tutorial is for freeze training

For what

There may be a need for some peoples to train a model with some structure freezer.

Modify

Modify the --freeze param in train.py，to support more flexible frozen layers selection methods.
Add a frozen training yaml file that support more flexible frozen training plan definitions.

How to use

Customize the model training process by defining a yaml file.

Demo

freeze_train.py --weights yolov5s.pt \
--data data/coco128.yaml --cfg models/yolov5s.yaml --batch-size -1 \
--device 0,1,2,3 --hyp data/hyps/hyp.finetune.yaml --cache \
--freeze-plan freeze_plans/freeze_exp.yaml

TODO

Support selecting part of the data in any freeze training step to train the model.

Implementation

Currently the training process is independent from train.py, because it needs a little bit more time to put it into train.py, and I am not sure if this feature is important for this repository for now. If necessary, I can merge the changes into train.py later, and I will maintain the bugs of this feature in time.

🛠️ PR Summary

_{Made with ❤️ by Ultralytics Actions}

🌟 Summary

Implementation of layer freezing mechanism during training in YOLOv5.

📊 Key Changes

Added freeze_train.py, a new training script to handle freeze training.
Introduced freeze_plans folder with example YAML files for defining freeze strategies.
Modified train.py to support the freeze options through --freeze and --freeze-type CLI arguments.
A README file in freeze_plans detailing how to use freeze training has been added.

🎯 Purpose & Impact

🎓 Purpose: Enables training YOLOv5 models with certain layers frozen, enhancing fine-tuning capabilities for transfer learning and potentially improving model performance on specific tasks.
👩‍💻 Impact: Allows users more control over the training process, potentially speeding up training and increasing model stability.
🦾 To Users: Anyone interested in advanced model training techniques can now experiment with different layer freezing strategies.

for more information, see https://pre-commit.ci

glenn-jocher · 2021-12-16T12:05:13Z

@youyuxiansen you can freeze any layers with the --freeze argument, i.e. to freeze the backbone (first 10 layers):

python train.py --weights yolov5s.pt --freeze 10

See argparser for details:

yolov5/train.py

Line 472 in 628817d

    
           parser.add_argument('--freeze', type=int, default=0, help='Number of layers to freeze. backbone=10, all=24')

youyuxiansen · 2021-12-16T13:34:26Z

Hi @glenn-jocher , looks like you haven’t read my commitment. I Submit this because the freeze cannot meet some needs. For example, what if I want to freeze the 2,3,4,5 layers or the 1,3,5,7 layer. And if I want to train multiple times with different frozen layers each time. It is so inconvenient and not directly. But with my commitment, this all can be easy to do.

glenn-jocher · 2021-12-16T13:35:35Z

@youyuxiansen yes the current freeze argument only supports and 'up to layer' value. I think for most customizations a user may want to modify the freeze logic directly here:

yolov5/train.py

Lines 126 to 133 in c1249a4

    
           # Freeze 
        
           freeze = [f'model.{x}.' for x in range(freeze)]  # layers to freeze 
        
           for k, v in model.named_parameters(): 
        
               v.requires_grad = True  # train all layers 
        
               if any(x in k for x in freeze): 
        
                   LOGGER.info(f'freezing {k}') 
        
                   v.requires_grad = False

It is true that I think TF supports a wider variety of freezing strategies.

youyuxiansen · 2021-12-16T13:40:06Z

I suggest you try my submission when you have time. @glenn-jocher

glenn-jocher · 2021-12-16T13:44:20Z

@youyuxiansen yes I've browsed the code. I understand it adds features. The main issue is the scope and complexity of additional code that the PR introduces, and the addition of a new file in the YOLOv5 root directory (which itself introduces duplication of existing code/functionality).

For various reasons (maintenance, documentation, simplicity) I would encourage you to find the minimum viable solution with the least amount of code required.

youyuxiansen · 2021-12-16T13:53:35Z

@glenn-jocher Ok, I understand. At last, can this part of the commit be accepted for a more flexible frozen layer specified?

https://github.com/ultralytics/yolov5/pull/6001/files#diff-ed183d67207df065a11e1289f19d34cc2abbc5448dea952683cfe9728c342b95R127-R137

glenn-jocher · 2021-12-16T14:03:52Z

@youyuxiansen yes I was going to say we might be able to smartly handle arguments so that a single value would be 'up to layer' and multiple values would simply index the freeze layers, i.e.:

python train.py --freeze 10  # freeze up to 10
python train.py --freeze 5 6 7 8 9 10  # freeze layers 5-10

EDIT: we don't want to add additional arguments (unless absolutely necessary for a major feature) to train.py as it already has too many arguments.

EDIT2: see detect.py --weights as an example of variable length argument:

    parser.add_argument('--weights', nargs='+', type=str, default=ROOT / 'yolov5s.pt', help='model path(s)')

# python detect.py --weights yolov5s.pt
# python detect.py --weights yolov5s.pt yolov5m.pt  # ensemble

youyuxiansen · 2021-12-16T14:09:10Z

@glenn-jocher Oh, I see. I think I might have some time tomorrow to modify this for YOLOV5.

youyuxiansen and others added 2 commits December 16, 2021 14:27

add support for freeze training

322bc7c

[pre-commit.ci] auto fixes from pre-commit.com hooks

1fd6d03

for more information, see https://pre-commit.ci

youyuxiansen closed this Dec 17, 2021

youyuxiansen deleted the freezeDev branch December 17, 2021 06:46

This was referenced Dec 17, 2021

add support for multiple frozen layer specifing #6017

Closed

support specfiy multiple frozen layers #6018

Closed

Multi-layer capable --freeze argument #6019

Merged

kadmor mentioned this pull request Dec 17, 2021

Freeze WongKinYiu/yolor#156

Open

bilzard mentioned this pull request Dec 20, 2021

--freeze option doen't work as expected #6038

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add support for freeze training#6001

add support for freeze training#6001
youyuxiansen wants to merge 2 commits intoultralytics:masterfrom
youyuxiansen:freezeDev

youyuxiansen commented Dec 16, 2021 •

edited by UltralyticsAssistant

Loading

Uh oh!

glenn-jocher commented Dec 16, 2021

Uh oh!

youyuxiansen commented Dec 16, 2021

Uh oh!

glenn-jocher commented Dec 16, 2021

Uh oh!

youyuxiansen commented Dec 16, 2021

Uh oh!

glenn-jocher commented Dec 16, 2021 •

edited

Loading

Uh oh!

youyuxiansen commented Dec 16, 2021

Uh oh!

glenn-jocher commented Dec 16, 2021 •

edited

Loading

Uh oh!

youyuxiansen commented Dec 16, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

youyuxiansen commented Dec 16, 2021 • edited by UltralyticsAssistant Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

This tutorial is for freeze training

For what

Modify

How to use

Demo

TODO

Implementation

🛠️ PR Summary

🌟 Summary

📊 Key Changes

🎯 Purpose & Impact

Uh oh!

glenn-jocher commented Dec 16, 2021

Uh oh!

youyuxiansen commented Dec 16, 2021

Uh oh!

glenn-jocher commented Dec 16, 2021

Uh oh!

youyuxiansen commented Dec 16, 2021

Uh oh!

glenn-jocher commented Dec 16, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

youyuxiansen commented Dec 16, 2021

Uh oh!

glenn-jocher commented Dec 16, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

youyuxiansen commented Dec 16, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

youyuxiansen commented Dec 16, 2021 •

edited by UltralyticsAssistant

Loading

glenn-jocher commented Dec 16, 2021 •

edited

Loading

glenn-jocher commented Dec 16, 2021 •

edited

Loading