This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

Set ImageNet data augmentation by default #13757

Closed · wants to merge 3 commits
Conversation

@ymjiang (Contributor) commented Jan 2, 2019

https://github.com/apache/incubator-mxnet/blob/a38278ddebfcc9459d64237086cd7977ec20c70e/example/image-classification/train_imagenet.py#L42

When I train ImageNet with this line commented out, train accuracy reaches 99% while validation accuracy stays below 50% (single machine, 8 GPUs, global batch size 2048, ResNet-50, fp32). This is clearly overfitting.

After uncommenting the line and rerunning with the same settings, both train and validation accuracy converge to about 66%, which looks like a normal result.

So this data augmentation is quite important for ImageNet training. It would be better to enable it by default, so that future developers are not confused by the overfitting issue.

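As a rough illustration of the mechanism being discussed: train_imagenet.py builds its configuration with argparse, and the commented-out set_imagenet_aug call injects augmentation defaults through parser.set_defaults. Below is a minimal, self-contained sketch; the parameter names and values are illustrative, not copied from the script.

```python
import argparse

def set_imagenet_aug(parser):
    # Illustrative "standard" ImageNet augmentations: random resized crop,
    # horizontal mirroring, and color jitter. The real helper in
    # train_imagenet.py sets its own (possibly different) parameters.
    parser.set_defaults(random_resized_crop=1, random_mirror=1,
                        min_random_area=0.08,
                        brightness=0.4, contrast=0.4, saturation=0.4)

parser = argparse.ArgumentParser()
# With this call commented out (as in the original script), none of the
# augmentation defaults are applied and training overfits.
set_imagenet_aug(parser)

args = parser.parse_args([])
print(args.random_mirror)  # 1
```

Note that argparse parser-level defaults set via set_defaults appear in the parsed namespace even without matching add_argument calls, which is how a single uncommented line can switch on a whole family of augmentations.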
@ymjiang ymjiang requested a review from szha as a code owner January 2, 2019 08:45
@Roshrini (Member) commented Jan 2, 2019

@sandeep-krishnamurthy @eric-haibin-lin Can you take a look?

@mxnet-label-bot Add [pr-awaiting-review]

@marcoabreu marcoabreu added the pr-awaiting-review PR is waiting for code review label Jan 2, 2019
@vishaalkapoor (Contributor) commented Jan 7, 2019

I'm unsure why the ImageNet arguments are not the default for an ImageNet training script, and I would be curious to know why not. Depending on the answer, there are two better approaches.

If ImageNet arguments are to be the default, they should be merged into the stanza:

    parser.set_defaults(
        # network
        network          = 'resnet',
        num_layers       = 50,
        # data
        num_classes      = 1000,
        num_examples     = 1281167,
        image_shape      = '3,224,224',
        min_random_scale = 1, # if input image has min size k, suggest to use
                              # 256.0/x, e.g. 0.533 for 480
        # train
        num_epochs       = 80,
        lr_step_epochs   = '30,60',
        dtype            = 'float32'
    )

If they are not to be the default, it would be cleaner to add an argument such as --override-with-image-net-augmentations (or something more appropriately named) that overrides the parameters with those in the method.
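The opt-in flag approach could be sketched as follows; the flag name and augmentation parameters here are hypothetical, not taken from the PR:

```python
import argparse

parser = argparse.ArgumentParser()
# Hypothetical flag name; an actual change may name it differently.
parser.add_argument('--use-imagenet-aug', action='store_true',
                    help='override augmentation params with ImageNet standards')

# First pass: check whether the flag was given, then apply the augmentation
# defaults before the final parse so explicitly passed values still win.
known, _ = parser.parse_known_args(['--use-imagenet-aug'])
if known.use_imagenet_aug:
    parser.set_defaults(random_mirror=1, random_resized_crop=1,
                        min_random_area=0.08)

args = parser.parse_args(['--use-imagenet-aug'])
print(args.random_mirror)  # 1
```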

Vishaal

@stu1130 (Contributor) commented Jan 16, 2019

@rahul003 Could you take a look at this? Any idea why it was commented out? Thanks a lot!

@sandeep-krishnamurthy (Contributor)

@ymjiang - Thanks for your contributions. Did you get a chance to look at @vishaalkapoor's comment?

@vandanavk (Contributor)

@mxnet-label-bot update [pr-awaiting-response]

@marcoabreu marcoabreu added pr-awaiting-response PR is reviewed and waiting for contributor to respond and removed pr-awaiting-review PR is waiting for code review labels Feb 5, 2019
@ymjiang (Contributor, Author) commented Feb 11, 2019

Hi @sandeep-krishnamurthy, I agree with @vishaalkapoor that the augmentation parameters should be set as the default. But those parameters are already provided in set_imagenet_aug, so perhaps the neatest way is simply to uncomment the call; no other change would be needed.

@ankkhedia (Contributor)

@vishaalkapoor Could you suggest a way forward on this PR?

@@ -39,7 +39,7 @@ def set_imagenet_aug(aug):
data.add_data_args(parser)
data.add_data_aug_args(parser)
# uncomment to set standard augmentations for imagenet training

A Member commented on this line: this comment should change accordingly

@anirudhacharya (Member)

@ymjiang Can you please add a command-line argument to either override or keep the set_imagenet_aug line? I think that is what @vishaalkapoor was suggesting.

@karan6181 (Contributor)

@ymjiang Could you please address the review comments made by @anirudhacharya? There have been no updates in the last two weeks. Thanks!

@ymjiang (Contributor, Author) commented Mar 19, 2019

@karan6181 @anirudhacharya Sorry for the delay. I have committed two new changes that enable data augmentation via a command-line argument. Please review and see whether they are appropriate.

@piyushghai (Contributor)

@anirudhacharya Ping for review.
@ymjiang Can you look into the CI failures?

@Roshrini (Member)

@vishaalkapoor Can you take a look at this PR again?

@roywei (Member) commented Apr 30, 2019

@ymjiang Hi, could you rebase onto the latest master? That should resolve the failing CI test.

@anirudhacharya (Member) left a comment

lgtm

@pinaraws

@ymjiang Hi, could you rebase onto the latest master? That should resolve the failing CI test.

@piyushghai (Contributor)

@ymjiang Gentle ping...

@ymjiang (Contributor, Author) commented Jun 9, 2019

Rebased onto master now: https://github.com/apache/incubator-mxnet/pull/15189. Will close this PR.

@ymjiang ymjiang closed this Jun 9, 2019
wkcn pushed a commit that referenced this pull request Jul 11, 2019
* Update .gitmodules

* Set ImageNet data augmentation by default


* Add argument for imagenet data augmentation

* Enable data-aug with argument

* Update .gitmodules
Labels
pr-awaiting-response PR is reviewed and waiting for contributor to respond