The test set is being used as validation set #145
Comments
Agreed. Even more concerning, many papers now report their performance using the best results obtained on the test set.
But the network is set to eval mode before being tested, and to train mode before training, using:
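For reference, a typical PyTorch loop switches between the two modes roughly like this. This is an illustrative sketch, not this repository's actual code; `model`, `loader`, `optimizer`, `criterion`, and `device` are placeholder names:

```python
import torch

def train_one_epoch(model, loader, optimizer, criterion, device):
    model.train()  # enables dropout and lets batch-norm update its running statistics
    for inputs, targets in loader:
        inputs, targets = inputs.to(device), targets.to(device)
        optimizer.zero_grad()
        loss = criterion(model(inputs), targets)
        loss.backward()
        optimizer.step()

@torch.no_grad()
def evaluate(model, loader, device):
    model.eval()  # freezes batch-norm statistics and disables dropout
    correct = total = 0
    for inputs, targets in loader:
        inputs, targets = inputs.to(device), targets.to(device)
        preds = model(inputs).argmax(dim=1)
        correct += (preds == targets).sum().item()
        total += targets.size(0)
    return correct / total
```

Switching modes is about layer behavior (batch norm, dropout); it does not change which dataset the evaluation is run on, which is the concern raised in this issue.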
It is not about batch-norm statistics. Evaluating on the test set to select the best model (e.g., the best checkpoint and hyperparameters) goes against the basic practice and assumptions of machine learning, and it is not realistic: in the real world, there is no way to obtain the test samples before the model is deployed.
🤔 Right. ✌️
Yes, and it is a big issue here. The test set is being used as the validation set, which means models trained in this framework effectively memorize patterns from both the training set and the test set. Overall, it leads to overfitting.
Hello: I have received your email and will reply as soon as possible. Liu Hongyu
The network is evaluated on the test set at every epoch, and whenever the result improves, the network is saved (a form of early stopping). This is what a validation set should be used for; since CIFAR-10 does not ship with a validation set, a subset of the training data can be held out for this purpose. The goal of the test set is to measure how well a network performs on unseen data; in this case, however, the test set is being used to optimize the network's results.

The test set must be used only once, at the end of training. This training procedure is erroneous, and the reported results are therefore unfortunately all invalid.
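One way to follow this recommendation is to carve a validation split out of the CIFAR-10 training set, select checkpoints on it, and touch the test set exactly once at the end. A minimal sketch, assuming the `train_one_epoch` and `evaluate` helpers from the earlier sketch and placeholder `model`, `optimizer`, `criterion`, `device`, and `num_epochs`; the 45,000/5,000 split size is an arbitrary choice, not something this repository specifies:

```python
import torch
from torch.utils.data import DataLoader, random_split
from torchvision import datasets, transforms

transform = transforms.ToTensor()
full_train = datasets.CIFAR10(root="./data", train=True, download=True, transform=transform)
test_set = datasets.CIFAR10(root="./data", train=False, download=True, transform=transform)

# Hold out 5,000 of the 50,000 training images as a validation set.
train_set, val_set = random_split(
    full_train, [45000, 5000], generator=torch.Generator().manual_seed(0)
)

train_loader = DataLoader(train_set, batch_size=128, shuffle=True)
val_loader = DataLoader(val_set, batch_size=128)
test_loader = DataLoader(test_set, batch_size=128)

best_val_acc = 0.0
for epoch in range(num_epochs):
    train_one_epoch(model, train_loader, optimizer, criterion, device)
    val_acc = evaluate(model, val_loader, device)  # model selection uses validation data only
    if val_acc > best_val_acc:
        best_val_acc = val_acc
        torch.save(model.state_dict(), "best.pt")

# The test set is evaluated exactly once, after training is finished.
model.load_state_dict(torch.load("best.pt"))
test_acc = evaluate(model, test_loader, device)
print(f"test accuracy: {test_acc:.4f}")
```

With this setup, the number reported on the test set is an honest estimate of generalization, because no training or model-selection decision ever saw it.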