TResNet: High Performance GPU-Dedicated Architecture

From linstcl.cn PaddlePaddle, AI Studio@Baidu

TResNet的Paddle复现版

Tal Ridnik, Hussam Lawen, Asaf Noy, Itamar Friedman, Emanuel Ben Baruch, Gilad Sharir
DAMO Academy, Alibaba Group

摘要

Many deep learning models, developed in recent years, reach higher ImageNet accuracy than ResNet50, with fewer or comparable FLOPS count. While FLOPs are often seen as a proxy for network efficiency, when measuring actual GPU training and inference throughput, vanilla ResNet50 is usually significantly faster than its recent competitors, offering better throughput-accuracy trade-off. In this work, we introduce a series of architecture modifications that aim to boost neural networks' accuracy, while retaining their GPU training and inference efficiency. We first demonstrate and discuss the bottlenecks induced by FLOPs-optimizations. We then suggest alternative designs that better utilize GPU structure and assets. Finally, we introduce a new family of GPU-dedicated models, called TResNet, which achieve better accuracy and efficiency than previous ConvNets. Using a TResNet model, with similar GPU throughput to ResNet50, we reach 80.7% top-1 accuracy on ImageNet. Our TResNet models also transfer well and achieve state-of-the-art accuracy on competitive datasets such as Stanford cars (96.0%), CIFAR-10 (99.0%), CIFAR-100 (91.5%) and Oxford-Flowers (99.1%). They also perform well on multi-label classification and object detection tasks.

Main Article Results && Reproduce Article Scores

请参见TResNet

参数文件下载

TResNet_m
TResNet_l
TResNet_xl
TResNet_m_448
TResNet_l_448
TResNet_xl_448

数据集

ImageNet1K-0
ImageNet1K-1

环境依赖

pip install -r requirements.txt

论文结果重现

在paddlepaddle环境中运行：

python run.py \
--val_mode \
--params_dir=/model/path \
--data_dir=/path/to/val \
--model_name=tresnet_m \
--input_size=224

预测

在paddlepaddle环境中运行：

python run.py \
--infer_mode \
--params_dir=/model/path \
--data_dir=/path/to/imgpath \
--model_name=tresnet_m \
--input_size=224

训练

在paddlepaddle环境中运行：

python run.py \
--train_mode \
--params_dir=/model/path \
--data_dir=/path/to/train \
--model_name=tresnet_m \
--input_size=224 \
--batch_size=190 \
--epoch_num=300 \
--lr=0.2 \
--l2_decay=0.0001

也支持多卡训练：

python3 -m paddle.distributed.launch --gpus=0,1,2,3 run.py \
--train_mode \
--params_dir=/model/path \
--data_dir=/path/to/train \
--model_name=tresnet_m \
--input_size=224 \
--batch_size=190 \
--epoch_num=300 \
--lr=0.2 \
--l2_decay=0.0001

Citation

@misc{ridnik2020tresnet,
    title={TResNet: High Performance GPU-Dedicated Architecture},
    author={Tal Ridnik and Hussam Lawen and Asaf Noy and Itamar Friedman},
    year={2020},
    eprint={2003.13630},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

Contact

联系复现作者(linstcl.cn) Feel free to contact me if there are any questions or issues (Tal Ridnik, [email protected]).

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
deploy		deploy
scripts		scripts
src		src
test_tipc		test_tipc
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_en.md		README_en.md
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TResNet: High Performance GPU-Dedicated Architecture

Main Article Results && Reproduce Article Scores

参数文件下载

数据集

环境依赖

论文结果重现

预测

训练

Citation

Contact

About

Releases 1

Packages

Languages

License

LINSTCL/PdPaper-1

Folders and files

Latest commit

History

Repository files navigation

TResNet: High Performance GPU-Dedicated Architecture

Main Article Results && Reproduce Article Scores

参数文件下载

数据集

环境依赖

论文结果重现

预测

训练

Citation

Contact

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages