#Torch7 Profiling
720p (720x1280)
67.7 Gops/frame
###Titan X Pascal
20.51 ms
###TitanX
35.40 ms
###GTX 1080
25.48 ms
operations: 1.43 G
image size: 224 x 224
All results are averaged over 100 runs unless otherwise mentioned
31.90 ms
25.30 ms (4C8T)
25.18 ms
462.37 ms (1-core)
3.74 ms
2.99 ms
2.57 ms
2.00 ms
1.96 ms
114.66 ms
25.73 ms
Batch Size | 1 | 2 | 4 | 8 | 16 | 32* |
---|---|---|---|---|---|---|
Time (ms per batch) | 54 | 57 | 69 | 93 | 137 | 216 |
Time (ms per frame) | 54 | 28 | 17 | 12 | 8 | 7 |
*batch > 32 gets worse
Batch Size | 1 | 2 | 4 | 8 | 16 | 32 |
---|---|---|---|---|---|---|
Time (ms per batch) | 28 | 33 | 40 | 70 | 135 | 593 |
Time (ms per frame) | 28 | 16 | 10 | 9 | 8 | 18 |
batch 1 31.6170 ms
(batch > 1 is not better in performance)
Input Resolution | Perf. CPU FP32* (ms) | Perf. GPU FP32 (ms) | Perf. GPU FP16 (ms) |
---|---|---|---|
VGA (640x480) | 1272 | 95 | 58 |
WXGA (1280x720) | 4406 | 308 | 203 |
FHD (1920x1080) | 11237 | 673 | 434 |
*CPU results averaged over 10 runs