For convolution-related operations, TensorFlow takes an input tensor of shape [batch, in_height, in_width, in_channels], while Theano takes an input of shape [batch, in_channels, in_height, in_width].
Therefore, the current master branch performs an additional tensor transpose whenever dim_ordering follows the Theano convention.
However, since r0.8, TensorFlow has supported different dimension orderings via the data_format argument. We can now simply pass 'NCHW' (i.e. [n_sample, channel, height, width]) as the data_format when dim_ordering is 'th', which saves a tf.transpose operation.
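To make the two approaches concrete, here is a minimal sketch (TF 1.x-style graph API; the tensor shapes and variable names are illustrative, not Keras internals). Note that, per the comment below, the 'NCHW' path currently requires a GPU:

```python
import tensorflow as tf

# A 'th'-ordered input: [batch, channels, height, width].
x = tf.placeholder(tf.float32, [None, 3, 32, 32])
kernel = tf.Variable(tf.random_normal([3, 3, 3, 64]))  # [kh, kw, in_ch, out_ch]

# Current master-branch approach: transpose to NHWC, convolve, transpose back.
x_nhwc = tf.transpose(x, perm=[0, 2, 3, 1])   # NCHW -> NHWC
y_nhwc = tf.nn.conv2d(x_nhwc, kernel, strides=[1, 1, 1, 1], padding='SAME')
y = tf.transpose(y_nhwc, perm=[0, 3, 1, 2])   # NHWC -> NCHW

# Proposed approach: pass the tensor through directly; both transposes go away.
# (NCHW conv2d is GPU-only at the time of writing.)
y_direct = tf.nn.conv2d(x, kernel, strides=[1, 1, 1, 1],
                        padding='SAME', data_format='NCHW')
```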
Additionally, when I run the benchmark files from soumith/convnet-benchmarks, I see a consistent 20% speed gain from the 'NCHW' data_format when running on GPU. The effect seems to come from cuDNN using the 'NCHW' data_format internally, so TensorFlow has to perform extra operations to convert NHWC tensors into the layout cuDNN expects. I am not sure whether this applies to all environments (I am running Ubuntu 16.04, a GTX 1070 with CUDA toolkit 8.0RC and cuDNN v5, Keras 1.0.5, and TensorFlow 0.8.0), but if it does, I think it would be a good incentive to change the current way of dealing with different data formats.
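For anyone who wants to reproduce a rough version of this locally without the full convnet-benchmarks suite, a hypothetical micro-benchmark along these lines (TF 1.x-style API; the shapes and iteration count are arbitrary) can compare the two formats:

```python
import time
import tensorflow as tf

def time_conv(data_format, n_iter=50):
    """Time one conv2d under the given data_format (rough sketch only)."""
    tf.reset_default_graph()
    if data_format == 'NHWC':
        x = tf.random_normal([32, 224, 224, 64])
    else:  # 'NCHW' -- requires a GPU at the time of writing
        x = tf.random_normal([32, 64, 224, 224])
    kernel = tf.random_normal([3, 3, 64, 64])
    y = tf.nn.conv2d(x, kernel, strides=[1, 1, 1, 1], padding='SAME',
                     data_format=data_format)
    with tf.Session() as sess:
        sess.run(y)  # warm-up
        start = time.time()
        for _ in range(n_iter):
            sess.run(y)
        return (time.time() - start) / n_iter

print('NHWC: %.4f s/iter' % time_conv('NHWC'))
print('NCHW: %.4f s/iter' % time_conv('NCHW'))
```

(Each sess.run also re-samples the random input, so treat the numbers as indicative rather than precise.)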
Thanks for pointing this out, but unfortunately support doesn't seem mature enough to make the switch at this point. In particular, the CPU implementation only supports NHWC.