Reduce tensorprotoutils binary size #6634

skottmckay · 2021-02-10T05:00:28Z

Description:
Refactor a few aspects of tensorprotoutils to reduce the binary size.

Move type agnostic code outside of templatized code
- external and raw data reads are based on number of bytes and don't need the actual type for the low level read
- if an initializer has external data, UnpackInitializerData can read that first in type agnostic code prior to any templatized code
Make the setup of the two UnpackTensor variants equivalent
- defined and instantiated in .cc
- Also checked that the linker is able to throw away unused instantiations from a MinSizeRel build by commenting out the ConstantOfShape registration and changing the type list in matmal_scale_fusion.cc (the two current external usages of UnpackTensor).
Remove some redundant/duplicate checks
Clarify some naming as to whether something refers to number of elements vs number of bytes
Add unit tests for more external data types
Minor change to use uint8_t for bytes in a few places instead of char.
- The data isn't signed per-se, so uint8_t is a slightly better fit (and shorter to type than unsigned char).

Reductions: 16.2 KB in Windows release build.

Motivation and Context
Reduce binary size growth from adding external data support in more places.

Refactor the unpack tensor logic to switch on data size Add test cases

edgchen1 · 2021-02-10T18:40:50Z

onnxruntime/core/framework/endian_utils.cc

-    size_t element_size_in_bytes,
-    gsl::span<const char> source_bytes, gsl::span<char> destination_bytes) {
+void SwapByteOrderCopy(size_t element_size_in_bytes,
+                       gsl::span<const uint8_t> source_bytes, gsl::span<uint8_t> destination_bytes) {


the reason i went with "char" was that it was explicitly mentioned as one of the types that one can use to examine an object as bytes.
https://en.cppreference.com/w/cpp/language/reinterpret_cast#Type_aliasing
i'm fine with the more verbose "unsigned char" as well, or if you can point me to a reference that indicates uint8_t is ok here too

http://www.gotw.ca/gotw/072.htm has a few comments on why unsigned types are better (although technically that is about bits I think it still applies). std::byte is also defined using unsigned char. And the data isn't inherently signed, so using an unsigned type seems like a better fit on that front as will.

Given uint8_t is an alias to unsigned char that's shorter to type and explicit about the number of bits, what's the benefit of typing the longer 'unsigned char'?

Or alternatively we could define it the same way as c++17 and have a byte type, but that should probably be a different PR.

namespace std { enum class byte : unsigned char {}; };

my understanding is that char, unsigned char, and std::byte are special in that they are allowed to be used for working with bytes of an object.
if uint8_t is guaranteed to be an alias of unsigned char, that should be fine too. all I found was that it should be an 8-bit unsigned integral type.
maybe i'm being overly pedantic here. but i think it was correct before and i'm not completely sure about this change.

For it to be incorrect, wouldn't that require uint8_t to point to something that was not 8 bits in size?

Comes down to whether CHAR_BITS is 8 on the platform or not, as that size is considered to be one 'byte'.

https://stackoverflow.com/a/16138308/684911
https://stackoverflow.com/a/43144010/684911

If protobuf packs data using 8 bits for a byte I don't think you could use it on platforms where a byte has more than 8 bits. So unsigned char is technically the most correct, uint8_t should work fine on all platforms we support, but defining an equivalent of std::byte would maybe be best.

For it to be incorrect, wouldn't that require uint8_t to point to something that was not 8 bits in size?

hm, i'm not quite saying that uint8_t is not 8 bits wide... but that it might not necessarily be the same type as unsigned char. the second paragraph here talks about some possible distinction: https://stackoverflow.com/a/16138470

yes, i think a byte type is what we want here. we conceptually deal with bytes of data, not (signed or unsigned) integers or characters.
until we can use std::byte, another idea is to have an onnxruntime::byte (typedef to unsigned char, and eventually std::byte). and "byte" is shorter to type than "uint8_t" :)

this is just my preference. but on the platforms we support, using uint8_t is probably fine.

onnxruntime/core/framework/tensorprotoutils.cc

onnxruntime/test/optimizer/initializer_test.cc

onnxruntime/test/framework/tensorutils_test.cc

edgchen1 · 2021-02-10T19:28:46Z

onnxruntime/core/framework/endian_utils.h

+                                gsl::span<uint8_t> destination_bytes);
+
+/**
+ * Reads from a little-endian source with check that T is trivially copyable.


nit: only true if not GCC below 5

if we are stuck with supporting the older GCC versions, maybe we can consider this:
https://stackoverflow.com/a/31798726

Not sure that matters. If there's code that is going to fail, it will fail in our CIs which use later versions of GCC won't it, as we don't have any code that is selectively included for a build with GCC < 5?

ok, then my original comment applies, regarding "with check that T is trivially copyable."

yuslepukhin

…nsorProtoUtilsBinarySize

Use unsigned char for bytes. Tried defining 'byte' the same way as c++17 but that created a little more pain than it was worth. Clashed with a typedef in a Windows RPC header, and without the relaxed enum initialization in C++17 it's painful to initialize something like a std::vector<byte>.

skottmckay added 3 commits February 10, 2021 09:28

Move type agnostic code out of UnpackInitializerData

3eb6673

Refactor the unpack tensor logic to switch on data size Add test cases

Remove templatization of more parts

46dde76

Fix build warning by moving a couple of things around

cc9bc00

skottmckay requested review from askhade, edgchen1 and snnn February 10, 2021 05:00

skottmckay requested a review from a team as a code owner February 10, 2021 05:00

snnn previously approved these changes Feb 10, 2021

View reviewed changes