Add TensorFlow examples - ResNet50 and BERT models #7

Satya1493 · 2021-10-29T11:55:59Z

Signed-off-by: Satyanaraya Illa [email protected]

Description of the changes

TensorFlow examples for ResNet50 and BERT models. The samples run inference using pre-trained models.

How to test this PR?

Please follow steps present at tensorflow/README.md

This change is

Signed-off-by: Sonali Saha <[email protected]>

Signed-off-by: Satyanaraya Illa <[email protected]>

dimakuv

Reviewed 4 of 7 files at r1, 3 of 3 files at r2, all commit messages.
Reviewable status: all files reviewed, 12 unresolved discussions, not enough approvals from maintainers (2 more required), not enough approvals from different teams (2 more required, approved so far: ), "fixup! " found in commit messages' one-liners (waiting on @mkow and @Satya1493)

tensorflow/README.md, line 54 at r2 (raw file):

- To run int8 inference on ``gramine-direct`` (non-SGX version), replace ``gramine-sgx`` with
``gramine-direct`` in the above command.
- To run int8 inference on native baremetal (outside Gramine), replace ``gramine-sgx ./python`` with

on native baremetal (outside Gramine) -> natively (outside Gramine)

tensorflow/README.md, line 71 at r2 (raw file):

- To run inference on ``gramine-direct`` (non-SGX version), replace ``gramine-sgx`` with
``gramine-direct`` in the above command.
- To run inference on native baremetal (outside Gramine), replace ``gramine-sgx ./python`` with

on native baremetal (outside Gramine) -> natively (outside Gramine)

tensorflow/README.md, line 89 at r2 (raw file):

- To get the number of cores per socket, do ``lscpu | grep 'Core(s) per socket'``.

## Performance considerations

Could you copy-paste this section from https://github.com/gramineproject/examples/pull/6/files? I already reviewed the OpenVINO PR, and all comments pertaining to this Performance considerations section were already resolved in that OpenVINO PR.

It looks to me that these sections are 95% identical, except for libos.check_invalid_pointers manifest option. Don't you want to add the description of this option in your README?

tensorflow/BERT/Makefile, line 1 at r2 (raw file):

# BERT sample for Tensorflow

Tensorflow -> TensorFlow

tensorflow/BERT/python.manifest.template, line 2 at r2 (raw file):

libos.entrypoint = "{{ entrypoint }}"
loader.preload = "file:{{ gramine.libos }}"

Latest Gramine uses loader.entrypoint instead of loader.preload. Please see this commit: ebc051b

tensorflow/BERT/python.manifest.template, line 8 at r2 (raw file):

loader.insecure__use_cmdline_argv = true
loader.insecure__use_host_env = true
loader.insecure__disable_aslr = true

OpenVINO PR (#6) also has libos.check_invalid_pointers = false. Do you want to add this option here, for performance? Or does it break TensorFlow?

tensorflow/BERT/python.manifest.template, line 55 at r2 (raw file):

  "file:{{ arch_libdir }}/",
  "file:/usr/{{ arch_libdir }}/",
  "file:{{ entrypoint }}/",

You have a / at the end here, this can't be right -- the {{ entrypoint }} is a file, not a directory. Please remove the trailing /.

tensorflow/ResNet50/Makefile, line 1 at r2 (raw file):

# ResNet50 sample for Tensorflow

Tensorflow -> TensorFlow

tensorflow/ResNet50/python.manifest.template, line 2 at r2 (raw file):

libos.entrypoint = "{{ entrypoint }}"
loader.preload = "file:{{ gramine.libos }}"

Latest Gramine uses loader.entrypoint instead of loader.preload. Please see this commit: ebc051b

tensorflow/ResNet50/python.manifest.template, line 8 at r2 (raw file):

loader.insecure__use_cmdline_argv = true
loader.insecure__use_host_env = true
loader.insecure__disable_aslr = true

OpenVINO PR (#6) also has libos.check_invalid_pointers = false. Do you want to add this option here, for performance? Or does it break TensorFlow?

tensorflow/ResNet50/python.manifest.template, line 60 at r2 (raw file):

  "file:/usr/{{ arch_libdir }}/",
  "file:resnet50v1_5_int8_pretrained_model.pb",
  "file:{{ entrypoint }}/",

You have a / at the end here, this can't be right -- the {{ entrypoint }} is a file, not a directory. Please remove the trailing /.

tensorflow/ResNet50/python.manifest.template, line 69 at r2 (raw file):

  "file:/tmp/",
  "file:/etc/",
  "file:/proc/",

This is redundant, you're not mounting /proc/ FS in this manifest file. Just remove this line.

Signed-off-by: Satyanaraya Illa <[email protected]>

Satya1493

Reviewable status: 2 of 7 files reviewed, 12 unresolved discussions, not enough approvals from maintainers (2 more required), not enough approvals from different teams (2 more required, approved so far: ), "fixup! " found in commit messages' one-liners (waiting on @dimakuv and @mkow)

tensorflow/README.md, line 54 at r2 (raw file):