C++ serving example: adding saved model path #290
<!-- #region -->
<img src="https://developer.download.nvidia.com/notebooks/dlsw-notebooks/tftrt_cpp_frozen-graph/nvidia_logo.png" style="width: 90px; float: right;">

# TF-TRT C++ Image Recognition Demo

This example shows how you can load a native TF Keras ResNet-50 model, convert it to a TF-TRT optimized model (via the TF-TRT Python API), save the model as a frozen graph, and finally load and serve the model with the TF C++ API. The process is illustrated in the workflow diagram below:

![TF-TRT C++ Inference workflow](TF-TRT_CPP_inference.png "TF-TRT C++ Inference")

This example builds on Google's original TensorFlow C++ image classification [example](https://github.com/tensorflow/tensorflow/tree/master/tensorflow/examples/label_image), adding the TF-TRT conversion step and adapting the C++ code to load and run inference with the TF-TRT model.

## Docker environment
Docker images provide a convenient and repeatable environment for experimentation. This workflow was tested in the NVIDIA NGC TensorFlow 22.01 Docker container, which comes with a TensorFlow 2.x build. The tools required for building this example, such as Bazel and the NVIDIA CUDA, cuDNN, and NCCL libraries, are all readily set up.

To replicate the steps below, start by pulling the NGC TF container:

```
docker pull nvcr.io/nvidia/tensorflow:22.01-tf2-py3
```
Then start the container with nvidia-docker:

```
nvidia-docker run --rm -it -p 8888:8888 --name TFTRT_CPP nvcr.io/nvidia/tensorflow:22.01-tf2-py3
```

You will land at `/workspace` within the Docker container. Clone the TF-TRT example repository with:

```
git clone https://github.com/tensorflow/tensorrt
cd tensorrt
```

Then copy the contents of this C++ example directory to the TensorFlow example source directory:

```
cp -r ./tftrt/examples-cpp/image_classification /opt/tensorflow/tensorflow-source/tensorflow/examples/
cd /opt/tensorflow/tensorflow-source/tensorflow/examples/image_classification/frozen-graph
```

<!-- #region -->
## Convert to TF-TRT Model

Start JupyterLab with:

```
jupyter lab --ip=0.0.0.0
```

A Jupyter notebook that downloads the Keras ResNet-50 model and performs the TF-TRT conversion is provided in `tf-trt-conversion.ipynb` for your experimentation. By default, this notebook produces a TF-TRT FP32 model at `/opt/tensorflow/tensorflow-source/tensorflow/examples/image_classification/frozen-graph/frozen_models_trt_fp32/frozen_models_trt_fp32.pb`.
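
For reference, the heart of the conversion is roughly as follows. This is a minimal sketch, not the notebook's exact code: it assumes TF 2.x with TF-TRT available (as in the NGC container), and the directory names are illustrative.

```python
import tensorflow as tf
from tensorflow.python.compiler.tensorrt import trt_convert as trt
from tensorflow.python.framework.convert_to_constants import (
    convert_variables_to_constants_v2,
)

# Save a vanilla Keras ResNet-50 as a SavedModel first.
model = tf.keras.applications.ResNet50(weights="imagenet")
model.save("resnet50_saved_model")

# Convert with TF-TRT at FP32 precision.
converter = trt.TrtGraphConverterV2(
    input_saved_model_dir="resnet50_saved_model",
    precision_mode=trt.TrtPrecisionMode.FP32,
)
trt_func = converter.convert()

# Optionally also save the converted model as a SavedModel.
converter.save("resnet50_saved_model_TFTRT_FP32")

# Freeze the converted function into a single .pb file for the C++ example.
frozen_func = convert_variables_to_constants_v2(trt_func)
tf.io.write_graph(
    frozen_func.graph,
    "frozen_models_trt_fp32",
    "frozen_models_trt_fp32.pb",
    as_text=False,
)
```
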
As part of the conversion, the notebook also benchmarks the model and prints throughput statistics.
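
A throughput measurement can be as simple as timing repeated inference calls after a warm-up; a minimal sketch (the batch size, iteration counts, and SavedModel directory name here are illustrative, not the notebook's settings):

```python
import time
import numpy as np
import tensorflow as tf

# Load the converted model via its serving signature.
saved = tf.saved_model.load("resnet50_saved_model_TFTRT_FP32")
infer = saved.signatures["serving_default"]

batch = tf.constant(np.random.rand(8, 224, 224, 3).astype(np.float32))

# Warm up so one-time engine build and allocation costs are excluded.
for _ in range(50):
    infer(batch)

n_iters = 200
start = time.time()
for _ in range(n_iters):
    infer(batch)
elapsed = time.time() - start
print(f"Throughput: {8 * n_iters / elapsed:.1f} images/sec")
```
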

<!-- #endregion -->

## Build the C++ example
The NVIDIA NGC container already has everything you need to build and run this example.

To build it, first copy the build script `tftrt-build.sh` to `/opt/tensorflow`:

```
cp tftrt-build.sh /opt/tensorflow
```

Then, from `/opt/tensorflow`, run the build command:

```bash
cd /opt/tensorflow
bash ./tftrt-build.sh
```

That should build a binary executable `tftrt_label_image` that you can then run like this:

```bash
tensorflow-source/bazel-bin/tensorflow/examples/image_classification/frozen-graph/tftrt_label_image \
--graph=/opt/tensorflow/tensorflow-source/tensorflow/examples/image_classification/frozen-graph/frozen_models_trt_fp32/frozen_models_trt_fp32.pb \
--image=/opt/tensorflow/tensorflow-source/tensorflow/examples/image_classification/frozen-graph/data/img0.JPG
```

This uses the default image `img0.JPG`, which was downloaded as part of the conversion notebook, and should output something similar to this:

```
2022-04-29 04:20:24.377345: I tensorflow/examples/image_classification/frozen-graph/main.cc:276] malamute (250): 0.575496
2022-04-29 04:20:24.377370: I tensorflow/examples/image_classification/frozen-graph/main.cc:276] Saint Bernard (248): 0.399285
2022-04-29 04:20:24.377380: I tensorflow/examples/image_classification/frozen-graph/main.cc:276] Eskimo dog (249): 0.0228338
2022-04-29 04:20:24.377387: I tensorflow/examples/image_classification/frozen-graph/main.cc:276] Ibizan hound (174): 0.00127912
2022-04-29 04:20:24.377394: I tensorflow/examples/image_classification/frozen-graph/main.cc:276] Mexican hairless (269): 0.000520922
```

The program will also benchmark and output the throughput. Observe the improved throughput offered by moving from Python to C++ serving.
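
As a rough Python-side reference point, the same frozen graph can be served from Python by wrapping the imported `GraphDef` in a callable function. A minimal sketch; note that the input and output tensor names (`"x:0"` and `"Identity:0"` below) are placeholders that depend on the actual frozen graph:

```python
import numpy as np
import tensorflow as tf

PB_PATH = ("/opt/tensorflow/tensorflow-source/tensorflow/examples/"
           "image_classification/frozen-graph/frozen_models_trt_fp32/"
           "frozen_models_trt_fp32.pb")

with tf.io.gfile.GFile(PB_PATH, "rb") as f:
    graph_def = tf.compat.v1.GraphDef()
    graph_def.ParseFromString(f.read())

# Wrap the frozen GraphDef so it can be called like a TF2 function.
def wrap_frozen_graph(graph_def, inputs, outputs):
    def _imports():
        tf.compat.v1.import_graph_def(graph_def, name="")
    wrapped = tf.compat.v1.wrap_function(_imports, [])
    return wrapped.prune(
        tf.nest.map_structure(wrapped.graph.as_graph_element, inputs),
        tf.nest.map_structure(wrapped.graph.as_graph_element, outputs),
    )

infer = wrap_frozen_graph(graph_def, inputs="x:0", outputs="Identity:0")
image = tf.constant(np.random.rand(1, 224, 224, 3).astype(np.float32))
print(infer(image))
```
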
Next, try it out on your own images by supplying the `--image=` argument, e.g.:

```bash
tensorflow-source/bazel-bin/tensorflow/examples/image_classification/frozen-graph/tftrt_label_image \
--graph=/opt/tensorflow/tensorflow-source/tensorflow/examples/image_classification/frozen-graph/frozen_models_trt_fp32/frozen_models_trt_fp32.pb \
--image=my_image.png
```

## What's next

Try building TF-TRT FP16 and INT8 models, test them on your own data, and serve them with C++.
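
The FP16 path mirrors the FP32 conversion above; only the precision mode changes. A minimal sketch (directory names are illustrative):

```python
from tensorflow.python.compiler.tensorrt import trt_convert as trt

# Same conversion as before, but with FP16 precision; on GPUs with
# Tensor Cores this typically improves throughput further.
converter = trt.TrtGraphConverterV2(
    input_saved_model_dir="resnet50_saved_model",
    precision_mode=trt.TrtPrecisionMode.FP16,
)
converter.convert()
converter.save("resnet50_saved_model_TFTRT_FP16")
```
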

The Bazel `BUILD` file for the example:

```
# Description:
# TensorFlow C++ inference example with TF-TRT model.

load("//tensorflow:tensorflow.bzl", "tf_cc_binary")

package(
    default_visibility = ["//tensorflow:internal"],
    licenses = ["notice"],
)

tf_cc_binary(
    name = "tftrt_label_image",
    srcs = [
        "main.cc",
    ],
    linkopts = select({
        "//tensorflow:android": [
            "-pie",
            "-landroid",
            "-ljnigraphics",
            "-llog",
            "-lm",
            "-z defs",
            "-s",
            "-Wl,--exclude-libs,ALL",
        ],
        "//conditions:default": ["-lm"],
    }),
    deps = select({
        "//tensorflow:android": [
            # cc:cc_ops is used to include image ops (for label_image)
            # Jpg, gif, and png related code won't be included
            "//tensorflow/cc:cc_ops",
            "//tensorflow/core:portable_tensorflow_lib",
            # cc:android_tensorflow_image_op is for including jpeg/gif/png
            # decoder to enable real-image evaluation on Android
            "//tensorflow/core/kernels/image:android_tensorflow_image_op",
        ],
        "//conditions:default": [
            "//tensorflow/cc:cc_ops",
            "//tensorflow/cc/saved_model:loader",
            "//tensorflow/core:core_cpu",
            "//tensorflow/core:framework",
            "//tensorflow/core:framework_internal",
            "//tensorflow/core:lib",
            "//tensorflow/core:protos_all_cc",
            "//tensorflow/core:tensorflow",
        ],
    }),
)
```

<!-- #region -->
<img src="https://developer.download.nvidia.com//notebooks/dlsw-notebooks/tftrt_cpp_saved-model/nvidia_logo.png" style="width: 90px; float: right;">

# TF-TRT C++ Image Recognition Demo

This example shows how you can load a native TF Keras ResNet-50 model, convert it to a TF-TRT optimized model (via the TF-TRT Python API), export it in the SavedModel format, and finally load and serve the model with the TF C++ API. The process is illustrated in the workflow diagram below:

![TF-TRT C++ Inference workflow](TF-TRT_CPP_inference_saved_model.png "TF-TRT C++ Inference")

This example builds on Google's original TensorFlow C++ image classification [example](https://github.com/tensorflow/tensorflow/tree/master/tensorflow/examples/label_image), adding the TF-TRT conversion step and adapting the C++ code to load and run inference with the TF-TRT model.

## Docker environment
Docker images provide a convenient and repeatable environment for experimentation. This workflow was tested in the NVIDIA NGC TensorFlow 22.01 Docker container, which comes with a TensorFlow 2.x build. The tools required for building this example, such as Bazel and the NVIDIA CUDA, cuDNN, and NCCL libraries, are all readily set up.

To replicate the steps below, start by pulling the NGC TF container:

```
docker pull nvcr.io/nvidia/tensorflow:22.01-tf2-py3
```
Then start the container with nvidia-docker:

```
nvidia-docker run --rm -it -p 8888:8888 --name TFTRT_CPP nvcr.io/nvidia/tensorflow:22.01-tf2-py3
```

You will land at `/workspace` within the Docker container. Clone the TF-TRT example repository with:

```
git clone https://github.com/tensorflow/tensorrt
cd tensorrt
```

Then copy the contents of this C++ example directory to the TensorFlow example source directory:

```
cp -r ./tftrt/examples-cpp/image_classification/ /opt/tensorflow/tensorflow-source/tensorflow/examples/
cd /opt/tensorflow/tensorflow-source/tensorflow/examples/image_classification/saved-model
```

<!-- #region -->
## Convert to TF-TRT Model

Start JupyterLab with:

```
jupyter lab --ip=0.0.0.0
```

A Jupyter notebook that downloads the Keras ResNet-50 model and performs the TF-TRT conversion is provided in `tf-trt-conversion.ipynb` for your experimentation. By default, this notebook produces a TF-TRT FP32 SavedModel at `/opt/tensorflow/tensorflow-source/tensorflow/examples/image_classification/saved-model/resnet50_saved_model_TFTRT_FP32_frozen`.
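
For reference, the heart of the conversion is roughly as follows. This is a minimal sketch, not the notebook's exact code: it assumes TF 2.x with TF-TRT available (as in the NGC container), and the intermediate directory name is illustrative.

```python
import tensorflow as tf
from tensorflow.python.compiler.tensorrt import trt_convert as trt

# Save a vanilla Keras ResNet-50 as a SavedModel first.
model = tf.keras.applications.ResNet50(weights="imagenet")
model.save("resnet50_saved_model")

# Convert with TF-TRT at FP32 precision and export the result as a
# SavedModel, which the C++ example then loads via --export_dir.
converter = trt.TrtGraphConverterV2(
    input_saved_model_dir="resnet50_saved_model",
    precision_mode=trt.TrtPrecisionMode.FP32,
)
converter.convert()
converter.save("resnet50_saved_model_TFTRT_FP32_frozen")
```
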
As part of the conversion, the notebook also benchmarks the model and prints throughput statistics.

<!-- #endregion -->

## Build the C++ example
The NVIDIA NGC container already has everything you need to build and run this example.

To build it, first copy the build script `tftrt-build.sh` to `/opt/tensorflow`:

```
cp tftrt-build.sh /opt/tensorflow
```

Then, from `/opt/tensorflow`, run the build command:

```bash
cd /opt/tensorflow
bash ./tftrt-build.sh
```

That should build a binary executable `tftrt_label_image` that you can then run like this:

```bash
tensorflow-source/bazel-bin/tensorflow/examples/image_classification/saved-model/tftrt_label_image \
--export_dir=/opt/tensorflow/tensorflow-source/tensorflow/examples/image_classification/saved-model/resnet50_saved_model_TFTRT_FP32_frozen \
--image=/opt/tensorflow/tensorflow-source/tensorflow/examples/image_classification/saved-model/data/img0.JPG
```

This uses the default image `img0.JPG`, which was downloaded as part of the conversion notebook, and should output something similar to this:

```
2022-04-29 04:19:28.397102: I tensorflow/examples/image_classification/saved-model/main.cc:331] malamute (250): 0.575497
2022-04-29 04:19:28.397126: I tensorflow/examples/image_classification/saved-model/main.cc:331] Saint Bernard (248): 0.399284
2022-04-29 04:19:28.397134: I tensorflow/examples/image_classification/saved-model/main.cc:331] Eskimo dog (249): 0.0228338
2022-04-29 04:19:28.397141: I tensorflow/examples/image_classification/saved-model/main.cc:331] Ibizan hound (174): 0.00127912
2022-04-29 04:19:28.397147: I tensorflow/examples/image_classification/saved-model/main.cc:331] Mexican hairless (269): 0.000520922
```

The program will also benchmark and output the throughput. Observe the improved throughput offered by moving from Python to C++ serving.
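
As a rough Python-side reference point, the same SavedModel can be served from Python in a few lines. A minimal sketch; the `serving_default` signature key is the usual default, and the random batch stands in for a real preprocessed image:

```python
import numpy as np
import tensorflow as tf

EXPORT_DIR = ("/opt/tensorflow/tensorflow-source/tensorflow/examples/"
              "image_classification/saved-model/"
              "resnet50_saved_model_TFTRT_FP32_frozen")

# Load the TF-TRT SavedModel and grab its serving signature.
saved = tf.saved_model.load(EXPORT_DIR)
infer = saved.signatures["serving_default"]

# Run one inference on a dummy 224x224 RGB batch.
image = tf.constant(np.random.rand(1, 224, 224, 3).astype(np.float32))
print(infer(image))
```
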
Next, try it out on your own images by supplying the `--image=` argument, e.g.:

```bash
tensorflow-source/bazel-bin/tensorflow/examples/image_classification/saved-model/tftrt_label_image \
--export_dir=/opt/tensorflow/tensorflow-source/tensorflow/examples/image_classification/saved-model/resnet50_saved_model_TFTRT_FP32_frozen \
--image=my_image.png
```

## What's next

Try building TF-TRT FP16 and INT8 models, test them on your own data, and serve them with C++.
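
INT8 additionally requires a calibration pass over representative input data during conversion. A minimal sketch of what that could look like (random calibration data is used here only for brevity; real calibration should use representative images, and the directory names are illustrative):

```python
import numpy as np
import tensorflow as tf
from tensorflow.python.compiler.tensorrt import trt_convert as trt

converter = trt.TrtGraphConverterV2(
    input_saved_model_dir="resnet50_saved_model",
    precision_mode=trt.TrtPrecisionMode.INT8,
    use_calibration=True,
)

# The calibration input function yields batches that TF-TRT runs
# through the network to collect activation ranges.
def calibration_input_fn():
    for _ in range(10):
        yield (tf.constant(
            np.random.rand(8, 224, 224, 3).astype(np.float32)),)

converter.convert(calibration_input_fn=calibration_input_fn)
converter.save("resnet50_saved_model_TFTRT_INT8")
```
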