Reference Classification Application

In order to build an application on the MLSoC, we will first write a python application that will serve as a reference, and then begin to build our GStreamer application. In this first section, we have the code and resources for the python based application.

Resources and setup

Please download and place this archive in your Palette docker container.

Download Reference App

In your environment, uncompress the application, go into the uncompressed folder, make sure to install the requirements:

sima-user@docker-image-id$ tar xvf building_applications_mlsoc_1.4.0.tar.xz
    building_applications_mlsoc_1.4.0/
    building_applications_mlsoc_1.4.0/requirements.txt
    building_applications_mlsoc_1.4.0/data/
    building_applications_mlsoc_1.4.0/data/imagenet_labels.txt
    building_applications_mlsoc_1.4.0/data/openimages_v7_images_and_labels.pkl
    building_applications_mlsoc_1.4.0/data/golden_retriever_207.jpg
    building_applications_mlsoc_1.4.0/README.md
    building_applications_mlsoc_1.4.0/models/
    building_applications_mlsoc_1.4.0/models/download_resnet50.py
    building_applications_mlsoc_1.4.0/src/
    building_applications_mlsoc_1.4.0/src/x86_reference_app/
    building_applications_mlsoc_1.4.0/src/x86_reference_app/resnet50_reference_classification_app.py
    building_applications_mlsoc_1.4.0/src/modelsdk_quantize_model/
    building_applications_mlsoc_1.4.0/src/modelsdk_quantize_model/resnet50_quant.py
sima-user@docker-image-id$ cd building_applications_mlsoc_1.4.0/
sima-user@docker-image-id$ tree -L 3
    .
    ├── data
    │   ├── golden_retriever_207.jpg
    │   ├── imagenet_labels.txt
    │   └── openimages_v7_images_and_labels.pkl
    ├── models
    │   └── download_resnet50.py
    ├── README.md
    ├── requirements.txt
    └── src
        ├── modelsdk_quantize_model
        │   └── resnet50_quant.py
        └── x86_reference_app
            └── resnet50_reference_classification_app.py
sima-user@docker-image-id$ pip3 install -r requirements.txt

Note

data: Any data we will use in the reference application or the quantization script
- golden_retriever_207.jpg: A test image we will use in our scripts.
- imagenet_labels.txt: The list of labels from ImageNet we will use to perform inference and retrieve what class was predicted.
- openimages_v7_images_and_labels.pkl: The labels and images from Open Images that we will use to calibrate our model during quantization.
models: Where we will store our model
- download_resnet50.py: Downloads a pre-trained ResNet50 model, add a softmax node at the end, and saves the model.
- The output of this script is the core model we will be using throughout the guide.
src: Application and quantization scripts
- modelsdk_quantize_model/resnet50_quant.py:
  
  Script used to quantize our resnet50_model.onnx model using SiMa.ai’s Palette ModelSDK APIs (output from download_resnet50.py).
- x86_reference_app/resnet50_reference_classification_app.py:
  
  Our sample python application that can run on any machine and uses onnxruntime to run the resnet50_model.onnx along with all pre and post processing.
  
  his will be our reference application, and we will be developing this exact application on the MLSoC.

Python example for x86/Mac

Download and save the model

Throughout this guide, we will be using a ResNet50 model from TorchVision and we will add a softmax to the output. To download the model and add the softmax operator, the output will be stored in the directory models/resnet50_model.onnx:

sima-user@docker-image-id$ python models/download_resnet50.py
    Downloading: "https://download.pytorch.org/models/resnet50-11ad3fa6.pth" to /root/.cache/torch/hub/checkpoints/resnet50-11ad3fa6.pth
    100%|███████████████████████████████████████████████████████████████████████████████████████████| 97.8M/97.8M [00:00<00:00, 114MB/s]
    ============== Diagnostic Run torch.onnx.export version 2.0.1+cpu ==============
    verbose: False, log level: Level.ERROR
    ======================= 0 NONE 0 NOTE 0 WARNING 0 ERROR ========================

    Model exported successfully to /home/docker/sima-cli/building_applications_mlsoc_1.4.0/models/resnet50_export.onnx
    Simplified model saved to /home/docker/sima-cli/building_applications_mlsoc_1.4.0/models/resnet50_model.onnx
console$ tree -L3
    .
    ├── data
    │   ├── golden_retriever_207.jpg
    │   ├── imagenet_labels.txt
    │   └── openimages_v7_images_and_labels.pkl
    ├── models
    │   ├── download_resnet50.py
    │   └── resnet50_model.onnx
    ├── README.md
    ├── requirements.txt
    └── src
        ├── modelsdk_quantize_model
        │   └── resnet50_quant.py
        └── x86_reference_app
            └── resnet50_reference_classification_app.py

Run the reference application

The following Python script demonstrates how to classify images using an ONNX ResNet50 model and onnxruntime. This script can be run on any x86/Mac machine with the necessary dependencies installed.

The reference application itself:

resnet50_reference_classification_app.py

#**************************************************************************
#||                        SiMa.ai CONFIDENTIAL                          ||
#||   Unpublished Copyright (c) 2023-2024 SiMa.ai, All Rights Reserved.  ||
#**************************************************************************
# NOTICE:  All information contained herein is, and remains the property of
# SiMa.ai. The intellectual and technical concepts contained herein are
# proprietary to SiMa and may be covered by U.S. and Foreign Patents,
# patents in process, and are protected by trade secret or copyright law.
#
# Dissemination of this information or reproduction of this material is
# strictly forbidden unless prior written permission is obtained from
# SiMa.ai.  Access to the source code contained herein is hereby forbidden
# to anyone except current SiMa.ai employees, managers or contractors who
# have executed Confidentiality and Non-disclosure agreements explicitly
# covering such access.
#
# The copyright notice above does not evidence any actual or intended
# publication or disclosure  of  this source code, which includes information
# that is confidential and/or proprietary, and is a trade secret, of SiMa.ai.
#
# ANY REPRODUCTION, MODIFICATION, DISTRIBUTION, PUBLIC PERFORMANCE, OR PUBLIC
# DISPLAY OF OR THROUGH USE OF THIS SOURCE CODE WITHOUT THE EXPRESS WRITTEN
# CONSENT OF SiMa.ai IS STRICTLY PROHIBITED, AND IN VIOLATION OF APPLICABLE
# LAWS AND INTERNATIONAL TREATIES. THE RECEIPT OR POSSESSION OF THIS SOURCE
# CODE AND/OR RELATED INFORMATION DOES NOT CONVEY OR IMPLY ANY RIGHTS TO
# REPRODUCE, DISCLOSE OR DISTRIBUTE ITS CONTENTS, OR TO MANUFACTURE, USE, OR
# SELL ANYTHING THAT IT  MAY DESCRIBE, IN WHOLE OR IN PART.
#
#**************************************************************************

import cv2
import shutil
import argparse
import numpy as np
import onnxruntime as ort

from pathlib import Path
from typing import Union

# Constants
ROOT_PATH = Path(__file__).parent.resolve()
INPUT_IMAGES_PATH = str(Path(ROOT_PATH, "../../data/"))
IMAGENET_LABELS_PATH = str(Path(ROOT_PATH, "../../data/imagenet_labels.txt"))
MODEL_PATH = str(Path(ROOT_PATH, "../../models/resnet50_model.onnx"))
SAVE_OUTPUTS_FOR_DEBUG = True
OUTPUT_DEBUG_PATH = ROOT_PATH/"debug"

####################
# Helper Functions #
####################

def recreate_directory(dir_path: Union[Path, str]):
    # Ensure dir_path is a Path object if given a string
    dir_path = Path(dir_path)

    # Check if the directory exists
    if dir_path.exists():
        # Remove the directory and all its contents
        shutil.rmtree(dir_path)
        print(f"Deleted directory: {dir_path}")

    # Recreate the directory
    dir_path.mkdir(parents=True, exist_ok=True)
    print(f"Recreated directory: {dir_path}")

###################
# Setup Functions #
###################

def setup(model_path: str, labels_path: str):
    # Load labels and ONNX model
    with open(labels_path, "r") as f:
        labels = [line.strip() for line in f.readlines()]

    # Load the ONNX model
    session = ort.InferenceSession(model_path)
    input_name = session.get_inputs()[0].name

    return labels, session, input_name

######################
# Pipeline Functions #
######################

def read_image(image_path: str, dump_output: bool = False):
    image = cv2.imread(str(image_path), cv2.IMREAD_UNCHANGED)
    rgb_image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)

    image_name = Path(image_path).stem
    if dump_output:
        debug_image_name = image_name + "_rgb.bin"
        rgb_image.tofile(str(OUTPUT_DEBUG_PATH/debug_image_name))

    return rgb_image, image_name


# Function to preprocess the image
def preprocess_image(image: np.ndarray, input_shape: tuple = (224, 224), scale_factor: tuple = 255.0,
                    dump_output_image_name: str = None, dump_output: bool = False):
    mean = [0.485, 0.456, 0.406]
    stddv = [0.229, 0.224, 0.225]

    resized_image = cv2.resize(image, input_shape)
    image_data = resized_image / scale_factor
    image_data = (image_data - mean) / stddv

    if dump_output:
        # Dump reference output before transposing because MLA uses NHWC format not NCHW
        assert dump_output_image_name is not None

        debug_image_path = OUTPUT_DEBUG_PATH/f"{dump_output_image_name}_preprocessed_rgb_nhwc_fp32.bin"
        image_data.tofile(debug_image_path)

    image_data = np.transpose(image_data.astype('float32'), (2, 0, 1))  # Change to (C, H, W)
    image_data = np.expand_dims(image_data, axis=0)  # Add batch dimension

    return image_data

# Function to post-process the output
def postprocess_output(output: np.ndarray, labels: dict,
                    dump_output_image_name: str = None, dump_output: bool = False):
    probabilities = output[0][0]
    max_idx = np.argmax(probabilities)

    if dump_output:
        # Dump reference output before transposing because MLA uses NHWC format not NCHW
        assert dump_output_image_name is not None

        debug_image_path = OUTPUT_DEBUG_PATH/f"{dump_output_image_name}_inference_output_probabilities.bin"
        probabilities.tofile(debug_image_path)

    return labels[max_idx], probabilities[max_idx]

def main(images_path: str, model_path: str, labels_path: str = IMAGENET_LABELS_PATH):
    """ Runs an application pipeline composed of the following 5 stages:

        Load image -> Preprocess image -> Run ResNet50 (inference) -> Postprocess outputs -> Display results
    """

    # Initialization #
    labels, inference_session, input_name = setup(model_path=model_path, labels_path=labels_path)
    image_paths = list(Path(images_path).glob("*.jpg"))

    # 5 Stage application pipeline #
    print(f"\nProcessing all images in: {images_path}\n")

    for idx, image_path in enumerate(image_paths):
        print(f"Processing image [{idx}] --> ", end="")

        # Load image #
        image, image_name = read_image(str(image_path), dump_output=SAVE_OUTPUTS_FOR_DEBUG)

        # Preprocess #
        input_data = preprocess_image(image, dump_output_image_name=image_name, dump_output=SAVE_OUTPUTS_FOR_DEBUG)

        # Run inference #
        output = inference_session.run(None, {input_name: input_data})

        # Postprocess #
        class_name, confidence = postprocess_output(output, labels, dump_output_image_name=image_name, dump_output=SAVE_OUTPUTS_FOR_DEBUG)

        # Display results #
        print(f"Class: {class_name} Confidence: {confidence * 100.0:.2f}%")


if __name__ == "__main__":
    parser = argparse.ArgumentParser(description="Classify an image using an ONNX ResNet50 model.")

    parser.add_argument("-i", "--images_path", type=str, required=False, default=INPUT_IMAGES_PATH, help="Path to the image file.")
    parser.add_argument("-m", "--model_path", type=str, required=False,  default=MODEL_PATH, help="Path to the image file.")
    args = parser.parse_args()

    # Setup output directory
    if SAVE_OUTPUTS_FOR_DEBUG:
        recreate_directory(OUTPUT_DEBUG_PATH)

    main(images_path=args.images_path, model_path=args.model_path)

Note

Notice how the 5 stages of our application pipeline are described in lines 150-163

# Load image #
image, image_name = read_image(str(image_path), dump_output=SAVE_OUTPUTS_FOR_DEBUG)

# Preprocess #
input_data = preprocess_image(image, dump_output_image_name=image_name, dump_output=SAVE_OUTPUTS_FOR_DEBUG)

# Run inference #
output = inference_session.run(None, {input_name: input_data})

# Postprocess #
class_name, confidence = postprocess_output(output, labels, dump_output_image_name=image_name, dump_output=SAVE_OUTPUTS_FOR_DEBUG)

# Display results #
print(f"Class: {class_name} Confidence: {confidence * 100.0:.2f}%")

Note

Notice how key functions have the parameter dump_output: bool. This dumps binary files with the results of each function. These outputs will become our reference outputs as we build our GStreamer pipeline to validate if we have the correct expected values out of each GStreamer plugin.

To run the script:

sima-user@docker-image-id$ python src/x86_reference_app/resnet50_reference_classification_app.py
    Recreated directory: /home/docker/sima-cli/building_applications_mlsoc_1.4.0/src/x86_reference_app/debug

    Processing all images in: /home/docker/sima-cli/building_applications_mlsoc_1.4.0/src/x86_reference_app/../../data

    Processing image [0] --> Class: 207: 'golden retriever', Confidence: 98.38%

The resulting directory structure should look like:

sima-user@docker-image-id$ tree -L 4
    .
    ├── data
    │   ├── golden_retriever_207.jpg
    │   ├── imagenet_labels.txt
    │   └── openimages_v7_images_and_labels.pkl
    ├── models
    │   ├── download_resnet50.py
    │   └── resnet50_model.onnx
    ├── README.md
    ├── requirements.txt
    └── src
        ├── modelsdk_quantize_model
        │   └── resnet50_quant.py
        └── x86_reference_app
            ├── debug
            │   ├── golden_retriever_207_inference_output_probabilities.bin
            │   ├── golden_retriever_207_preprocessed_rgb_nhwc_fp32.bin
            │   └── golden_retriever_207_rgb.bin
            └── resnet50_reference_classification_app.py

Note

Notice how a newly created debug directory appears that has intermediate dumps of data we will use later in this guide.

Run on the MLSoC CPU (Arm A65)

To run the same resnet50_reference_classification_app.py application on the A65 of the MLSoC, simply:

Replicate the same directory structure in a directory on the MLSoC devkit
- scp the application, data and models directories to the board
ssh into the board
pip3 install onnxruntime if you have not already done so
Run the application

Conclusion and next steps

In this section, we:

Developed a reference application that will serve as a guideline of what we need to develop in the MLSoC.
The reference application could be executed on a host machine or on the MSLoC.
The application also dumps each of the intermediate outputs which will be used during development to ensure we have the correct functional output out of our GStreamer plugins.

Next we will take our first step into MLSoC development: taking our trained model, and using the ModelSDK to compile and optimize it for running on the MLA.