.. _Get FPS:

Get Model FPS
#############


The Palette software enables users to determine the frames-per-second (FPS) KPIs of a machine learning (ML) model using a mode called the **Accelerator Mode**.
Specifically, a Python script, ``network_eval.py`` is provided to generate KPIs for a given model.

To use the Accelerator Mode, first you will compile the ML model to get a single ``.lm`` file within our compiled ``tar.gz`` file.
To obtain this ``.lm`` file, you will use one of our compilation scripts from our `GitHub page <https://github.com/SiMa-ai/models?tab=readme-ov-file>`_.

Executing Model Files using Network Evaluation
----------------------------------------------


.. button-link:: https://docs.sima.ai/pkg_downloads/SDK1.4.0/get_fps.zip
    :color: primary
    :shadow:

    Download the Example

#. Unzip to a local directory and move the unzipped folder ``get_fps`` under your ``workspace`` directory:

    .. code-block:: console

        sima-user@sima-user-machine:~$ cd ~/Downloads
        sima-user@sima-user-machine:~/Downloads$ unzip get_fps.zip
        sima-user@sima-user-machine:~/Downloads$ mv get_fps ~/workspace/

#. Go to the Network Eval directory ``/home/docker/sima-cli/get_fps/`` within the SDK container.

    .. code-block:: console

        sima-user@docker-image-id:/home# cd /home/docker/sima-cli/get_fps
        sima-user@docker-image-id:/home/docker/sima-cli/get_fps$ ls
            apis  network_eval  utils
        sima-user@docker-image-id:/home/docker/sima-cli/get_fps# chown <YOUR_USERNAME> ../get_fps
        sima-user@docker-image-id:/home/docker/sima-cli/get_fps# sudo apt-get update
        sima-user@docker-image-id:/home/docker/sima-cli/get_fps# sudo apt-get install sshpass

#. Install packages defined in the ``requirements.txt`` file: 

    .. code-block:: console

        sima-user@docker-image-id:/home/docker/sima-cli/get_fps# pip3 install -r apis/requirements.txt

#. Git clone SiMa's GitHub and download the model ``resnet50-v1-7.onnx``, you can check all the links for each original model file in the ``README.md`` of the repo:

    .. code-block:: console

        sima-user@docker-image-id:/home/docker/sima-cli/get_fps# sudo chmod 777 ../get_fps/
        sima-user@docker-image-id:/home/docker/sima-cli/get_fps# git clone https://github.com/SiMa-ai/models.git
            Cloning into 'models'...
            remote: Enumerating objects: 988, done.
            remote: Counting objects: 100% (988/988), done.
            remote: Compressing objects: 100% (292/292), done.
            remote: Total 988 (delta 488), reused 971 (delta 474), pack-reused 0
            Receiving objects: 100% (988/988), 23.58 MiB | 4.53 MiB/s, done.
            Resolving deltas: 100% (488/488), done.
        sima-user@docker-image-id:/home/docker/sima-cli/get_fps# ls
            apis  models  network_eval  utils

#. Download the original ``.onnx`` file using ``wget``:

    .. code-block:: console

        sima-user@docker-image-id:/home/docker/sima-cli/get_fps# wget -O models/resnet50-v1-7_fp32_224_224.onnx https://github.com/onnx/models/raw/main/validated/vision/classification/resnet/model/resnet50-v1-7.onnx
            /main/validated/vision/classification/resnet/model/resnet50-v1-7.onnx
            --2024-03-15 11:33:09--  https://github.com/onnx/models/raw/main/validated/vision/classification/resnet/model/resnet50-v1-7.onnx
            Resolving github.com (github.com)... 140.82.121.3, ::ffff:140.82.121.3
            Connecting to github.com (github.com)|140.82.121.3|:443... connected.
            HTTP request sent, awaiting response... 302 Found
            Location: https://media.githubusercontent.com/media/onnx/models/main/validated/vision/classification/resnet/model/resnet50-v1-7.onnx [following]
            --2024-03-15 11:33:10--  https://media.githubusercontent.com/media/onnx/models/main/validated/vision/classification/resnet/model/resnet50-v1-7.onnx
            Resolving media.githubusercontent.com (media.githubusercontent.com)... 185.199.110.133, 185.199.109.133, 185.199.108.133, ...
            Connecting to media.githubusercontent.com (media.githubusercontent.com)|185.199.110.133|:443... connected.
            HTTP request sent, awaiting response... 200 OK
            Length: 102583340 (98M) [application/octet-stream]
            Saving to: ‘models/resnet50-v1-7_fp32_224_224.onnx’

            models/resnet50-v1-7_fp32_224_22 100%[==========================================================>]  97.83M  8.35MB/s    in 12s     

            2024-03-15 11:33:25 (8.39 MB/s) - ‘models/resnet50-v1-7_fp32_224_224.onnx’ saved [102583340/102583340]
        
#. Compile the downloaded model using the script ``models/scripts/resnet50-v1-7_fp32_224_224/resnet50-v1-7_fp32_224_224.py``:

    .. code-block:: console

        sima-user@docker-image-id:/home/docker/sima-cli/get_fps/models# python3 models/scripts/resnet50-v1-7_fp32_224_224/resnet50-v1-7_fp32_224_224.py 
            Model SDK version: 1.4.0
            {'model_path': 'models/resnet50-v1-7_fp32_224_224.onnx', 'shape_dict': {'data': [1, 3, 224, 224]}, 'dtype_dict': {'data': <ScalarType.float32: 6>}}
            2024-03-15 11:23:13,725 - autotvm - WARNING - One or more operators have not been tuned. Please tune your model for better performance. Use DEBUG logging level to see more details.
            Running calibration ...DONE
            2024-03-15 11:23:56,736 - afe.ir.defines - WARNING - In node MLA_0/conv2d_add_relu_0, Precision of weights was reduced to avoid numeric saturation. Saturation was detected in the bias term.
            ...
            2024-03-15 11:23:56,874 - afe.ir.defines - WARNING - In node MLA_0/conv2d_add_35, Precision of weights was reduced to avoid numeric saturation. Saturation was detected in the zero point.
            Running quantization ...DONE
            Max absolute error between outputs of loaded net and quantized net = 0.6192820072174072

#. Untar the ``result/resnet50-v1-7_fp32_224_224_asym_True_per_ch_True/mpk/resnet50-v1-7_fp32_224_224_mpk.tar.gz`` file:

    .. code-block:: console

        sima-user@docker-image-id:/home/docker/sima-cli/get_fps# tar zxvf result/resnet50-v1-7_fp32_224_224_asym_True_per_ch_True/mpk/resnet50-v1-7_fp32_224_224_mpk.tar.gz

#. See the various command-line arguments to ``network_eval.py`` using the ``-h`` option:

    .. code-block:: console

        sima-user@docker-image-id:/home/docker/sima-cli/get_fps# python network_eval/network_eval.py -h
            usage: network_eval.py [-h] --model_file_path MODEL_FILE_PATH --mpk_json_path MPK_JSON_PATH --dv_host DV_HOST [--dv_port DV_PORT]
                       [--dv_user DV_USER] --image_size IMAGE_SIZE [IMAGE_SIZE ...] [-v] [--bypass_tunnel]
                       [--layer_stats_path LAYER_STATS_PATH] [--max_frames MAX_FRAMES] [--batch_size BATCH_SIZE]

            Emit FPS KPIs for networks that run on the MLA and/or A65

            options:
            -h, --help            show this help message and exit
            --model_file_path MODEL_FILE_PATH
                                    Path to .lm or .tar.gz file
            --mpk_json_path MPK_JSON_PATH
                                    Path to MPK JSON file
            --dv_host DV_HOST     DevKit IP Address / FQDN
            --dv_port DV_PORT     DevKit port on which the mla_rt_service is running
            --dv_user DV_USER     DevKit ssh username
            --image_size IMAGE_SIZE [IMAGE_SIZE ...]
                                    RGB image size specified as: H W C
            -v, --verbose         increase output verbosity
            --bypass_tunnel       set to bypass ssh tunnel
            --layer_stats_path LAYER_STATS_PATH
                                    Path to layer stats YAML file
            --max_frames MAX_FRAMES
                                    Max number of frames to run
            --batch_size BATCH_SIZE
                                    Batch size - default 1

#. Run the ``network_eval.py`` script, providing the paths of the ``.lm`` and ``.json`` files:

    .. code-block:: console

        sima-user@docker-image-id:/home/docker/sima-cli/get_fps# python network_eval/network_eval.py \
        --model_file_path resnet50-v1-7_fp32_224_224_stage1_mla.lm \
        --mpk_json_path resnet50-v1-7_fp32_224_224_mpk.json \
        --dv_host <devkit IP address> --dv_port 8000 --image_size 224 224 3 -v
            Running model in MLA-only mode
            Creating the Forwarding from host
            The authenticity of host '10.42.0.240 (10.42.0.240)' can't be established.
            ECDSA key fingerprint is SHA256:zfEgZ7NPK5uE3WkrPjx9VsoVnsGvIHoav/prFVMLuSQ.
            Are you sure you want to continue connecting (yes/no/[fingerprint])? yes
            sima@10.42.0.240's password: 
            Copying the model files to DevKit
            sima@10.42.0.240's password: 
            FPS = 855
            FPS = 874
            FPS = 883
            FPS = 880

Once started, the ``network_eval.py`` script runs forever. You will need to hit ``Ctrl+C`` to interrupt the execution.


Verifying Runtime Statistics
****************************

#. By specifying the ``--layer_stats_path`` option and passing the ``*_stats.yaml`` file corresponding to the ``.lm`` file, we can request the ``network_eval.py`` script to instead return a new ``output.yaml`` file in which the total execution time of the layer is computed.

    .. code-block:: console

        sima-user@docker-image-id:/home/docker/sima-cli/get_fps# python network_eval/network_eval.py \
        --model_file_path resnet50-v1-7_fp32_224_224_stage1_mla.lm \
        --mpk_json_path resnet50-v1-7_fp32_224_224_mpk.json \
        --dv_host <devkit IP address> --image_size 224 224 3 \
        --layer_stats_path  resnet50-v1-7_fp32_224_224_stage1_mla_stats.yaml

#. After running this command, the new yaml file is displayed in the current directory with the suffix ``*_output.yaml``. In this example, the file is named ``resnet50-v1-7_fp32_224_224_stage1_mla_stats_output.yaml`` and the record for layer ``MLA_0/conv2d_add_relu_7`` now contains a single value that represents the total execution time in the layer shown below:

    .. code-block:: console

        sima-user@docker-image-id:/home/docker/sima-cli/get_fps# ls
            apis
            models
            network_eval
            network_eval.log
            resnet50-v1-7_fp32_224_224_mpk.json
            resnet50-v1-7_fp32_224_224_stage1_mla.lm
            resnet50-v1-7_fp32_224_224_stage1_mla_stats_output.yaml
            resnet50-v1-7_fp32_224_224_stage1_mla_stats.yaml
            result
            utils

    .. code-block:: yaml

        ...
        8:
        "name: ": MLA_0/conv2d_add_relu_7
        "run_time: ": 5.01us
        ...

#. The total execution time of the ``MLA_0/conv2d_add_relu_7`` layer is 5.01 microseconds. The 'name' of the layer corresponds to the name that is viewable in Netron when opening the ``.sima.json`` file that was generated using the ``Model.save()`` API. The 'run_time' represents the amount of time the layer took to execute on the MLA in microseconds. This time accounts for compute and memory cycles for this layer only.
#. You may now examine the Netron graph (``*_sima.json`` file generated using the ``Model.save()`` API) and ``*.output.yaml`` file side-by-side in order to view the execution stats of each layer.