Logo

Getting Started

  • System Architectures
  • SiMa.ai DevKits
    • Modalix
    • Modalix Early Access
    • MLSoC
  • Set Up DevKit
    • Standalone Mode
      • Configure Serial Connection
      • Configure DevKit Network
      • Configure NVMe Storage
    • PCIe Mode
    • Firmware Update
      • Update with sima-cli
      • Update with Palette
      • Update with Boot Image
      • Update with Net Boot
  • Hello SiMa!
    • Application Pipeline
      • A Typical ML Application
      • ML Preprocessing and Postprocessing
      • ResNet50 Classification Application
      • Mapping the ResNet50 Application to the MLSoC
      • Tesselation and Quantization
    • Run Demos
    • Compile Your Model
    • Build Your First Pipeline
      • Build with Edgematic
      • Build with Palette
      • PCIe Pipeline Adaptation
      • Run Multiple (Concurrent) Pipelines

ML Development

  • Palette
    • Software Overview
      • Palette Components
        • ModelSDK
        • Palette CLI / MPK
        • GStreamer Plugin Zoo
        • MLA Compiler
        • Debugging and Logging
        • Arm Cross-Compiler
      • Development Flow
    • System Requirements
      • CPU, Memory, and Storage
      • Operating Systems
      • Palette SDK / MLSoC Firmware
      • Docker Engine
      • Python
      • Ports
      • Firewall Settings
      • SSH Capabilities
    • Software Installation
      • Installation Requirements
      • Downloading Palette Software
      • Installing on Linux
        • Dependencies Checklist
        • Installing the Software
        • Running/Launching Software
        • Verifying the Installation
      • Installing on Windows using Hyper-V
      • Uninstalling the Palette Software
    • Get Model FPS
      • Setup Tool
      • Download Model
      • Compile Model
      • Execute
      • Verify
    • Get Model Accuracy
      • Architecture
      • Overview
      • Prepare Dataset
      • PyTorch Inference
      • Compile Model
      • SiMa Inference
      • Example
    • MPK Tool
      • Device Connectivity using mpk
      • mpk device
        • More Options
      • mpk create
      • mpk project create
      • mpk deploy
      • mpk launch
      • mpk list
      • mpk kill
      • mpk remove
      • mpk firmware-upgrade
      • Performance Estimator
    • MLPerf Benchmark
      • Accessing the MLPerf Files
      • Running MLPerf Tests
        • Batch1 Performance Mode Test
        • Batch1 Accuracy Mode Test
        • Batch8 Performance Mode Test
        • Batch8 Accuracy Mode Test
        • Batch14 Performance Mode Test
        • Batch14 Accuracy Mode Test
  • Edgematic 🔥
    • Introduction to Edgematic
      • Getting Started
        • Layout
        • Starter Examples
    • Terminology
    • Edgematic Dashboard
    • Edgematic Features
      • Machine Learning Models
      • GStreamer Pipelines
      • GStreamer Plugins
      • Applications
      • KPI Measurement
    • Building Applications
      • Image File Inferencing
      • RTSP Video Stream Inferencing
    • Application Catalog
      • Developer Community Applications
      • SiMa-Provided Applications
    • Model Catalog
      • Image Classification Models
      • Object Detection Models
      • Pose Estimation Models
      • Depth Estimation Models
      • Anomaly Detection & Feature Extraction
    • Plugin Catalog
      • simaaidecoder
      • simaaiencoder
      • simaaiedetmemory
      • nms
        • Features
        • Usage
        • Configuration
      • nmsyolov5
        • Features
        • Usage
        • Configuration
      • overlay_v2
        • Description
        • Configuration
      • regressboxes
        • Features
        • Configuration
      • simaai_color_converter
        • Configuration
      • simaai_muxer
        • Configuration
      • simaai_resize
        • Features
        • Configuration
      • simaaibufferconcatenator
        • Configuration
      • simaaimetaparser
        • Direct Mode (Primary)
        • Header Parsing Mode (Fallback)
        • Configuration
      • simaaisrc
        • Parameters
        • Configuration
      • transform_anchors
        • Features
        • Usage
        • Configuration
      • yolov8boxcomposer
        • Features
        • Usage
        • Configuration
  • Develop with GStreamer
    • Key Components
    • Plugins
    • Development Workflow
    • Key Features
      • CAPS Negotiation
      • GStreamer Application
        • Features
        • Components
        • Usage
  • PePPi
    • Installation
    • Typical Workflow
    • Project Structure
      • project.yaml
      • main.py
      • Pre-processing and Normalization
    • Build and Deploy
    • Examples
    • Tutorials
      • Ethernet Tutorial
        • Purpose
        • Configuration Overview
        • Project Configuration
        • Script Behavior
        • Model Details
        • Pipeline Architecture
        • Build and Deploy
        • Performance Considerations
      • PCIe Tutorial
        • Purpose
        • Configuration Overview
        • Project Configuration
        • Script Behavior
        • Model Details
        • Host Pipeline Setup
        • Build and Deploy
      • Filesrc Tutorial
        • Purpose
        • Configuration Overview
        • Project Configuration
        • Script Behavior
        • Model Details
        • Pipeline Architecture
        • Build and Deploy
    • Logging
    • Models with BoxDecoder Support
  • GenAI Model Compilation
    • Introduction
    • Supported Models
    • Limitations
    • System Requirements
    • Prerequisites
    • Sample Code
  • ModelSDK
    • Model Compatibility Guide
      • Terminology
      • Supported Operations
        • Custom Implementation
      • Unsupported Operators
      • A65 Supported Operators
      • MLA Supported Operators
      • Individual Operators
    • ONNX Operators Support List
    • Post Training Quantization
      • load_model
      • quantize
        • with_custom_quantization_configs()
        • LoadedNet.convert_to_sima_quantization()
        • Model.analyze_quantization_error()
        • Model.execute_in_accelerator_mode()
      • compile
        • Compile with Default Options
        • Compiling for Batch Sizes > 1
        • Printing tar.gz Contents
        • Per-Layer Runtime Statistics
    • Quantization Aware Training
      • Pre-requisites
      • QAT Process
      • QAT Workflow
      • Model Training Process
        • Scaffolding
        • Finalization
        • Export
        • Training Loop
        • PyTorch Integration
        • User API
    • Graph Surgery
      • What is Graph Surgery
      • Why Do You Need to Perform Graph Surgery
      • Graph Surgery Using SiMa Tools
        • Analyzing a Model
        • ONNX Model
      • Example Case Studies
        • UFLD: Simple Operator Replacement
        • YoloV8: Output to be Modified
        • Post Processing
        • ViT-s14
        • Pre-Processing
        • Post-Processing

App Development

  • PCIe Mode App
    • Build Host App With C++ API
  • Standalone Mode App
    • Cross Compilation Environment
    • Build Custom C/C++ App
    • Build Third-Party Component
    • Run Pipeline without MPK Tools
    • Understand the MPK Package

Tools

  • sima-cli
    • Installation
    • Authentication
    • View Commands
    • View Command Help
    • Uninstall
    • Check Installed Version
    • Update sima-cli
  • OptiView
    • Installation
    • Connect to DevKit
    • Media Source Manager
    • Code Editor and Runner
    • Web Video Viewer

References

  • PCIe Mode Host APIs
    • C++ API
      • simaai::SimaMlsocApi
        • setLogVerbosity()
        • enumerateDeviceGuids()
        • openDevice()
        • load()
        • load()
        • load()
        • unload()
        • isAppActive()
        • isAppActive()
        • isDeviceOpen()
        • runSynchronous()
        • runAsynchronous()
        • closeDevice()
        • resetDevice()
        • log()
        • getLastErrorCode()
        • setMaxOutstandingRequests()
        • getMaxOutstandingRequests()
        • setRequestTimeout()
        • getRequestTimeout()
        • resetQueue()
        • SimaMlsocApi()
        • operator=()
        • SimaMlsocApi()
        • operator=()
        • getInstance()
      • simaai::SiMaDevice
        • slotNumber
        • deviceType
        • guid
      • simaai::SiMaModel
      • simaai::SiMaTensor
        • SiMaTensor()
        • reset()
        • SiMaTensor()
        • reset()
        • getPtr()
        • getShape()
        • getSizeInBytes()
        • getSeqId()
        • getAppId()
        • getOpCode()
        • setSeqId()
        • setAppId()
        • setOpCode()
    • Co-Processing API
      • ApplicationManager
        • ~ApplicationManager()
        • getVersion()
        • connect()
        • installApplication()
        • deployApplication()
        • deviceFirmwareUpgrade()
        • launchApplication()
        • stopApplication()
        • stopApplication()
        • listApplication()
        • removeApplication()
        • disconnect()
      • DeviceConnection
        • ~DeviceConnection()
        • discoverDevices()
        • discoverEthernetDevices()
        • listDevices()
        • connect()
        • disconnect()
        • upgrade()
        • reset()
        • reboot()
      • DeviceConnectionFactory
        • createDeviceConnection()
      • DeviceFactory
        • createEthernetDevice()
        • createEthernetDevice()
        • createEthernetDevice()
      • Device
        • Device()
        • ~Device()
        • getModel()
        • getGuid()
        • getConnectionMode()
        • isConnected()
        • getCurrentConnectionStatusStr()
        • isLocal()
        • setConnected()
        • setLocal()
        • setGuid()
        • associateCommunicationChannel()
        • disAssociateCommunicationChannel()
        • getCommunicationChannel()
        • getTarget()
        • getUserName()
        • getPassword()
        • getCurrentConnectionStatus()
        • setUserName()
        • setPassword()
        • toString()
      • BaseSiMaErrorCode
        • getCode()
        • ToString()
        • getType()
        • isError()
        • clone()
      • SiMaCoProcessingAPIErrorCode
        • ErrorCode
        • SiMaCoProcessingAPIErrorCode()
        • SiMaCoProcessingAPIErrorCode()
        • ~SiMaCoProcessingAPIErrorCode()
        • getCode()
        • getType()
        • isError()
        • ToString()
        • clone()
      • SiMaLog
        • SiMaLog()
        • operator=()
        • log()
        • getLogFilePath()
        • getInstance()
    • Python API
      • sima_host
        • Attributes
        • Classes
        • Functions
        • Module Contents
  • ModelSDK APIs
    • afe.apis.model
      • Classes
      • Module Contents
        • Model
    • afe.apis.loaded_net
      • Attributes
      • Classes
      • Functions
      • Module Contents
        • GroundTruth
        • LoadedNet
        • load_model()
    • afe.load
      • Submodules
        • afe.load.importers
        • afe.load.loader
    • afe.apis.defines
      • Attributes
      • Classes
      • Functions
      • Module Contents
        • InputValues
        • gen1_target
        • gen2_target
        • ExceptionFuncType
        • ColorSpaceStandard
        • BT_COLOR_COEFF
        • YUV2RGB_FULL_RANGE_CONSTANTS
        • ColorConversion
        • ChromaSampling
        • ResizeMethod
        • ResizeDepositLocation
        • CalibrationMethod
        • MinMaxMethod
        • HistogramMSEMethod
        • MovingAverageMinMaxMethod
        • HistogramEntropyMethod
        • HistogramPercentileMethod
        • default_calibration()
        • QuantizationScheme
        • quantization_scheme()
        • bfloat16_scheme()
        • QuantizationParams
        • default_quantization
        • int16_quantization
    • sima_qat.qat_api
      • Attributes
      • Classes
      • Functions
      • Module Contents
        • device_modifier_ops
        • sima_prepare_qat_model()
        • sima_finalize_qat_model()
        • sima_export_onnx()
        • SimaQatWrapper
        • check_graph_nodes()
        • replace_dropout()
        • replace_batchnorm()
    • sima_utils.transformer
      • Submodules
        • sima_utils.transformer.default_llm_config
        • sima_utils.transformer.default_vision_config
        • sima_utils.transformer.devkit
        • sima_utils.transformer.gguf_conversion
        • sima_utils.transformer.hf_transformer
        • sima_utils.transformer.llm_tokenizer
        • sima_utils.transformer.model
        • sima_utils.transformer.onnx_builder
        • sima_utils.transformer.preproc
        • sima_utils.transformer.prompt_template
        • sima_utils.transformer.tokenizer
        • sima_utils.transformer.utils
        • sima_utils.transformer.vision_preprocessor
        • sima_utils.transformer.vlm_config
        • sima_utils.transformer.whisper_config
      • Classes
      • Package Contents
        • VlmArchType
        • VlmConfig
        • VlmHelper
  • PePPi APIs
    • sima
      • Submodules
        • sima.gst_utils
        • sima.mini_pipeline_base
        • sima.operations
        • sima.video_reader
      • Classes
      • Functions
      • Package Contents
        • resize()
        • cvtColor()
        • SimaBoxRender
        • set_log_level()
        • MLSoCSession
        • VideoReader
        • VideoWriter
  • GStreamer Plugins
    • pciesink
      • Properties
      • Usage
      • Installation
    • pciesrc
      • Properties
      • Usage
      • Installation
      • Pipeline Integration
    • simaaiboxdecode
      • Properties
      • Usage
      • Configuration
      • Configuration file example
      • Installation
      • Integration into a GStreamer Pipeline
    • simaaidecoder
      • Properties
      • Usage
      • Installation
      • Integration into a GStreamer Pipeline
    • simaaiencoder
      • Properties
      • Usage
      • Installation
    • simaaimetaparser
      • Properties
      • Usage
      • Installation
      • Integration into GStreamer Pipeline
    • simaaiprocesscvu
      • Properties
      • Usage
      • Installation
    • simaaiprocessmla
      • Properties
      • Usage
      • Config File Example
      • Installation
      • Integration
    • simaaisigmoid
      • Properties
      • Usage
      • Installation
      • Integration
    • simaaisrc
      • Properties
      • Usage
      • Installation
      • Integrating into a GStreamer Pipeline
  • CVU Graphs Library
    • CVU Graphs
      • SIMA_COLORCONVERT
        • Description
        • Graph Info
        • Example Config
        • CVU Configuration Application
      • SIMA_RESIZE
        • Description
        • Graph Info
        • Example Config
        • CVU Configuration Application
      • SIMA_TESSELATE
        • Description
        • Graph Info
        • Example Config
        • CVU Configuration Application
      • SIMA_GENERIC_PREPROC
        • Description
        • Graph Info
        • Example Config
        • CVU Configuration Application
      • SIMA_DETESS_DEQUANT
        • Description
        • Graph Info
        • Example Config
        • CVU Configuration Application
  • Technical Notes
    • Install LLiMa on Modalix Early Access Kit
      • Prerequisites
      • Updating Firmware
      • Installing via SD Card
      • Installing via NFS
      • Running the Application
    • Setup NFS
  • Software Downloads
    • Software Updates
    • ML Models
    • File Examples
    • AWS SageMaker Blog
  • Reference Documents
    • Quick Start Guide
    • Product Briefs
    • Market Briefs
    • Device Datasheets
    • Hardware Reference Manuals

Release Notes

  • Palette SDK version 1.7
    • New Features and Enhancements
    • Performance Validations
    • Known Issues
  • Palette SDK version 1.6
    • New Features and Enhancements
    • Performance Validations
    • Known Issues
  • Palette SDK Version 1.5
    • New Features & Enhancements
    • Performance Validations
    • Bug Fixes
    • Known Issues
Documentation
  • ModelSDK APIs
  • sima_utils.transformer
  • sima_utils.transformer.tokenizer
  • View page source

sima_utils.transformer.tokenizer

Submodules

  • sima_utils.transformer.tokenizer.sentencepiece_model_pb2
  • sima_utils.transformer.tokenizer.whisper_tokenizer
Previous Next

© Copyright 2025, SiMa Technologies.

Built with Sphinx using a theme provided by Read the Docs.
Other Versions v: latest
Tags
latest
v1.4.0
v1.6.0
v1.5.0