sima_utils.transformer.model.baseο
Classesο
File generation mode. |
|
Precision used when generating files. |
|
Model evaluation mode. |
|
Base implementation for visual-language model file generation. |
Module Contentsο
- class sima_utils.transformer.model.base.FileGenModeο
File generation mode.
- ONNXο
- MODEL_SDK_QUANTIZEο
- MODEL_SDK_COMPILEο
- DEVKITο
- ALLο
- class sima_utils.transformer.model.base.FileGenPrecisionο
Precision used when generating files.
- BF16 = 'bf16'ο
- A_BF16_W_INT8 = 'a_bf16_w_int8'ο
- A_BF16_W_INT4 = 'a_bf16_w_int4'ο
- class sima_utils.transformer.model.base.EvalModeο
Model evaluation mode.
- HF = 'hf'ο
- ONNX = 'onnx'ο
- SDK = 'sdk'ο
- class sima_utils.transformer.model.base.BaseModelο
Base implementation for visual-language model file generation.
- cfgο
Configuration of the model.
- model_nameο
Name of the model. This will be used to determine the generated filesβ names.
- onnx_pathο
Path to store the ONNX files.
- sima_pathο
Path to store the SiMa-specific files.
- hf_modelο
LocalHuggingFaceModel object for obtaining the parameters to generate ONNX files.
- onnx_file_nameο
File name of the generated ONNX file.
- weight_prefixο
The prefix of weight tensor names in the source model.
- model_name: strο
- onnx_path: pathlib.Path = 'onnx_files'ο
- sima_path: pathlib.Path = 'sima_files'ο
- hf_model: sima_utils.transformer.hf_transformer.LocalHuggingFaceModel | None = Noneο
- vlm_helper: sima_utils.transformer.vlm_config.VlmHelper | None = Noneο
- gen_files(gen_mode: FileGenMode, *, precision: FileGenPrecision = FileGenPrecision.BF16, log_level: int = logging.NOTSET, resume: bool = False)ο
Generates files based on the provided file generation mode.
- Parameters:
gen_mode β File generation mode.
precision β The precision to be used for Model SDK quantization mode.
log_level β Logging level.
resume β Set to generate only when the file cannot be found.
- run_model(eval_mode: EvalMode, ifms: list[numpy.ndarray]) list[numpy.ndarray] ο
Runs the model based on the evaluation mode.
- property vision_model_name: strο
- property language_model_name: strο
- property onnx_file_name: pathlib.Pathο
- property sima_model_sdk_path: pathlib.Pathο
Path to the generated quantized Model SDK files.
- property sima_mpk_path: pathlib.Pathο
Path to the generated MPK files.
- property sdk_file_name: pathlib.Pathο
Path to the generated quantized Model SDK file.
- property mpk_file_name: pathlib.Pathο
Path to the generated quantized Model SDK file.
- property sima_devkit_path: pathlib.Pathο
Path to the generated files for DEVKIT.
- get_gen_file_name(gen_mode: FileGenMode) pathlib.Path ο
- gen_onnx_files()ο
Generates ONNX files.
- gen_model_sdk_files(precision: FileGenPrecision, log_level: int)ο
Generates quantized Model SDK files.
- Parameters:
precision β Precision used for quantization.
log_level β Logging level.
- gen_mpk_files(log_level: int) afe.apis.model.Model ο
Generates MPK files.
- Parameters:
log_level β Logging level.
- gen_devkit_files(resume: bool = False)ο
Generates files for devkit.
- check_hf_param(name: str) bool ο
Checks if a parameter tensor exists in the LocalHuggingFaceModel object.
- Parameters:
name β Full name of the parameter.
- Returns:
True if the parameter tensor exists.
- get_hf_param(name: str) numpy.ndarray ο
Gets the parameter tensor from the LocalHuggingFaceModel object.
- Parameters:
name β Full name of the parameter.
- Returns:
The parameter tensor in numpy array.
- create_onnx_builder()ο
Creates onnx builder.
- gen_files_from_model_list(model_list: list[tuple[BaseModel, FileGenPrecision]], gen_mode: FileGenMode, num_processes: int, log_level: int, resume: bool)ο