sima_utils.transformer.model.language_pre_model
Classes
Base implementation for the pre cache model of the language model. |
Module Contents
- class sima_utils.transformer.model.language_pre_model.LanguagePreModel
Base implementation for the pre cache model of the language model.
- num_tokens
Number of tokens. Set to a value greater than 1 to consume multiple input tokens in one model.
- layer_idx
Transformer layer index.
- num_tokens: int
- layer_idx: int
- gen_onnx_files()