sima_utils.transformer.model.language_pre_model

Classes

LanguagePreModel

Base implementation for the pre cache model of the language model.

Module Contents

class sima_utils.transformer.model.language_pre_model.LanguagePreModel

Base implementation for the pre cache model of the language model.

num_tokens

Number of tokens. Set to a value greater than 1 to consume multiple input tokens in one model.

layer_idx

Transformer layer index.

num_tokens: int
layer_idx: int
gen_onnx_files()