sima_utils.transformer.utils

Attributes

mla_row_size

mla_num_tiles

mla_max_num_rows_per_tile

mla_max_num_rows

Functions

ceil_div(→ int)

ceil_div_row(→ int)

round_up_to(→ int)

round_up_to_row(→ int)

calc_freq_real_imag(→ tuple[numpy.ndarray, numpy.ndarray])

Module Contents

sima_utils.transformer.utils.mla_row_size: int = 16
sima_utils.transformer.utils.mla_num_tiles: int = 100
sima_utils.transformer.utils.mla_max_num_rows_per_tile: int = 4096
sima_utils.transformer.utils.mla_max_num_rows: int = 409600
sima_utils.transformer.utils.ceil_div(x: int, y: int) int
sima_utils.transformer.utils.ceil_div_row(x: int) int
sima_utils.transformer.utils.round_up_to(x: int, y: int) int
sima_utils.transformer.utils.round_up_to_row(x: int) int
sima_utils.transformer.utils.calc_freq_real_imag(max_num_tokens: int, rope_type: str, theta: float, head_dim: int, scaling_cfg: dict, idx_base: int = 0) tuple[numpy.ndarray, numpy.ndarray]