vllm.model_executor.layers.rotary_embedding.yarn_scaling_rope
 
  Bases: RotaryEmbedding
RotaryEmbedding extended with the YaRN method.
Credits to Peng et al. github.com/jquesnelle/yarn
Source code in vllm/model_executor/layers/rotary_embedding/yarn_scaling_rope.py
  
 __init__(
    head_size: int,
    rotary_dim: int,
    max_position_embeddings: int,
    base: float,
    is_neox_style: bool,
    scaling_factor: float,
    dtype: dtype,
    *,
    extrapolation_factor: float = 1,
    attn_factor: float = 1,
    beta_fast: int = 32,
    beta_slow: int = 1,
) -> None
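The constructor arguments can be read against the YaRN formulas from the credited reference implementation. The following is a minimal pure-Python sketch, not the code in this module, of how `scaling_factor`, `extrapolation_factor`, `beta_fast`, and `beta_slow` could combine into one blended inverse frequency per rotary pair. The helper names (`find_correction_dim`, `find_correction_range`, `yarn_inv_freq`) are hypothetical and chosen for illustration; the actual module operates on tensors.

```python
import math


def find_correction_dim(num_rotations, dim, base, max_position_embeddings):
    # Rotary dimension index (fractional) whose wavelength completes
    # `num_rotations` full turns over the original context length.
    return (dim * math.log(max_position_embeddings / (num_rotations * 2 * math.pi))) / (
        2 * math.log(base)
    )


def find_correction_range(beta_fast, beta_slow, dim, base, max_position_embeddings):
    # Indices between which the blend ramps from extrapolation to interpolation.
    low = math.floor(find_correction_dim(beta_fast, dim, base, max_position_embeddings))
    high = math.ceil(find_correction_dim(beta_slow, dim, base, max_position_embeddings))
    return max(low, 0), min(high, dim - 1)


def yarn_inv_freq(rotary_dim, base, max_position_embeddings, scaling_factor,
                  extrapolation_factor=1.0, beta_fast=32, beta_slow=1):
    # Assumption: follows the YaRN reference formulas, written with plain lists.
    low, high = find_correction_range(
        beta_fast, beta_slow, rotary_dim, base, max_position_embeddings
    )
    if low == high:
        high += 0.001  # avoid division by zero in the ramp
    inv_freq = []
    for i in range(rotary_dim // 2):
        pos_freq = base ** (2 * i / rotary_dim)
        extrapolated = 1.0 / pos_freq                      # unscaled RoPE frequency
        interpolated = 1.0 / (scaling_factor * pos_freq)   # position-interpolated frequency
        ramp = min(max((i - low) / (high - low), 0.0), 1.0)
        mask = (1.0 - ramp) * extrapolation_factor
        inv_freq.append(interpolated * (1.0 - mask) + extrapolated * mask)
    return inv_freq


# Illustrative values only: a 128-dim rotary head stretched from 4096 positions by 4x.
inv_freq = yarn_inv_freq(128, 10000.0, 4096, 4.0)
```

In this sketch the high-frequency pairs (low index) keep their original frequencies, the low-frequency pairs (high index) are divided by `scaling_factor`, and the pairs in between are linearly blended.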
  
 _compute_cos_sin_cache() -> Tensor
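As a rough illustration of what a YaRN cos/sin cache contains, the sketch below builds one in pure Python. It assumes the layout used by the YaRN reference formulas: one row per position up to `max_position_embeddings * scaling_factor`, with the cosine half followed by the sine half, both multiplied by a magnitude scale derived from `scaling_factor` and `attn_factor`. The function names `yarn_get_mscale` and `compute_cos_sin_cache` and the list-based representation are assumptions for illustration; the real method returns a `Tensor`.

```python
import math


def yarn_get_mscale(scale=1.0):
    # Attention magnitude correction from the YaRN paper: 0.1 * ln(s) + 1 for s > 1.
    if scale <= 1:
        return 1.0
    return 0.1 * math.log(scale) + 1.0


def compute_cos_sin_cache(inv_freq, max_position_embeddings, scaling_factor,
                          attn_factor=1.0):
    # Assumption: inv_freq is a list with one inverse frequency per rotary pair.
    mscale = yarn_get_mscale(scaling_factor) * attn_factor
    num_positions = int(max_position_embeddings * scaling_factor)
    cache = []
    for t in range(num_positions):
        angles = [t * f for f in inv_freq]
        cos_row = [math.cos(a) * mscale for a in angles]
        sin_row = [math.sin(a) * mscale for a in angles]
        cache.append(cos_row + sin_row)  # row layout: [cos..., sin...]
    return cache


# Illustrative values only: two rotary pairs, 8 original positions, 2x scaling.
cache = compute_cos_sin_cache([1.0, 0.5], 8, 2.0)
```

Each row has `rotary_dim` entries (here 4), and the number of rows grows with `scaling_factor`, which is what lets lookups cover the extended context.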