vllm.model_executor.models.transformers.legacy
Transformers backend mixin for legacy models.
LegacyMixin
Source code in vllm/model_executor/models/transformers/legacy.py
hf_to_vllm_mapper class-attribute instance-attribute
hf_to_vllm_mapper = WeightsMapper(
    orig_to_new_prefix={
        "roberta": "model",
        "bert": "model",
        "": "model.",
        "model.model.": "model.",
        "model.score": "classifier",
        "model.classifier": "classifier",
    },
    orig_to_new_suffix={
        ".gamma": ".weight",
        ".beta": ".bias",
    },
)
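To make the effect of these rules concrete, here is a minimal sketch of how such a prefix and suffix table could rename checkpoint keys. It is a standalone stand-in, not vLLM's `WeightsMapper`: the helper `remap_name` and the sample weight names are hypothetical, and the sketch assumes each rule is tried in dictionary order and applied to the result of the previous one, which is why the empty prefix `""` followed by `"model.model."` ends up adding `model.` only when it was missing.

```python
# Hypothetical illustration of the renaming rules; not vLLM's WeightsMapper.
# Assumption: rules are applied sequentially, in insertion order.
PREFIX_RULES = {
    "roberta": "model",          # BERT-like checkpoints
    "bert": "model",
    "": "model.",                # matches every name, so always prepends "model."
    "model.model.": "model.",    # collapse the prefix if it is now doubled
    "model.score": "classifier",
    "model.classifier": "classifier",
}
SUFFIX_RULES = {
    ".gamma": ".weight",         # legacy norm parameter names
    ".beta": ".bias",
}


def remap_name(name: str) -> str:
    """Rename one checkpoint key using the prefix rules, then the suffix rules."""
    for old, new in PREFIX_RULES.items():
        if name.startswith(old):
            name = new + name[len(old):]
    for old, new in SUFFIX_RULES.items():
        if name.endswith(old):
            name = name[: len(name) - len(old)] + new
    return name


if __name__ == "__main__":
    for key in (
        "bert.encoder.layer.0.output.LayerNorm.gamma",
        "roberta.embeddings.LayerNorm.beta",
        "model.layers.0.mlp.weight",
        "score.weight",
    ):
        print(f"{key} -> {remap_name(key)}")
```

Under that ordering assumption, backbone weights end up under `model.` and scoring or classification heads end up under `classifier`, next to `model` rather than inside it.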
__init__
__init__(*, vllm_config: VllmConfig, prefix: str = '')
forward
forward(
    input_ids: Tensor | None,
    positions: Tensor,
    intermediate_tensors: IntermediateTensors | None = None,
    inputs_embeds: Tensor | None = None,
) -> Tensor | IntermediateTensors