vllm.model_executor.models.transformers.pooling
Transformers backend mixins for pooling models.
EmbeddingMixin
Bases: VllmModelForPooling
Source code in vllm/model_executor/models/transformers/pooling.py
pooler (instance attribute)
pooler = DispatchPooler(
    {
        "token_embed": for_token_embed(pooler_config),
        "embed": for_embed(pooler_config),
    }
)
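The mapping above associates each pooling task name with its own pooler. The following is a minimal, dependency-free sketch of that dispatch-by-task pattern. It is not vLLM's implementation: `ToyDispatchPooler`, `toy_token_embed`, and `toy_embed` are hypothetical names, and mean pooling for `"embed"` is an assumption (the pooling actually used is determined by `pooler_config`).

```python
from typing import Callable, Dict, List

Vector = List[float]
HiddenStates = List[Vector]  # one hidden-state vector per token


def toy_token_embed(hidden: HiddenStates) -> HiddenStates:
    """Return one embedding per token, with no reduction over the sequence."""
    return [list(v) for v in hidden]


def toy_embed(hidden: HiddenStates) -> Vector:
    """Reduce the sequence to a single vector (mean pooling is an assumption)."""
    n = len(hidden)
    return [sum(column) / n for column in zip(*hidden)]


class ToyDispatchPooler:
    """Hypothetical stand-in that routes a request to the pooler for its task."""

    def __init__(self, poolers_by_task: Dict[str, Callable]):
        self.poolers_by_task = poolers_by_task

    def __call__(self, hidden: HiddenStates, task: str):
        if task not in self.poolers_by_task:
            raise ValueError(f"Unsupported pooling task: {task!r}")
        return self.poolers_by_task[task](hidden)


pooler = ToyDispatchPooler(
    {
        "token_embed": toy_token_embed,
        "embed": toy_embed,
    }
)

hidden_states = [[1.0, 2.0], [3.0, 4.0]]  # two tokens, hidden size 2
per_token = pooler(hidden_states, "token_embed")
pooled = pooler(hidden_states, "embed")
```

In this sketch, `"token_embed"` keeps one vector per token, while `"embed"` yields a single vector for the whole sequence.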
__init__
__init__(*, vllm_config: VllmConfig, prefix: str = '')
SequenceClassificationMixin
Bases: SupportsCrossEncoding, VllmModelForPooling
pooler (instance attribute)
pooler = DispatchPooler(
    {
        "token_classify": for_token_classify(
            pooler_config, classifier=classifier
        ),
        "classify": ClassifierPooler(
            pooling=CLSPool(),
            classifier=classifier,
            act_fn="classify",
        ),
        "score": ClassifierPooler(
            pooling=CLSPool(),
            classifier=classifier,
            act_fn="score",
        ),
    }
)
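The `"classify"` and `"score"` entries above share the same pooling step and the same classifier and differ only in `act_fn`. The sketch below illustrates that composition (pool, then classify, then activate) in plain Python. It is not vLLM's `ClassifierPooler`: `ToyClassifierPooler`, `cls_pool`, and `toy_classifier` are hypothetical names, CLS pooling is modeled as taking the first token's hidden state, and the choice of softmax for `"classify"` and sigmoid for `"score"` is purely an assumption for illustration.

```python
import math
from typing import Callable, List

Vector = List[float]


def cls_pool(hidden: List[Vector]) -> Vector:
    """Take the first token's hidden state (CLS-style pooling)."""
    return hidden[0]


def toy_classifier(v: Vector) -> Vector:
    """Hypothetical 2-label linear head with fixed identity weights."""
    weights = [[1.0, 0.0], [0.0, 1.0]]
    return [sum(w * x for w, x in zip(row, v)) for row in weights]


def softmax(logits: Vector) -> Vector:
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(logits: Vector) -> Vector:
    """Element-wise logistic sigmoid."""
    return [1.0 / (1.0 + math.exp(-x)) for x in logits]


class ToyClassifierPooler:
    """Hypothetical stand-in: pooling -> classifier -> activation."""

    def __init__(self, pooling: Callable, classifier: Callable, act_fn: Callable):
        self.pooling = pooling
        self.classifier = classifier
        self.act_fn = act_fn

    def __call__(self, hidden: List[Vector]) -> Vector:
        return self.act_fn(self.classifier(self.pooling(hidden)))


# Same pooling and classifier; only the activation differs (assumed choices).
classify = ToyClassifierPooler(cls_pool, toy_classifier, softmax)
score = ToyClassifierPooler(cls_pool, toy_classifier, sigmoid)

hidden_states = [[0.0, 0.0], [5.0, -5.0]]  # first row plays the CLS token
classify_out = classify(hidden_states)
score_out = score(hidden_states)
```

Only the first token's vector reaches the classifier here; the second token is ignored by the CLS-style pooling step.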
__init__
__init__(*, vllm_config: VllmConfig, prefix: str = '')