vllm.lora.layers.replicated_linear ¶
ReplicatedLinearWithLoRA ¶
Bases: BaseLinearLayerWithLoRA
Source code in vllm/lora/layers/replicated_linear.py
__init__ ¶
__init__(base_layer: ReplicatedLinear) -> None
can_replace_layer classmethod
¶
can_replace_layer(
source_layer: Module,
lora_config: LoRAConfig,
packed_modules_list: list,
model_config: PretrainedConfig | None,
) -> bool
forward ¶
Forward of ReplicatedLinearWithLoRA
Parameters:
Name | Type | Description | Default |
---|---|---|---|
input_ | Tensor | Tensor whose last dimension is | required |
Returns:
Type | Description |
---|---|
Tensor | tuple[Tensor, Tensor | None] |
|
Tensor | tuple[Tensor, Tensor | None] |
|
Source code in vllm/lora/layers/replicated_linear.py
slice_lora_a ¶
Slice lora a if splitting for tensor parallelism.
slice_lora_b ¶
Slice lora b if splitting with tensor parallelism.