Commit 3a41732

Handle Mistral vLLM initialize
1 parent 930f46a commit 3a41732

1 file changed: +1 -1 lines changed

models/tt_transformers/tt/generator_vllm.py

Lines changed: 1 addition & 1 deletion
@@ -235,7 +235,7 @@ def __init__(self, *args, **kwargs):
         self.max_gen_len = self.model_args[0].max_seq_len - 1
 
     @classmethod
-    def initialize_vllm_model(cls, hf_config, mesh_device, max_batch_size, tt_data_parallel=1):
+    def initialize_vllm_model(cls, hf_config, mesh_device, max_batch_size, max_seq_len=32768, tt_data_parallel=1):
         max_seq_len = 1024 * 128
 
         submesh_devices = create_submeshes(mesh_device, tt_data_parallel)
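
Two effects of the signature change can be checked in isolation. The sketch below uses hypothetical stand-in functions (`initialize_before`, `initialize_after`) that copy only the two signatures and the `max_seq_len = 1024 * 128` line visible in the hunk; everything else about the real method is assumed away. It suggests that, as far as the hunk shows, the new `max_seq_len` argument is accepted but then overwritten, and that a caller passing `tt_data_parallel` as the fourth positional argument would now bind that value to `max_seq_len` instead.

```python
# Hypothetical stand-ins for the old and new signatures in the diff.
# Bodies contain only the assignment visible in the hunk and return the
# values so they can be inspected; the real method does much more.

def initialize_before(hf_config, mesh_device, max_batch_size, tt_data_parallel=1):
    max_seq_len = 1024 * 128
    return max_seq_len, tt_data_parallel


def initialize_after(hf_config, mesh_device, max_batch_size, max_seq_len=32768, tt_data_parallel=1):
    max_seq_len = 1024 * 128  # as in the hunk: replaces whatever was passed in
    return max_seq_len, tt_data_parallel


# The old signature rejects a max_seq_len keyword argument outright.
try:
    initialize_before(None, None, 32, max_seq_len=4096)
    old_rejects_keyword = False
except TypeError:
    old_rejects_keyword = True

# The new signature accepts it, but the local assignment overrides the value.
accepted = initialize_after(None, None, 32, max_seq_len=4096)

# A fourth positional argument used to mean tt_data_parallel; it now lands
# in max_seq_len, so tt_data_parallel silently keeps its default.
old_positional = initialize_before(None, None, 32, 2)
new_positional = initialize_after(None, None, 32, 2)

print(old_rejects_keyword, accepted, old_positional, new_positional)
```

If the intent is for the caller's value to take effect, the hard-coded assignment would need to go in a follow-up change; passing `tt_data_parallel` by keyword avoids the positional shift either way.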

0 commit comments
