Commit e324df8

Change optimizations for Gemma3ForCausalLM vLLM to accuracy
1 parent 66ea9d7 commit e324df8

1 file changed: +1 -1 lines changed

models/tt_transformers/tt/generator_vllm.py

Lines changed: 1 addition & 1 deletion
@@ -391,7 +391,7 @@ def initialize_vllm_model(
             max_seq_len=max_seq_len,
             n_layers=n_layers,
             dtype=ttnn.bfloat16,
-            optimizations=DecodersPrecision.performance,
+            optimizations=DecodersPrecision.accuracy,
         )
         return cls(tt_model, model_args, mesh_device)
 
