
Conversation

@kuafou commented Oct 29, 2025

Description

This PR fixes a compatibility issue with recent vLLM changes that now require model classes to implement a get_input_embeddings() method.
Without this method, vLLM fails its interface validation during model registration, breaking TPU model integration.

To address this, we add a dummy get_input_embeddings() implementation to the vLLM-compatible wrapper class in
tpu_inference/models/common/model_loader.py.
Like the existing dummy forward() method, it exists only to satisfy vLLM's interface checks and raises
NotImplementedError if invoked, so importing or introspecting the wrapper never triggers JAX model initialization.
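A minimal sketch of this approach (the method names follow the PR description, but the wrapper-construction details are illustrative and not the actual model_loader.py code):

```python
def unimplemented_forward(self, *args, **kwargs):
    # Dummy forward(): present only so the wrapper satisfies vLLM's
    # interface checks; real execution goes through the JAX model.
    raise NotImplementedError("forward() is handled by the JAX model, not this wrapper.")

def unimplemented_get_input_embeddings(self, *args, **kwargs):
    # Dummy get_input_embeddings(): satisfies vLLM's interface validation
    # without ever initializing the JAX model.
    raise NotImplementedError("get_input_embeddings() is not supported on this wrapper.")

def make_wrapper_class(name: str) -> type:
    # The real loader builds the wrapper type dynamically; type() stands in
    # here for whatever machinery model_loader.py actually uses.
    return type(name, (), {
        "forward": unimplemented_forward,
        "get_input_embeddings": unimplemented_get_input_embeddings,
    })
```

Because the methods raise rather than compute, constructing and inspecting the wrapper stays side-effect free.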

Why this change is needed

  • vLLM recently introduced a strict requirement for model classes to define get_input_embeddings()
    (link).
  • TPU inference uses a dummy PyTorch wrapper to register JAX models into vLLM’s registry.
  • Since this wrapper lacked get_input_embeddings, vLLM failed model registration checks.

Implementation details

  • Added unimplemented_get_input_embeddings() dummy function to the wrapper type.
  • Registered it inside the dynamically created wrapper class.
  • Added a test tests/test_vllm_wrapper.py to ensure:
    • The wrapper defines get_input_embeddings().
    • The method raises NotImplementedError.
    • The class passes is_vllm_model() validation.
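The three assertions above could be sketched roughly as follows; is_vllm_model here is a stand-in attribute check (an assumption, since the real validation lives inside vLLM), and the test body is illustrative rather than the actual tests/test_vllm_wrapper.py:

```python
def is_vllm_model(model_cls: type) -> bool:
    # Stand-in for vLLM's interface validation; assumed to verify that the
    # required methods (forward, get_input_embeddings) are present and callable.
    return all(callable(getattr(model_cls, m, None))
               for m in ("forward", "get_input_embeddings"))

def check_wrapper_interface(wrapper_cls: type) -> None:
    # 1. The wrapper defines get_input_embeddings().
    assert hasattr(wrapper_cls, "get_input_embeddings")
    # 2. The method raises NotImplementedError if invoked.
    try:
        wrapper_cls().get_input_embeddings()
    except NotImplementedError:
        pass
    else:
        raise AssertionError("expected NotImplementedError")
    # 3. The class passes the (stand-in) is_vllm_model() validation.
    assert is_vllm_model(wrapper_cls)
```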

Related Issue

Fixes: #951

Tests

pytest -v tests/models/common/test_model_loader.py 

Checklist

Before submitting this PR, please make sure:

  • I have performed a self-review of my code.
  • I have added necessary comments to my code, particularly in hard-to-understand areas.
  • I have made or will make corresponding changes to any relevant documentation.

@kuafou kuafou force-pushed the qi/fix-vllm-model-wrapper branch from 79d6f2d to 80ab177 Compare October 29, 2025 23:22
@py4 py4 requested a review from karan October 30, 2025 19:30
Signed-off-by: Allen Jia <kuafou@gmail.com>
@kuafou kuafou force-pushed the qi/fix-vllm-model-wrapper branch from 80ab177 to 59e1a8e Compare November 5, 2025 18:23
Signed-off-by: Allen Jia <kuafou@gmail.com>
@kuafou kuafou force-pushed the qi/fix-vllm-model-wrapper branch from 59e1a8e to bc8fe75 Compare November 5, 2025 18:29
@karan (Collaborator) left a comment:

Thank you for the PR.


Development

Successfully merging this pull request may close these issues.

[Bug]: vllm model interface now requires get_input_embeddings

2 participants