
Conversation

anmarques
Contributor

@anmarques commented Nov 4, 2024

This PR adds support for benchmarking multimodal models.

It mostly extends the existing infrastructure to support requests containing images. For emulated requests it downloads images from an illustrated edition of Pride and Prejudice and randomly selects from them.

The load_images logic is currently limited to downloading from URLs. It should be extended to support HF datasets and local files in the future.
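For context, here is a minimal sketch of that flow, assuming a requests/Pillow-based implementation. `load_images` is the helper named above, but its signature, return type, and the `sample_image` helper are illustrative rather than the exact code in this PR:

```python
import random
from io import BytesIO

import requests
from PIL import Image


def load_images(image_urls: list[str]) -> list[Image.Image]:
    """Download images from a list of URLs (the only source supported so far)."""
    images = []
    for url in image_urls:
        response = requests.get(url, timeout=30)
        response.raise_for_status()
        images.append(Image.open(BytesIO(response.content)))
    return images


def sample_image(images: list[Image.Image]) -> Image.Image:
    """Pick one image at random to attach to an emulated request."""
    return random.choice(images)
```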

I tested by running the following command:

guidellm --data="prompt_tokens=128,generated_tokens=128,images=1" --data-type emulated --model microsoft/Phi-3.5-vision-instruct --target "http://localhost:8000/v1" --max-seconds 20

On 2x A5000 GPUs I had to set max_concurrency=4 to run this command due to memory limitations.
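For reference, the kind of OpenAI-compatible multimodal chat request that a run like this exercises against the vLLM endpoint can be reproduced by hand with the official openai client. This is a hedged sketch, not code from this PR: the prompt text, the local image path, and the use of a base64 data URL are assumptions for illustration.

```python
import base64

from openai import OpenAI

# Same server and model as in the guidellm command above.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

# Placeholder local image; guidellm would supply one of the downloaded illustrations.
with open("illustration.jpg", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode()

response = client.chat.completions.create(
    model="microsoft/Phi-3.5-vision-instruct",
    max_tokens=128,
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this image."},
                {
                    "type": "image_url",
                    "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"},
                },
            ],
        }
    ],
)
print(response.choices[0].message.content)
```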

@anmarques anmarques requested a review from markurtz November 5, 2024 01:11
@rgreenberg1
Collaborator

This is for image models only, right? Text-to-image only?

Labels
multi-modal Support for benchmarking new multi-modal models
3 participants